Cool way to fine tune that I wanted to share.
#9 opened 26 days ago
by
SuperbEmphasis
Model decent when running with 6 active experts
#8 opened 2 months ago
by
userzyzz
Another question: How did you train this model?
#7 opened 2 months ago
by
marcuscedricridia
This is the first Qwen3 A3B model that doesnt immediately start repeating itself
3
#2 opened 2 months ago
by
SuperbEmphasis
Feedback after some use
❤️
👍
3
5
#1 opened 3 months ago
by
AlecFoster