Add assistant mask support to Qwen3-4B
#9 opened 10 days ago
by
waleko

UnslothVisionDataCollator problem
2
#8 opened 28 days ago
by
orkungedik

Translation task in low-resource language can be done pretty well
#7 opened about 1 month ago
by
luweigen
Why are the new 4B and 8B models slower than the previous 7B-1M model??
3
#6 opened about 1 month ago
by
stev236
Collections of Qwen3 4B model Bad Cases User Reviews and Comments
😔
1
#5 opened about 1 month ago
by
DeepNLP
YaRN: is "performance" referring to quality or speed?
👀
1
#4 opened about 2 months ago
by
kmouratidis

Use the more common reverse filter in template
#3 opened about 2 months ago
by
tahayassine

【Evaluation】Best practice for evaluating Qwen3 !!
🔥
🚀
2
#2 opened about 2 months ago
by
wangxingjun778

Add languages tag
#1 opened about 2 months ago
by
de-francophones
