During git clone: encounters 2 file(s) that may not have been copied correctly on Windows
#21 opened about 7 hours ago
by
guynich
Is this model can not use function calling?
#20 opened about 21 hours ago
by
Fatzard

test_for_ds_r1_qwq_8b
#19 opened 7 days ago
by
JunZhangf
A quick test comparing R1-0528-Qwen3-8B with Phi-4
#17 opened 24 days ago
by
gptlocalhost

ciudades turisticas
#15 opened 27 days ago
by
lolisponce

Model collapse after SFT
3
#14 opened about 1 month ago
by
Banjiuyufen

Vocab missing tool-related strings in chat template, poor performance with tools
4
#13 opened about 1 month ago
by
mattjcly
Can you please release how you post-train qwen3 on deepseek?
2
#12 opened about 1 month ago
by
ZeroWw
Tried it, but not good as expected.
3
#11 opened about 1 month ago
by
kk3dmax
/no_think 标签不能用了吗
4
#10 opened about 1 month ago
by
loong
Any plans for a Qwen3-32B model?
👍
13
7
#9 opened about 1 month ago
by
wanghf
BTW For programmer, `Gemma` series are best to help you write comments, docstrings, and documents.
🔥
1
1
#8 opened about 1 month ago
by
DOFOFFICIAL

DeepSeek-R1-Lite
🔥
❤️
20
7
#6 opened about 1 month ago
by
Dampfinchen
generation_config.json is missing
👀
👍
5
#5 opened about 1 month ago
by
Doctor-Chad-PhD

Model broken
👍
3
9
#4 opened about 1 month ago
by
sm54
Any plans on gemma series? ;-;
❤️
4
4
#2 opened about 1 month ago
by
Nakdesu

Any plans on 30B-A3B model?
🔥
30
7
#1 opened about 1 month ago
by
xxx777xxxASD
