Jack
qwertyjack
Recent Activity
new activity
13 days ago
unsloth/Qwen3-14B-GGUF:How to disable <think> tag on stream mode request?
new activity
3 months ago
Qwen/QwQ-32B:issue of "think too much" ,how to?(chinese)
new activity
4 months ago
unsloth/DeepSeek-R1-GGUF:Where did the BF16 come from?
qwertyjack's activity
How to disable <think> tag on stream mode request?
1
#4 opened 26 days ago
by
celsowm

issue of "think too much" ,how to?(chinese)
2
#14 opened 3 months ago
by
fenglui
Where did the BF16 come from?
8
#10 opened 4 months ago
by
gshpychka
New research paper, R1 type reasoning models can be drastically improved in quality
2
#19 opened 4 months ago
by
krustik
Please make V3-lite
👍
❤️
46
4
#12 opened 5 months ago
by
rombodawg

The GPTQ-quantized int4 version of the new Mistrial-LargeV3 seems to need far more VRAM
2
#1 opened 6 months ago
by
YanchengQian

How to run the model OpenGVLab/InternVL2-40B-AWQ with vllm docker image?
2
#2 opened 10 months ago
by
andryevinnik
Question: what is the difference between cogvlm and glm4v?
🔥
➕
8
3
#1 opened 12 months ago
by
rangehow
Having trouble loading this with transformers
➕
3
5
#8 opened about 1 year ago
by
codelion

GPTQ plz
10
#3 opened about 1 year ago
by
Parkerlambert123
Would you plan to optimize ChatGLM2-6B? and when?
4
#47 opened almost 2 years ago
by
Zuyuan