Jack

qwertyjack

AI & ML interests

None yet

Recent Activity

New activity in openai/gpt-oss-20b 19 days ago

Open AI open source models not able to run ...

😔 3

#34 opened 20 days ago by

pskmattegunta

New activity in unsloth/Qwen3-14B-GGUF 4 months ago

How to disable <think> tag on stream mode request?

#4 opened 4 months ago by

celsowm

New activity in Qwen/QwQ-32B 6 months ago

Issue of "thinking too much": how to fix it? (Chinese)

#14 opened 6 months ago by

fenglui

New activity in unsloth/DeepSeek-R1-GGUF 7 months ago

Where did the BF16 come from?

#10 opened 7 months ago by

gshpychka

New research paper, R1 type reasoning models can be drastically improved in quality

#19 opened 7 months ago by

krustik

liked a model 7 months ago

winninghealth/WiNGPT-Babel

Text Generation • 2B • Updated Jun 9 • 1.2k • 40

New activity in deepseek-ai/DeepSeek-V3 8 months ago

Please make V3-lite

❤️ 👍 50

#12 opened 8 months ago by

rombodawg

updated a model 9 months ago

ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1

Text Generation • 7B • Updated Dec 18, 2024 • 9 • 51

New activity in TechxGenus/Mistral-Large-Instruct-2411-GPTQ 9 months ago

It seems the GPTQ-quantized int4 version of the new Mistrial-LargeV3 needs far more VRAM

#1 opened 9 months ago by

YanchengQian

upvoted an article 11 months ago

Article

Making LLMs lighter with AutoGPTQ and transformers

and 5 others •

Aug 23, 2023

• 58

upvoted an article 12 months ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

and 2 others •

Jul 18, 2024

• 60

New activity in OpenGVLab/InternVL2-40B-AWQ about 1 year ago

How to run the model OpenGVLab/InternVL2-40B-AWQ with vllm docker image?

#2 opened about 1 year ago by

andryevinnik

updated a collection about 1 year ago

magic

Collection

2 items • Updated Jun 25, 2024

liked 2 models about 1 year ago

wave-on-discord/gemini-nano-adapter

Updated Jun 24, 2024 • 15 • 24

wave-on-discord/gemini-nano

Updated Jun 24, 2024 • 104

New activity in zai-org/glm-4v-9b about 1 year ago

A quick question: what is the difference between cogvlm and glm4v?

➕ 🔥 8

#1 opened about 1 year ago by

rangehow

New activity in Qwen/CodeQwen1.5-7B-Chat over 1 year ago

Having trouble loading this with transformers

➕ 3

#8 opened over 1 year ago by

codelion

New activity in deepseek-ai/DeepSeek-V2-Chat over 1 year ago

GPTQ plz

#3 opened over 1 year ago by

Parkerlambert123