Zheng Han
traphix
AI & ML interests
None yet
Recent Activity
new activity
2 days ago
Qwen/Qwen3-30B-A3B-FP8:Remove vLLM FP8 Limitation
new activity
5 days ago
RedHatAI/Qwen3-235B-A22B-FP8-dynamic:Error running on A100?
new activity
16 days ago
RedHatAI/Qwen3-235B-A22B-FP8-dynamic:Any plans for int8 quantized.w8a8?
Organizations
None yet
traphix's activity
Remove vLLM FP8 Limitation
9
#2 opened 26 days ago
by
simon-mo
Error running on A100?
2
#4 opened 16 days ago
by
traphix
Any plans for int8 quantized.w8a8?
#5 opened 16 days ago
by
traphix
How about int8 quantization?
#3 opened 16 days ago
by
traphix
How many RAM in GBs when quantizing Qwen3-235B-A22B?
#2 opened 19 days ago
by
traphix
Where are the safetensors?
1
1
#1 opened 19 days ago
by
traphix
What is the difference between Qwen/Qwen3-32B-FP8 and this quantized model?
4
#1 opened 22 days ago
by
traphix
Does vLLM 0.8.4 support this quantized model?
1
#1 opened about 1 month ago
by
traphix
This model beats Qwen Max!
1
7
#33 opened 3 months ago
by
MrDevolver

Why "MLA is not supported with awq_marlin quantization. Disabling MLA." with 4090 * 32 (4 nodes / vLLM 0.7.2)?
1
3
#14 opened 3 months ago
by
FightLLM
Are there any accuracy results compared to the original DeepSeek-R1?
2
#15 opened 3 months ago
by
traphix
Has anyone evaluated the performance of the AWQ version of the model on benchmarks?
4
#8 opened 3 months ago
by
liuqianchao
Skips the thinking process
11
#5 opened 4 months ago
by
muzizon
Deployment framework
27
#2 opened 4 months ago
by
xro7
vLLM support for A100
17
#2 opened 4 months ago
by
HuggingLianWang
Can it run on A100/A800 with vLLM?
3
#1 opened 10 months ago
by
Parkerlambert123