ghostplant
ghostplant
AI & ML interests
None yet
Recent Activity
new activity
3 days ago
deepseek-ai/DeepSeek-R1-0528:刚部署满血deepseek r1 0528版本,推理性能提升这么多嘛?不是架构没变嘛?
new activity
6 days ago
deepseek-ai/DeepSeek-R1-0528:How to run 0528version on GPU which don't support FP8
new activity
7 days ago
deepseek-ai/DeepSeek-R1-0528:这个问题大家的输出是什么?
Organizations
None yet
ghostplant's activity
刚部署满血deepseek r1 0528版本,推理性能提升这么多嘛?不是架构没变嘛?
12
#75 opened 7 days ago
by
jakyer
How to run 0528version on GPU which don't support FP8
4
#64 opened 7 days ago
by
Micdiane
这个问题大家的输出是什么?
6
#49 opened 8 days ago
by
ghostplant
Does R1 support long context (> 4K)?
#172 opened 3 months ago
by
ghostplant
can this model run on Hopper GPU
6
#8 opened 3 months ago
by
simonlindelta

can this model run on A800 ?
2
#10 opened 3 months ago
by
wang35
Why not use FP2 or IQ2 as kTransformers does?
#11 opened 3 months ago
by
ghostplant
Deploying production ready service with Unsloth GGUF quants on your AWS account. (4 x L40S)
🔥
2
8
#171 opened 3 months ago
by
samagra-tensorfuse
90+ tokens per second for MI300x8 using batch_size = 1
1
#166 opened 3 months ago
by
ghostplant
Q2_K_XL 好还是 Q4好呢
3
#34 opened 4 months ago
by
jializou

所以部署一个671B的模型 显存需要多少 有什么基准的硬件配置?
27
#118 opened 4 months ago
by
cena163

How much vram do you need?
8
#12 opened 4 months ago
by
hyun10
Is there a model removing non-shared MoE experts?
4
#17 opened 4 months ago
by
ghostplant
Please convert these models to GGUF format...
👍
2
5
#12 opened 4 months ago
by
Moodym