RedHatAI/DeepSeek-R1-0528-quantized.w4a16 · Is 4 x H20 96G sufficient to run this model?

#2

by milongwong - opened 3 days ago

We have limited resources and have two questions:

Is 4 x H20 96G sufficient to run this model?
Has anyone tried running it with SGLang to get better performance?

traphix

3 days ago (edited 3 days ago)

The quantized parameters total 346 GB, which is still very large.

4 x H20 96G can run it, but the context length will be very short.
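To make the arithmetic behind this concrete, here is a rough back-of-the-envelope sketch. It uses only the two numbers from this thread (346 GB of weights, 4 GPUs with 96 GB each). The function name `kv_headroom_gb` and the `usable_fraction` parameter are hypothetical, added for illustration; the sketch also ignores activations, runtime overhead, and any GB vs. GiB difference, so the real headroom is likely smaller than what it reports.

```python
def kv_headroom_gb(num_gpus, gpu_mem_gb, weights_gb, usable_fraction=1.0):
    """Memory left over after loading the weights, in GB.

    usable_fraction is a hypothetical knob for the share of each GPU that
    the serving engine is allowed to use; 1.0 means all of it.
    """
    total_usable = num_gpus * gpu_mem_gb * usable_fraction
    return total_usable - weights_gb


# Numbers from the thread: 4 x 96 GB GPUs, 346 GB of quantized weights.
headroom = kv_headroom_gb(4, 96, 346)
print(f"Total headroom: {headroom:.1f} GB")        # 384 - 346 = 38.0 GB
print(f"Per-GPU headroom: {headroom / 4:.1f} GB")  # 38 / 4 = 9.5 GB

# Assumed example: if only 95% of each GPU were usable, the margin shrinks.
print(f"Headroom at 95%: {kv_headroom_gb(4, 96, 346, 0.95):.1f} GB")
```

That leftover memory is all that remains for the KV cache and everything else, which is why the usable context length ends up so short on this setup.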
