144gb vram and 256gb ram
#12
by fuutott - opened
I'm trying to work out the best way to split the model so I can load as much as possible onto an RTX 6000 (96 GB) and an Ada A6000 (48 GB), with 256 GB of 8-channel DDR5 for the rest.
Would `-ot ".ffn_(up)_exps.=CPU"` be the right approach?
Sorry for the delay - if it helps, I wrote up roughly how to offload other layers in https://docs.unsloth.ai/basics/qwen3-coder#improving-generation-speed
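For anyone else reading: the regex passed to `-ot` / `--override-tensor` is matched against the model's tensor names, so you can check what a pattern will catch before loading anything. A minimal sketch (the tensor names below are illustrative examples of the `blk.N.ffn_*_exps.weight` naming used by GGUF MoE models, not dumped from this model):

```python
import re

# Example tensor names in the style of a GGUF MoE model (illustrative).
tensor_names = [
    "blk.0.attn_q.weight",
    "blk.0.ffn_up_exps.weight",
    "blk.0.ffn_down_exps.weight",
    "blk.1.ffn_gate_exps.weight",
]

# The pattern from the question: only the up-projection expert tensors.
pattern = re.compile(r"\.ffn_(up)_exps\.")

cpu = [name for name in tensor_names if pattern.search(name)]
print(cpu)  # only the ffn_up_exps tensor matches
```

So that flag would pin only the up-projection experts to CPU; to push more onto system RAM you can widen the group, e.g. `".ffn_(up|down|gate)_exps.=CPU"`, and narrow it again once you know how much fits on the two GPUs.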