vLLM launch parameters
2
#3 opened 20 days ago by Clutchkin
Why not FP8 with static and per-tensor quantization?
1
1
#2 opened 27 days ago by wanzhenchn
Thank you for uploading this.
6
#1 opened 27 days ago by getfit
