Thanks for your efforts! Could you share the quantization scripts and inference code?
#2 opened about 2 months ago by listen2you
Can this FP8 model be deployed on a 4090? How fast is it?
#1 opened about 2 months ago by yoolv