quantize deepseek-r1-0528 please
#14
by
aabbccddwasd
- opened
r1-0528 is a awesome model, and fp4 model can achieve 80 token/s using microsoft tutel. we really need a fp4 version for r1-0528
r1-0528 is a awesome model, and fp4 model can achieve 80 token/s using microsoft tutel. we really need a fp4 version for r1-0528