quantize deepseek-r1-0528 please

#14
by aabbccddwasd - opened

r1-0528 is a awesome model, and fp4 model can achieve 80 token/s using microsoft tutel. we really need a fp4 version for r1-0528

Sign up or log in to comment