PowerInfer/SparseQwen2-7B

yixinsong committed 19 days ago

Commit 5efe432 · 1 Parent(s): db3f13d

minor

Files changed (1)

README.md +9 -0

@@ -63,6 +63,15 @@ outputs = model.generate(**inputs)
 response = tokenizer.decode(outputs[0])
 ```
+## Benchmarks
+The model has been evaluated on standard benchmarks to verify its performance:
+- **MMLU**: 69.19% (5-shot)
+- **IFEval**: 73.2% (Prompt Strict-Accuracy)
+These results demonstrate that the ReLU modification maintains competitive performance while achieving higher sparsity compared to the original model.
 ## Citation
 If you use this model in your research, please cite: