PowerInfer/SparseQwen2-7B

yixinsong committed on Dec 25, 2024

Commit 4f2ba0d · 1 Parent(s): 084c040

minor

Files changed (1)

README.md +8 -0

@@ -69,6 +69,14 @@ The model has been evaluated on standard benchmarks to verify its performance:
 - **MMLU**: 69.19% (5-shot)
 - **IFEval**: 73.2% (Prompt Strict-Accuracy)
+- **Livebench**:
+  - Average: 32.1%
+  - Coding: 39.8%
+  - Data Analysis: 45.3%
+  - Instruction Following: 58.1%
+  - Language: 9.0%
+  - Math: 22.0%
+  - Reasoning: 18.7%
 These results demonstrate that the ReLU modification maintains competitive performance while achieving higher sparsity compared to the original model.
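
As a quick sanity check on the added Livebench figures, the sketch below recomputes the average from the six category scores. It assumes the reported Average is an unweighted mean of the categories, which the README does not state; the names `livebench_scores` and `mean_score` are hypothetical and exist only for this illustration.

```python
# Category scores copied from the lines added in this commit (percentages).
livebench_scores = {
    "Coding": 39.8,
    "Data Analysis": 45.3,
    "Instruction Following": 58.1,
    "Language": 9.0,
    "Math": 22.0,
    "Reasoning": 18.7,
}


def mean_score(scores):
    """Return the unweighted mean of the category scores (an assumption)."""
    return sum(scores.values()) / len(scores)


if __name__ == "__main__":
    # Compare the recomputed mean against the reported Average of 32.1%.
    print(f"Unweighted mean: {mean_score(livebench_scores):.2f}%")
```

Under that assumption, the recomputed mean falls within rounding distance of the reported 32.1%, so the category scores and the average are mutually consistent.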