yixinsong committed · Commit 4f2ba0d · 1 Parent(s): 084c040
Files changed (1)
  1. README.md +8 -0
README.md CHANGED
@@ -69,6 +69,14 @@ The model has been evaluated on standard benchmarks to verify its performance:
 
 - **MMLU**: 69.19% (5-shot)
 - **IFEval**: 73.2% (Prompt Strict-Accuracy)
+- **Livebench**:
+  - Average: 32.1%
+  - Coding: 39.8%
+  - Data Analysis: 45.3%
+  - Instruction Following: 58.1%
+  - Language: 9.0%
+  - Math: 22.0%
+  - Reasoning: 18.7%
 
 These results demonstrate that the ReLU modification maintains competitive performance while achieving higher sparsity compared to the original model.
 
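As a quick sanity check on the Livebench figures added in this commit, the sketch below recomputes the average from the six category scores. It assumes the reported average is an unweighted mean over the categories, which the README does not state; the variable names are illustrative only.

```python
# Livebench category scores as listed in the README diff (percent).
livebench_scores = {
    "Coding": 39.8,
    "Data Analysis": 45.3,
    "Instruction Following": 58.1,
    "Language": 9.0,
    "Math": 22.0,
    "Reasoning": 18.7,
}

# Assumption: the reported average is an unweighted mean over the six categories.
reported_average = 32.1
recomputed_average = sum(livebench_scores.values()) / len(livebench_scores)

# Gap between the recomputed and reported values, in percentage points.
difference = abs(recomputed_average - reported_average)
print(f"recomputed={recomputed_average:.2f} reported={reported_average} diff={difference:.2f}")
```

Under that assumption the recomputed mean lands within 0.1 percentage points of the reported 32.1%, a gap that would be consistent with the category scores having been rounded before publication.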