arxiv:2412.21187
Zhiwei He
zwhe99
AI & ML interests
Natural Language Processing
Recent Activity
updated
a model
2 days ago
zwhe99/DeepSeek-R1-Distill-Qwen-1.5B
published
a model
2 days ago
zwhe99/DeepSeek-R1-Distill-Qwen-1.5B
upvoted
a
paper
18 days ago
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Organizations
None yet
Papers
3
spaces
1
models
7
datasets
13
zwhe99/aime90
Viewer
•
Updated
•
90
•
75
•
1
zwhe99/gsm8k
Viewer
•
Updated
•
8.79k
•
53
zwhe99/mathpile-text
Viewer
•
Updated
•
469k
•
134
zwhe99/mp-textbooks
Viewer
•
Updated
•
3.98k
•
52
zwhe99/MATH-DIFFIC
Viewer
•
Updated
•
17.5k
•
60
zwhe99/aime24
Viewer
•
Updated
•
30
•
190
•
2
zwhe99/MATH
Viewer
•
Updated
•
17.5k
•
494
•
1
zwhe99/amc23
Viewer
•
Updated
•
40
•
175
•
1
zwhe99/commonsense_170k
Viewer
•
Updated
•
170k
•
373
•
2
zwhe99/agent-sci-general
Viewer
•
Updated
•
24.5k
•
32