The collection for the Project "Simple Reinforcement Learning for Reasoning"
HKUST NLP Group
university
Collections: 6
Models: 35
hkust-nlp/preselect-fasttext-classifier • Text Classification • 43 downloads • 4 likes
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero • 277 downloads • 2 likes
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL • 292 downloads • 2 likes
hkust-nlp/qwen2.5-7b-coder_codeio_stage1 • 31 downloads
hkust-nlp/qwen2.5-7b-coder_codeio • 35 downloads
hkust-nlp/qwen2.5-7b-coder_codeio_pp_stage1 • 51 downloads
hkust-nlp/qwen2.5-7b-coder_codeio_pp • 82 downloads • 5 likes
hkust-nlp/llama3.1-8b_codeio_stage1 • 29 downloads
hkust-nlp/llama3.1-8b_codeio • 32 downloads
hkust-nlp/llama3.1-8b_codeio_pp_stage1 • 34 downloads
Datasets: 21
hkust-nlp/PreSelect-100B • 54.5M rows • 821 downloads • 6 likes
hkust-nlp/CodeIO-PyEdu-Reasoning • 724 downloads • 45 likes
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw • 114 downloads
hkust-nlp/SynCSE-partial-NLI • 263k rows • 96 downloads • 2 likes
hkust-nlp/SynCSE-scratch-NLI • 276k rows • 110 downloads • 2 likes
hkust-nlp/gsm8k-fix • 7.47k rows • 119 downloads • 2 likes
hkust-nlp/dart-math-uniform • 591k rows • 112 downloads • 9 likes
hkust-nlp/vrt-baseline • 591k rows • 72 downloads • 1 like
hkust-nlp/dart-math-hard • 585k rows • 137 downloads • 13 likes
hkust-nlp/dart-math-pool-gsm8k-query-info • 7.47k rows • 75 downloads • 2 likes