https://github.com/dhcode-cpp/X-R1
xiaodongguaAIGC
AI & ML interests
RLHF
Recent Activity
updated a collection 22 days ago: X-R1
Organizations
None yet
Collections
1
models
9

xiaodongguaAIGC/X-R1-3B-CN • Text Generation • Updated • 41 • 2
xiaodongguaAIGC/X-R1-3B • Text Generation • Updated • 198 • 1
xiaodongguaAIGC/X-R1-1.5B • Text Generation • Updated • 47
xiaodongguaAIGC/X-R1-0.5B • Text Generation • Updated • 99 • 1
xiaodongguaAIGC/xdg-math-step • Text Generation • Updated • 37 • 1
xiaodongguaAIGC/xdg-math-step-0118 • Text Generation • Updated • 17
xiaodongguaAIGC/xdg-math-prm-lora • Updated • 14
xiaodongguaAIGC/xdg-llama-3-8B • Text Generation • Updated • 40 • 3
xiaodongguaAIGC/llama-3-debug • Text Generation • Updated • 543 • 2
datasets
16
xiaodongguaAIGC/X-R1-TAL-SCQ5K • Viewer • Updated • 10k • 845 • 3
xiaodongguaAIGC/X-R1-TAL-SCQ2K • Viewer • Updated • 3.33k • 249 • 1
xiaodongguaAIGC/X-R1-7500 • Viewer • Updated • 12.5k • 836 • 1
xiaodongguaAIGC/X-R1-1500 • Viewer • Updated • 2.5k • 119
xiaodongguaAIGC/X-R1-750 • Viewer • Updated • 1.25k • 5.31k • 3
xiaodongguaAIGC/step_sft • Viewer • Updated • 84.2k • 130
xiaodongguaAIGC/step_prm • Viewer • Updated • 108k • 92
xiaodongguaAIGC/math_step_sft • Viewer • Updated • 12.5k • 81
xiaodongguaAIGC/GSM8k_step_sft • Viewer • Updated • 8.79k • 76
xiaodongguaAIGC/prm800k_step_sft • Viewer • Updated • 121k • 77