SATQuest Dataset Collections
Adam Yanxiao Zhao
sdpkjc
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a model
13 days ago
sdpkjc/Qwen2.5-0.5B-SFT-24quiz-checkpoint-800
published
a model
13 days ago
sdpkjc/Qwen2.5-0.5B-SFT-24quiz-checkpoint-800
updated
a model
13 days ago
sdpkjc/Qwen2.5-0.5B-SFT-24quiz-checkpoint-300
Organizations
Collections
1
Papers
2
models
100

sdpkjc/Qwen2.5-0.5B-SFT-24quiz-checkpoint-800
Text Generation
•
Updated
•
1

sdpkjc/Qwen2.5-0.5B-SFT-24quiz-checkpoint-300
Text Generation
•
Updated
•
1

sdpkjc/Qwen2.5-1.5B-Instruct-FT-DPO
Text Generation
•
Updated
•
38

sdpkjc/SmolLM2-FT-DPO
Text Generation
•
Updated
•
12

sdpkjc/SmolLM2-FT-MyDataset
Text Generation
•
Updated
•
9

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated
datasets
17
sdpkjc/24problems_quiz-eval-n4-1-10-24
Viewer
•
Updated
•
55.5k
•
63
sdpkjc/24problems_quiz-eval-5
Viewer
•
Updated
•
100k
•
75
sdpkjc/24problems_quiz
Viewer
•
Updated
•
85.6k
•
153
sdpkjc/SATQuest-RFT-3k
Viewer
•
Updated
•
3k
•
31
sdpkjc/SATQuest-RFT-1k
Viewer
•
Updated
•
1k
•
14
sdpkjc/SATQuest-Tiny
Viewer
•
Updated
•
10
•
16
sdpkjc/SATQuest
Viewer
•
Updated
•
140
•
34
sdpkjc/SATQuest-G
Viewer
•
Updated
•
963
•
13
sdpkjc/NumBase-N01-S2g-B2g
Viewer
•
Updated
•
983k
•
20
sdpkjc/NumBase-N01-S2g-B28
Viewer
•
Updated
•
459k
•
20