·
AI & ML interests
None yet
Organizations
None yet
Preview
•
Updated
•
88
•
1
Viewer
•
Updated
•
1.41M
•
6
Viewer
•
Updated
•
1.41M
•
30
Viewer
•
Updated
•
1.41M
•
9
zd21/ReST-MCTS_SciGLM-6B_Self-Rewarding-DPO_2nd
Viewer
•
Updated
•
1
•
8
zd21/ReST-MCTS_SciGLM-6B_ReST-MCTS_Policy_2nd
Viewer
•
Updated
•
40.9k
•
7
zd21/ReST-MCTS_SciGLM-6B_ReST-EM-CoT_2nd
Viewer
•
Updated
•
28.9k
•
8
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_Self-Rewarding-DPO_2nd
Viewer
•
Updated
•
1
•
4
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-MCTS_2nd
Viewer
•
Updated
•
26k
•
6
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-EM-CoT_2nd
Viewer
•
Updated
•
36.6k
•
7
zd21/ReST-MCTS_Llama3-8b-Instruct_Self-Rewarding-DPO_2nd
Viewer
•
Updated
•
1
•
1
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-MCTS_Policy_2nd
Viewer
•
Updated
•
32.3k
•
3
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-EM-CoT_2nd
Viewer
•
Updated
•
33.2k
•
6
zd21/ReST-MCTS_SciGLM-6B_Self-Rewarding-DPO_1st
Viewer
•
Updated
•
33.5k
•
6
zd21/ReST-MCTS_SciGLM-6B_ReST-MCTS_Policy_1st
Viewer
•
Updated
•
30.1k
•
2
zd21/ReST-MCTS_SciGLM-6B_ReST-EM-CoT_1st
Viewer
•
Updated
•
55.8k
•
3
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_Self-Rewarding-DPO_1st
Viewer
•
Updated
•
1
•
4
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-MCTS_1st
Viewer
•
Updated
•
38.7k
•
4
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-EM-CoT_1st
Viewer
•
Updated
•
74k
•
4
zd21/ReST-MCTS_Llama3-8b-Instruct_Self-Rewarding-DPO_1st
Viewer
•
Updated
•
1
•
1
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-MCTS_Policy_1st
Viewer
•
Updated
•
33.7k
•
6
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-EM-CoT_1st
Viewer
•
Updated
•
73.1k
•
6
Viewer
•
Updated
•
474k
•
7
•
2
Viewer
•
Updated
•
91.8k
•
45
•
7
Viewer
•
Updated
•
91.8k
•
10
•
3
zd21/ReST-MCTS-Llama3-8b-Instruct-Policy-1st
Viewer
•
Updated
•
33.7k
•
4
•
7
zd21/ReST-MCTS-Llama3-8b-Instruct-PRM-1st
Viewer
•
Updated
•
673k
•
25
•
9