deepseek-ai/DeepSeek-Prover-V2-671B Text Generation • 685B • Updated Apr 30 • 3.73k • 803
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published Feb 20 • 192
xukp20/Llama-3-8B-Instruct-SPPO-score-Iter3_gp_8b-table-0.002 Text Generation • 8B • Updated Sep 29, 2024 • 11
xukp20/Llama-3-8B-Instruct-SPPO-score-Iter3_bt_8b-table-0.002 Text Generation • 8B • Updated Sep 28, 2024 • 36
xukp20/Llama-3-8B-Instruct-SPPO-Iter3_bt_8b-table Text Generation • 8B • Updated Sep 28, 2024 • 13
xukp20/Llama-3-8B-Instruct-SPPO-score-Iter3_bt_2b-table-0.001 Text Generation • 8B • Updated Sep 28, 2024 • 14
xukp20/Llama-3-8B-Instruct-SPPO-Iter3_bt_2b-table Text Generation • 8B • Updated Sep 28, 2024 • 15
xukp20/Llama-3-8B-Instruct-SPPO-Iter3_gp_8b-table Text Generation • 8B • Updated Sep 28, 2024 • 14
xukp20/Llama-3-8B-Instruct-SPPO-score-Iter3_gp_2b-table-0.001 Text Generation • 8B • Updated Sep 28, 2024 • 30
xukp20/Llama-3-8B-Instruct-SPPO-Iter3_gp_2b-table Text Generation • 8B • Updated Sep 28, 2024 • 13