dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated 15 days ago • 12k • 57
dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated Apr 29 • 1k • 92
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated Apr 29 • 500 • 90
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated Apr 21 • 500 • 27
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_8192 Viewer • Updated Apr 19 • 12k • 24
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated Apr 19 • 12k • 36
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_32768 Viewer • Updated Apr 18 • 12k • 39