reasoning-proj/j_bigbench_mistake_eval_z_Phi-4-reasoning-plus Viewer • Updated 8 days ago • 2.19k • 42
reasoning-proj/j_bigbench_mistake_eval_z_Llama-3.1-Nemotron-Nano-8B-v1 Viewer • Updated 8 days ago • 2.19k • 95
reasoning-proj/j_bigbench_mistake_eval_z_DeepSeek-R1-Distill-Llama-70B Viewer • Updated 8 days ago • 2.19k • 46
reasoning-proj/bigbench_mistake_eval_z_Llama-3.1-Nemotron-Nano-8B-v1 Viewer • Updated 9 days ago • 2.19k • 483
reasoning-proj/bigbench_mistake_eval_z_Phi-4-reasoning-plus Viewer • Updated 9 days ago • 2.19k • 227
reasoning-proj/bigbench_mistake_eval_z_DeepSeek-R1-Distill-Llama-70B Viewer • Updated 9 days ago • 2.19k • 220