simonycl/gsm8k_training_negative_chain_of_thought_1k_gpt-4.1_transformed Viewer • Updated 1 day ago • 1.93k • 8
simonycl/gsm8k_training_negative_multi_turn_1k_gpt-4.1_transformed Viewer • Updated 1 day ago • 1.75k • 7
simonycl/gsm8k_training_positive_direct_multi_turn_1k_transformed Viewer • Updated 1 day ago • 1k • 8
simonycl/gsm8k_training_negative_combined_1k_gemini-2.5-flash_transformed Viewer • Updated 2 days ago • 1.76k • 22
simonycl/gsm8k_training_negative_vs_standard_1k_gemini-2.5-flash_transformed Viewer • Updated 2 days ago • 1.7k • 22
simonycl/gsm8k_training_negative_sequence_1k_gemini-2.5-flash_transformed Viewer • Updated 2 days ago • 1.78k • 21
simonycl/gsm8k_training_negative_direct_1k_gemini-2.5-flash_transformed Viewer • Updated 2 days ago • 1.49k • 22
simonycl/gsm8k_training_negative_combined_1k_gpt-4.1_transformed Viewer • Updated 2 days ago • 1.92k • 64
simonycl/gsm8k_training_negative_vs_standard_1k_gpt-4.1_transformed Viewer • Updated 2 days ago • 1.93k • 60
simonycl/gsm8k_training_negative_sequence_1k_gpt-4.1_transformed Viewer • Updated 2 days ago • 1.88k • 58
simonycl/gsm8k_training_negative_direct_1k_gpt-4.1_transformed Viewer • Updated 2 days ago • 1.65k • 81
simonycl/game-eval-Qwen-Qwen3-32B-vs-Qwen-Qwen3-32B-20250908-101728 Viewer • Updated 4 days ago • 11.5k • 42
simonycl/game-eval-Qwen-Qwen3-32B-vs-Qwen-Qwen3-32B-20250908-101654 Viewer • Updated 4 days ago • 5.72k • 37
simonycl/game-eval-Qwen-Qwen3-32B-vs-Qwen-Qwen3-32B-20250908-101501 Viewer • Updated 4 days ago • 2.3k • 40
simonycl/game-eval-Qwen-Qwen3-32B-vs-Qwen-Qwen3-32B-20250907-135811 Viewer • Updated 5 days ago • 46.1k • 57
simonycl/game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250829-232628 Viewer • Updated 14 days ago • 46.7k • 86
simonycl/game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250729-131038 Viewer • Updated Jul 29 • 59k • 9