simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_60k_win_only • Text Generation • 4B • Updated 4 days ago • 5
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_60k_whole • Text Generation • 4B • Updated 4 days ago • 5
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_30k_win_only • Text Generation • 4B • Updated 4 days ago • 5
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_30k_whole • Text Generation • 4B • Updated 4 days ago • 6
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_15k_win_only • Text Generation • 4B • Updated 4 days ago • 8
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_5k_win_only • Text Generation • 4B • Updated 4 days ago • 8
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_5k_whole • Text Generation • 4B • Updated 4 days ago • 8
simonycl/octothinker-8b-hybrid-zero-cold-start-sft • Text Generation • 8B • Updated 12 days ago • 3
simonycl/octothinker-8b-hybrid-zero-cold-start-sft-step-5 • Text Generation • 8B • Updated 12 days ago • 246
simonycl/octothinker-3b-hybrid-zero-cold-start-sft • Text Generation • 3B • Updated 16 days ago • 27
simonycl/octothinker-3b-hybrid-zero-cold-start-step-5 • Text Generation • 3B • Updated 16 days ago • 397
simonycl/gemma_3_27b_cmv_hard_persuasion_judge_new • Image-Text-to-Text • 27B • Updated May 15 • 5
simonycl/gemma_3_27b_cmv_hard_persuasion_judge_new_overwrites • Image-Text-to-Text • 27B • Updated May 15 • 4