hamishivi/2911_rl_rag_NAR8_gpt5sft_noadaptive_27343__1__1765945349_checkpoints_step_650 8B • Updated 1 day ago • 61
hamishivi/2911_rl_rag_NAR8_gpt5sft_noapaptive_27343__1__1765945349_checkpoints_step_500 8B • Updated 2 days ago • 28
hamishivi/2911_rl_rag_NAR8_gpt5sft_noapaptive_27343__1__1765945349_checkpoints_step_1000 8B • Updated 2 days ago • 22
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1765674535_checkpoints_step_3450 8B • Updated 12 days ago • 102
hamishivi/1011_rl_rag_open_judge_no_citation_1037__1__1765453995_checkpoints_step_1500 8B • Updated 17 days ago • 31
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1765452191_checkpoints_step_3350 8B • Updated 17 days ago • 33
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1764018132_step_2450 8B • Updated about 1 month ago • 3
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1762677729_step_1300 8B • Updated Nov 26 • 2
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605__1__1762886037_checkpoints_step_1300 8B • Updated Nov 26 • 3
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1762677729_step1900 8B • Updated Nov 23 • 4
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1762677729_checkpoints_step_1700 8B • Updated Nov 20 • 3
hamishivi/1011_rl_rag_open_judge_no_citation_1037__1__1762832496_checkpoints_step_850 8B • Updated Nov 20 • 3
hamishivi/Nemotron-Research-Reasoning-Qwen-1.5B-v2-RLVE Text Generation • 2B • Updated Nov 11 • 11 • 2