sqres/v5_reasoning_sft__v6tulu3__bs128__lr5e-06__epochs2__20250624-124533 33B • Updated 8 days ago • 10
sqres/v5_reasoning_sft__v6tulu3__bs128__lr5e-06__epochs2__20250624-124533 33B • Updated 8 days ago • 10
sqres/v6_tulu3__Qwen2.532BInstruct__bs128__lr5e-06__epochs2__20250622-233816 33B • Updated 9 days ago • 168
sqres/v6_tulu3__Qwen2.532BInstruct__bs128__lr5e-06__epochs2__20250622-233816 33B • Updated 9 days ago • 168
sqres/v5_cpt_r_ablation_r10__Qwen2.532BInstruct__bs128__lr5e-06__epochs1__20250614-234205 33B • Updated 10 days ago • 39
sqres/v5_cpt_r_ablation_r9__Qwen2.532BInstruct__bs128__lr5e-06__epochs1__20250614-234202 33B • Updated 10 days ago • 39
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Paper • 2409.04109 • Published Sep 6, 2024 • 49
Configurable Foundation Models: Building LLMs from a Modular Perspective Paper • 2409.02877 • Published Sep 4, 2024 • 31
The Prompt Report: A Systematic Survey of Prompting Techniques Paper • 2406.06608 • Published Jun 6, 2024 • 65