view article Article FuseO1-Preview: System-II Reasoning Fusion of LLMs By Wanfq and 4 others • Jan 20 • 21
Through the Valley: Path to Effective Long CoT Training for Small Language Models Paper • 2506.07712 • Published 24 days ago • 18
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity Paper • 2411.02335 • Published Nov 4, 2024 • 11