Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper • 2506.09250 • Published 5 days ago • 19
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 19 items • Updated 2 days ago • 58
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 10 days ago • 36
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 10 days ago • 36
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated 9 days ago • 25
view article Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others • Dec 23, 2024 • 20
👩💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated May 13 • 18
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated 27 days ago • 151