Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper • 2506.09250 • Published 5 days ago • 19
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 19 items • Updated 2 days ago • 58
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 10 days ago • 36
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated 9 days ago • 25
Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others • Dec 23, 2024 • 20
👩‍💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated May 13 • 18
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated 27 days ago • 151
Article Interactive Tools for machine learning, deep learning, and math By Suzana • 20 days ago • 44
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • May 15 • 113
INTELLECT-2 Collection INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. • 3 items • Updated May 11 • 22
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers Paper • 2505.04842 • Published May 7 • 12
Article Bamba-9B-v2 - Fast and powerful! By ibm-ai-platform and 12 others • Apr 29 • 32
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published Apr 15 • 28
Article Tiny Agents: a MCP-powered agent in 50 lines of code By julien-c • Apr 25 • 272