Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • 30 days ago • 637
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • about 1 month ago • 614
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 64
Built with Distill blog ❤️ Collection Collection of all interactive blogs built on top of Distill template. To create your own check: https://huggingface.co/spaces/lvwerra/distill-blog-tem • 6 items • Updated Mar 14 • 1
Article Fixing Open LLM Leaderboard with Math-Verify By hynky and 3 others • Feb 14 • 30
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 241
Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others • Dec 23, 2024 • 21
IrokoBench Collection A human-translated benchmark dataset for 16 African languages covering three tasks: NLI, MMLU and MGSM • 6 items • Updated May 31, 2024 • 20
Article Scaling AI-based Data Processing with Hugging Face + Dask By scj13 and 3 others • Oct 9, 2024 • 31
Article 🇨🇿 BenCzechMark - Can your LLM Understand Czech? By mfajcik and 12 others • Oct 1, 2024 • 21
Article The 5 Most Under-Rated Tools on Hugging Face By derek-thomas • Aug 22, 2024 • 90
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 98
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Paper • 2405.20541 • Published May 30, 2024 • 24
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation Paper • 2312.12491 • Published Dec 19, 2023 • 73
VinaLLaMA: LLaMA-based Vietnamese Foundation Model Paper • 2312.11011 • Published Dec 18, 2023 • 22