view article Article Interactive Tools for machine learning, deep learning, and math By Suzana • 4 days ago • 33
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • 8 days ago • 98
view changelog Changelog Xet is now the default storage option for new users and organizations 8 days ago • 47
view article Article NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • 11 days ago • 24
LightLab: Controlling Light Sources in Images with Diffusion Models Paper • 2505.09608 • Published 16 days ago • 31
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing Paper • 2505.02370 • Published 26 days ago • 14
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 16 days ago • 104
view article Article Highlights from the First ICLR 2025 Watermarking Workshop By hadyelsahar and 4 others • 16 days ago • 10
view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? By danaaubakirova and 6 others • 20 days ago • 49
view article Article AI Personas: The Impact of Design Choices By giadap and 1 other • 24 days ago • 13
Hugging Face community’s Wikimedia datasets Collection Wikimedia datasets created by the Hugging Face community, not Wikimedia. Sorted by Wikimedia project. • 17 items • Updated Jun 7, 2024 • 11
SwallowMath Collection Rewriting Pre-Training Data Boosts LLM Performance in Math and Code • 11 items • Updated 24 days ago • 3
SwallowCode Collection Rewriting Pre-Training Data Boosts LLM Performance in Math and Code • 66 items • Updated 24 days ago • 3
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 17 • 92