Article Enabling Long Context Training with Sequence Parallelism in Axolotl By axolotl-ai-co and 1 other • Apr 4 • 14
Article Welcome EmbeddingGemma, Google's new efficient embedding model By tomaarsen and 5 others • 6 days ago • 183
Article Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation By cbensimon and 3 others • 8 days ago • 47
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • Jul 29 • 170
Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • Jul 25 • 81
Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face Jul 30 • 177
Article Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training By siro1 and 4 others • Aug 8 • 63
Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • Jun 12 • 134
Article Vision Language Model Alignment in TRL ⚡️ By sergiopaniego and 4 others • Aug 7 • 80
Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • Aug 5 • 492
Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 86
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published Nov 15, 2024 • 88
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 130
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 669
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 653
Article cocogold: training Marigold for text-grounded segmentation By pcuenq • Jul 8 • 30