view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • 25 days ago • 167
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! 22 days ago • 49
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • 16 days ago • 101
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence Paper • 2505.23747 • Published 29 days ago • 67
view article Article Manus AI: The Best Autonomous AI Agent Redefining Automation and Productivity By LLMhacker • Mar 6 • 172
view article Article 🥬 LettuceDetect Goes Multilingual: Fine-tuning EuroBERT on Synthetic Translations By adaamko and 1 other • May 19 • 9
RAGTruth LLM Translations Collection This collection includes our translated training data that we've used to create multilingual hallucination detection models. • 8 items • Updated May 18 • 3
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 462
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 29 days ago • 156
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 1 day ago • 52
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7 • 191