Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 17 days ago • 140
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 23 days ago • 112
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • 26 days ago • 417
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published Apr 24 • 39
🌙 March 2025 - Open releases from the Chinese community Collection 32 items • Updated 22 days ago • 13
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents Paper • 2503.01935 • Published Mar 3 • 27
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published Mar 6 • 15
Article 🌁#89: AI in Action: How AI Engineers, Self-Optimizing Models, and Humanoid Robots Are Reshaping 2025 By Kseniase • Feb 25 • 4
Article What is test-time compute and how to scale it? By Kseniase and 1 other • Feb 6 • 89
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 232
Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.25k
Article How to deploy and fine-tune DeepSeek models on AWS By pagezyhf and 2 others • Jan 30 • 52
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other • Jan 16 • 74
Article Topic 23: What is LLM Inference, it's challenges and solutions for it By Kseniase • Jan 17 • 6