Article FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages By davanstrien and 5 others • 4 days ago • 26
Article Explore, Build, and Innovate AI Reasoning with NVIDIA's Open Models and Recipes By nvidia and 2 others • Jun 4 • 21
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 113
Article Tiny Agents: a MCP-powered agent in 50 lines of code By julien-c • Apr 25 • 285
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7 • 192
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published Mar 7 • 124
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality By saurabhdash and 3 others • Mar 4 • 75
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Apr 15 • 69
CHASE Collection Generate challenging synthetic data to evaluate LLMs • 5 items • Updated Feb 21 • 4
How to Get Your LLM to Generate Challenging Problems for Evaluation Paper • 2502.14678 • Published Feb 20 • 18
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19 • 38
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions Paper • 2502.13791 • Published Feb 19 • 5
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 123
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 99
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring Paper • 2501.02045 • Published Jan 3 • 21
EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation Paper • 2501.01895 • Published Jan 3 • 56