view article Article Back to The Future: Evaluating AI Agents on Predicting Future Events By vinid and 6 others • 9 days ago • 26
view article Article Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 9 days ago • 116
view article Article Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • 24 days ago • 70
view article Article LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs By davidberenstein1957 and 3 others • 24 days ago • 14
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! Jun 6 • 52
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 84
view article Article The Environmental Impacts of AI -- Primer By sasha and 2 others • Sep 3, 2024 • 42
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 117
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • Jun 3 • 208
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 276
view article Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova • May 16 • 30