Fully Autonomous AI Agents Should Not be Developed Paper β’ 2502.02649 β’ Published 24 days ago β’ 24
Presumed Cultural Identity: How Names Shape LLM Responses Paper β’ 2502.11995 β’ Published 11 days ago β’ 10
Demystifying Long Chain-of-Thought Reasoning in LLMs Paper β’ 2502.03373 β’ Published 23 days ago β’ 54
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published 24 days ago β’ 195
view article Article Ο0 and Ο0-FAST: Vision-Language-Action Models for General Robot Control 25 days ago β’ 109
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 11 items β’ Updated 16 days ago β’ 91
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI β’ Jan 15 β’ 41
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper β’ 2406.11896 β’ Published Jun 14, 2024 β’ 20
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ Jan 2 β’ 40
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 26 items β’ Updated Jan 8 β’ 563