INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning Paper • 2505.07291 • Published 19 days ago • 11 • 2
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning Paper • 2111.10952 • Published Nov 22, 2021 • 2 • 1
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 394 • 6
Beyond Release: Access Considerations for Generative AI Systems Paper • 2502.16701 • Published Feb 23 • 16 • 4