T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 119
GAS: Improving Discretization of Diffusion ODEs via Generalized Adversarial Solver Paper • 2510.17699 • Published Oct 20, 2025 • 24
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 776
DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization Paper • 2505.20975 • Published May 27, 2025 • 36
ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models Paper • 2505.22569 • Published May 28, 2025 • 55
Accelerating Nash Learning from Human Feedback via Mirror Prox Paper • 2505.19731 • Published May 26, 2025 • 6
Accelerating Nash Learning from Human Feedback via Mirror Prox Paper • 2505.19731 • Published May 26, 2025 • 6
Accelerating Nash Learning from Human Feedback via Mirror Prox Paper • 2505.19731 • Published May 26, 2025 • 6 • 2
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 258
On Teacher Hacking in Language Model Distillation Paper • 2502.02671 • Published Feb 4, 2025 • 18 • 2