Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation Paper β’ 2505.00612 β’ Published 23 days ago β’ 9
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others β’ 10 days ago β’ 100
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset Paper β’ 2504.16891 β’ Published about 1 month ago β’ 21
The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks Paper β’ 2504.15521 β’ Published Apr 22 β’ 64
view article Article Visualize and understand GPU memory in PyTorch By qgallouedec β’ Dec 24, 2024 β’ 223
kaitchup/DeepSeek-R1-Distill-Qwen-14B-AutoRound-GPTQ-4bit Text Generation β’ Updated Jan 27 β’ 433 β’ 6
Light-R1 Collection Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond β’ 7 items β’ Updated Mar 13 β’ 12