Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Paper • 2505.09343 • Published 10 days ago • 58
Mergenetic: a Simple Evolutionary Model Merging Library Paper • 2505.11427 • Published 8 days ago • 12
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 18 days ago • 159