MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published 20 days ago • 250
Gromov series [GRPO] Collection Specific datasets particulary effective in GRPO • 6 items • Updated May 28 • 1