CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models Paper • 2505.12504 • Published 20 days ago • 23
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision Paper • 2505.13427 • Published 19 days ago • 25
MM-IFEngine Collection Datasets, Benchmark and Checkpoints for MM-IFEngine • 2 items • Updated Apr 26 • 5
MM-IFEngine Collection Datasets, Benchmark and Checkpoints for MM-IFEngine • 2 items • Updated Apr 26 • 5