MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 21 items • Updated 4 days ago • 37
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 24 days ago • 32 • 8
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 24 days ago • 32 • 8
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages Paper • 2308.12038 • Published Aug 23, 2023 • 2
A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs Paper • 2411.17265 • Published Nov 26, 2024 • 1
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 24 days ago • 32
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 24 days ago • 32 • 8
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 24 days ago • 32 • 8
RLPR Collection Extrapolating RLVR to General Domains without Verifiers • 6 items • Updated 6 days ago • 3
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 24 days ago • 32
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 24 days ago • 32 • 8