Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages Paper • 2308.12038 • Published Aug 23, 2023 • 2
A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs Paper • 2411.17265 • Published Nov 26, 2024
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 2 days ago • 26
RLPR Collection Extrapolating RLVR to General Domains without Verifiers • 6 items • Updated about 22 hours ago • 2
RLPR Collection Extrapolating RLVR to General Domains without Verifiers • 6 items • Updated about 22 hours ago • 2
RLPR Collection Extrapolating RLVR to General Domains without Verifiers • 6 items • Updated about 22 hours ago • 2