R1-V
Towards the Aha Moment of Vision-Language Models

MMInstruction/Clevr_CoGenT_TrainA_R1 • Updated Feb 13 • 37.8k • 262 • 47
MMInstruction/SuperClevr_Val • Updated Feb 18 • 5k • 49 • 1
MMInstruction/Clevr_CoGenT_ValA • Updated Feb 3 • 5k • 341 • 1
MMInstruction/Clevr_CoGenT_ValB • Updated Feb 3 • 5k • 18 • 2