Orsta 🔥 vision language models trained with V-Triune, a unified reinforcement learning system by MiniMax AI
One-RL-to-See-Them-All/one-rl-to-see-them-all-6833d27abce23898b2f9815a
✨ 7B & 32B with MIT license
✨ Masters 8 visual tasks: math, science QA, charts, puzzles, object detection, grounding, OCR, and counting
✨ Uses Dynamic IoU rewards for better visual understanding
✨ Strong performance in visual reasoning and perception
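
For readers curious what a "Dynamic IoU reward" could look like in practice, here is a minimal sketch, not the V-Triune implementation. It assumes the reward is the IoU between a predicted and a ground-truth box, zeroed out when it falls below a threshold that tightens as training progresses. The function names, the `(x1, y1, x2, y2)` box format, and the schedule values are all hypothetical and chosen only for illustration.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Overlap width/height, clamped at zero when the boxes do not intersect.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


# Hypothetical schedule: (fraction of training completed, IoU threshold).
# These numbers are illustrative assumptions, not values from the post.
SCHEDULE = ((0.10, 0.85), (0.25, 0.95), (1.00, 0.99))


def dynamic_threshold(progress, schedule=SCHEDULE):
    """Return the IoU threshold for a training progress value in [0, 1]."""
    for end, threshold in schedule:
        if progress <= end:
            return threshold
    return schedule[-1][1]


def dynamic_iou_reward(pred_box, gt_box, progress):
    """Reward is the IoU itself, or 0 if it misses the current threshold."""
    score = iou(pred_box, gt_box)
    return score if score >= dynamic_threshold(progress) else 0.0


if __name__ == "__main__":
    pred, gt = (0, 0, 10, 9), (0, 0, 10, 10)
    print(dynamic_iou_reward(pred, gt, progress=0.05))  # lenient early stage
    print(dynamic_iou_reward(pred, gt, progress=0.50))  # strict late stage
```

The idea behind a schedule like this is that a loose threshold gives the model a usable signal early on, while a strict one later pushes it toward precise localization.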