Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models Paper • 2410.16163 • Published Oct 21, 2024 • 1
GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking Paper • 2506.01078 • Published Jun 1 • 2
Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning Paper • 2503.18013 • Published Mar 23 • 20