BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published 10 days ago • 82
GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning Paper • 2505.11049 • Published 8 days ago • 50
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 4 days ago • 113