VisionArena: 230K Real World User-VLM Conversations with Preference Labels Paper • 2412.08687 • Published 14 days ago
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published Nov 15
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Paper • 2406.08418 • Published Jun 12