Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization Paper • 2504.18397 • Published Apr 25 • 2