2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1 • 99
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model Paper • 2407.07053 • Published Jul 9, 2024 • 44
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow Paper • 2306.07209 • Published Jun 12, 2023 • 2
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives Paper • 2401.02009 • Published Jan 4, 2024 • 1
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model Paper • 2407.07053 • Published Jul 9, 2024 • 44