ColorFlow: Retrieval-Augmented Image Sequence Colorization Paper β’ 2412.11815 β’ Published 10 days ago β’ 26
BrushEdit: All-In-One Image Inpainting and Editing Paper β’ 2412.10316 β’ Published 12 days ago β’ 33
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Paper β’ 2412.09645 β’ Published 15 days ago β’ 35
Byte Latent Transformer: Patches Scale Better Than Tokens Paper β’ 2412.09871 β’ Published 13 days ago β’ 75
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models Paper β’ 2412.12606 β’ Published 9 days ago β’ 41
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper β’ 2412.13018 β’ Published 9 days ago β’ 40
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Paper β’ 2412.14171 β’ Published 7 days ago β’ 22
AnySat: An Earth Observation Model for Any Resolutions, Scales, and Modalities Paper β’ 2412.14123 β’ Published 7 days ago β’ 11