Outline-Guided Object Inpainting with Diffusion Models Paper • 2402.16421 • Published Feb 26, 2024 • 1
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding Paper • 2506.05551 • Published Jun 5 • 5
EarthMind Collection The model, training, and evaluation data of EarthMind. • 4 items • Updated Jun 5 • 1
EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models Paper • 2506.01667 • Published Jun 2 • 21