Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models Paper • 2505.17015 • Published 2 days ago • 6
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models Paper • 2505.17015 • Published 2 days ago • 6 • 2
PointLLM: Empowering Large Language Models to Understand Point Clouds Paper • 2308.16911 • Published Aug 31, 2023 • 1
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI Paper • 2312.16170 • Published Dec 26, 2023 • 1
Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator Paper • 2308.16906 • Published Aug 31, 2023