Abstract
Retrieving images from the same location as a given query is an important component of multiple computer vision tasks, like Visual Place Recognition, Landmark Retrieval, Visual Localization, 3D reconstruction, and SLAM. However, existing solutions are built to specifically work for one of these tasks, and are known to fail when the requirements slightly change or when they meet out-of-distribution data. In this paper we combine a variety of existing methods, training techniques, and datasets to train a retrieval model, called MegaLoc, that is performant on multiple tasks. We find that MegaLoc (1) achieves state of the art on a large number of Visual Place Recognition datasets, (2) impressive results on common Landmark Retrieval datasets, and (3) sets a new state of the art for Visual Localization on the LaMAR datasets, where we only changed the retrieval method to the existing localization pipeline. The code for MegaLoc is available at https://github.com/gmberton/MegaLoc
Community
MegaLoc! A new retrieval model for localization, achieves SOTA on Visual Place Recognition (outdoor and indoor!), Visual Localization pipelines (LaMAR) and Landmark Retrieval, making it the perfect choice for any localization pipeline.
Try our demo at https://2a95f3be4f70bd018e.gradio.live
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization (2025)
- IDEA: Image Description Enhanced CLIP-Adapter (2025)
- PRVQL: Progressive Knowledge-guided Refinement for Robust Egocentric Visual Query Localization (2025)
- Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval (2025)
- OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning (2024)
- Vision-Language In-Context Learning Driven Few-Shot Visual Inspection Model (2025)
- Towards Identity-Aware Cross-Modal Retrieval: a Dataset and a Baseline (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper