DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 6 days ago • 242
LLMDet Collection LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models • 5 items • Updated Jul 26 • 3
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 516
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models Paper • 2411.07232 • Published Nov 11, 2024 • 68