MMDetection: Open MMLab Detection Toolbox and Benchmark Paper • 1906.07155 • Published Jun 17, 2019
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction Paper • 2412.13187 • Published Dec 17, 2024
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond Paper • 1904.11492 • Published Apr 25, 2019
GroupViT: Semantic Segmentation Emerges from Text Supervision Paper • 2202.11094 • Published Feb 22, 2022
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Paper • 2303.04803 • Published Mar 8, 2023