4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Paper • 2406.09406 • Published Jun 13, 2024 • 15
BRAVE: Broadening the visual encoding of vision-language models Paper • 2404.07204 • Published Apr 10, 2024 • 19
Unraveling the Key Components of OOD Generalization via Diversification Paper • 2312.16313 • Published Dec 26, 2023 • 1