UCSC-VLAA/ViT-L-16-HTxt-Recap-CLIP Zero-Shot Image Classification β’ Updated Jun 24, 2024 β’ 1.45k β’ 17
What If We Recaption Billions of Web Images with LLaMA-3? Paper β’ 2406.08478 β’ Published Jun 12, 2024 β’ 42