Multimodal Chaptering for Long-Form TV Newscast Video Paper β’ 2406.17590 β’ Published Mar 20, 2024 β’ 2
Moments Lab Research papers Collection All of Moments Lab Research papers available on Hugging Face β’ 3 items β’ Updated Sep 2, 2024 β’ 1
Towards Retrieval Augmented Generation over Large Video Libraries Paper β’ 2406.14938 β’ Published Jun 21, 2024 β’ 21
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model May 14, 2024 β’ 247
Inserting Faces inside Captions: Image Captioning with Attention Guided Merging Paper β’ 2405.02305 β’ Published Mar 20, 2024 β’ 2