Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA • Paper • 2406.09396 • Published Jun 13, 2024 • 3
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy • Paper • 2406.20095 • Published Jun 28, 2024 • 18
Perceptual Grouping in Contrastive Vision-Language Models • Paper • 2210.09996 • Published Oct 18, 2022 • 1