Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA Paper • 2406.09396 • Published Jun 13, 2024 • 3
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy Paper • 2406.20095 • Published Jun 28, 2024 • 18