TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Paper • 2501.15513 • Published Jan 26