Large-scale Pre-training for Grounded Video Caption Generation Paper • 2503.10781 • Published Mar 13 • 16