Yansong Shi's picture

7 1 3

Yansong Shi

nanamma

·

https://huggingface.co/nanamma

AI & ML interests

multi modality, video understanding, robotics

Recent Activity

upvoted a paper about 1 month ago

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

authored a paper 3 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

authored a paper 3 months ago

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

View all activity

Organizations

Papers 2

arxiv:2410.19702

arxiv:2403.15377

models 1

nanamma/umt_0907

Updated Sep 7, 2024

datasets 0

None public yet