Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jianghan Chao's picture
3 1

Jianghan Chao

roverx12345
·

AI & ML interests

Robotics

Recent Activity

published a dataset 8 days ago
roverx12345/audios
liked a model 3 months ago
mispeech/r1-aqa
upvoted a paper 7 months ago
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
View all activity

Organizations

Renmin University of China's profile picture

published a dataset 8 days ago

roverx12345/audios

Updated 8 days ago • 5
liked a model 3 months ago

mispeech/r1-aqa

Audio-Text-to-Text • Updated Mar 28 • 2.37k • 16
upvoted a paper 7 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 99
upvoted 2 papers 8 months ago

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published Oct 22, 2024 • 48

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23, 2024 • 37
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs