VITA-MLLM's Collections

VITA-Audio

Updated May 15

  • VITA-MLLM/VITA-Audio-Boost

    10B • Updated May 15 • 184 • 3

  • VITA-MLLM/VITA-Audio-Balance

    10B • Updated Apr 28 • 89 • 3

  • VITA-MLLM/VITA-Audio-Plus-Boost

    11B • Updated May 15 • 319 • 3

  • VITA-MLLM/VITA-Audio-Plus-Vanilla

    8B • Updated May 6 • 2.03k • 4

  • VITA-MLLM/VITA-Audio-Data

    Preview • Updated May 8 • 33 • 6

  • VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    Paper • 2505.03739 • Published May 6 • 8