Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
HKUSTAudio 's Collections
Llasa
YuE

Llasa

updated May 11

TTS foundation model compatible with Llama framework (160k hours tokenized speech data released)

Upvote
18

  • HKUSTAudio/xcodec2

    Audio-to-Audio • 0.8B • Updated Feb 23 • 164k • 77

  • HKUSTAudio/Llasa-1B

    Text-to-Speech • 1B • Updated May 10 • 9.32k • 98

  • HKUSTAudio/Llasa-3B

    Text-to-Speech • 4B • Updated May 10 • 3.43k • • 510

  • HKUSTAudio/Llasa-8B

    Text-to-Speech • 9B • Updated Mar 9 • 4.88k • 94

  • HKUSTAudio/Llasa-1B-Multilingual

    Text-to-Speech • 2B • Updated Mar 5 • 62.6k • 38

  • HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized

    Updated Feb 13 • 1.16k • 29

  • Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

    Paper • 2502.04128 • Published Feb 6 • 26

  • HKUSTAudio/Llasa-3B-Preserve-TextChat

    Text-to-Speech • 4B • Updated Feb 13 • 4 • 2

  • HKUSTAudio/Llasa-1B-Preserve-TextChat

    Text-to-Speech • 2B • Updated Feb 13 • 724 • 2

  • HKUSTAudio/Llasa-1B-multi-speakers-genshin-zh-en-ja-ko

    Text-to-Speech • 2B • Updated Feb 13 • 1.11k • 4

  • HKUSTAudio/Llasa-1B-two-speakers-kore-puck

    Text-to-Speech • 2B • Updated Feb 13 • 11 • 4
Upvote
18
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs