duk guo

dukkkk

AI & ML interests

None yet

Organizations

WenetSpeech4TTS

authored 4 papers 12 months ago

Text-aware and Context-aware Expressive Audiobook Speech Synthesis

Paper • 2406.05672 • Published Jun 9, 2024

WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark

Paper • 2406.05763 • Published Jun 9, 2024

HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS

Paper • 2309.13907 • Published Sep 25, 2023

Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

Paper • 2312.09746 • Published Dec 15, 2023