Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yuancheng Wang's picture
19 4 22

Yuancheng Wang

Hecheng0625
Mira1sen's profile picture 21world's profile picture lmxue's profile picture
·
https://hecheng0625.github.io/
  • Hecheng0625

AI & ML interests

ML, DL, Speech, Audio, NLP

Recent Activity

upvoted a paper about 10 hours ago
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling
authored a paper 2 days ago
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
authored a paper 2 days ago
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
View all activity

Organizations

Amphion's profile picture

Papers 13

arxiv:2508.16790
arxiv:2508.04195
arxiv:2505.13000
arxiv:2502.03128

models 11

Hecheng0625/TasCodec_6_25hz

0.5B • Updated Apr 16

Hecheng0625/AUDIT

Updated Apr 11 • 1

Hecheng0625/fast_maskgct

Updated Nov 23, 2024

Hecheng0625/codec_24k_480hopsize_12layer_vocos

Updated Jun 5, 2024

Hecheng0625/RepCodec

Updated Jun 5, 2024

Hecheng0625/SoundStorm

Updated Jun 1, 2024

Hecheng0625/semantic_kmeans

Updated May 31, 2024

Hecheng0625/codec_16k_320hopsize_8layer_vocos

Updated May 16, 2024

Hecheng0625/naturalspeech2

Updated May 6, 2024 • 1

Hecheng0625/latent_codec_gpt_tts

Updated Apr 28, 2024
View 11 models

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs