Yuancheng Wang
Hecheng0625
9 followers · 20 following
https://hecheng0625.github.io/
AI & ML interests
ML, DL, Speech, Audio, NLP
Recent Activity
upvoted a paper about 10 hours ago
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling
authored a paper 2 days ago
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
authored a paper 2 days ago
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Papers (13)
arxiv:2508.16790
arxiv:2508.04195
arxiv:2505.13000
arxiv:2502.03128
Models (11)
Hecheng0625/TasCodec_6_25hz • 0.5B • Updated Apr 16
Hecheng0625/AUDIT • Updated Apr 11 • 1
Hecheng0625/fast_maskgct • Updated Nov 23, 2024
Hecheng0625/codec_24k_480hopsize_12layer_vocos • Updated Jun 5, 2024
Hecheng0625/RepCodec • Updated Jun 5, 2024
Hecheng0625/SoundStorm • Updated Jun 1, 2024
Hecheng0625/semantic_kmeans • Updated May 31, 2024
Hecheng0625/codec_16k_320hopsize_8layer_vocos • Updated May 16, 2024
Hecheng0625/naturalspeech2 • Updated May 6, 2024 • 1
Hecheng0625/latent_codec_gpt_tts • Updated Apr 28, 2024
Datasets (0)
None public yet