- Reinforcement Learning: An Overview
  Paper • 2412.05265 • Published • 7
- Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
  Paper • 2411.01156 • Published • 6
- VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
  Paper • 2503.21755 • Published • 32
- Qwen2.5-Omni Technical Report
  Paper • 2503.20215 • Published • 139
LI
RogerZhuo
AI & ML interests
None yet
Recent Activity
upvoted a collection 3 days ago
Animagine XL 4.0
liked a dataset 3 days ago
cagliostrolab/860k-ordered-tags
liked a model 3 days ago
Wan-AI/Wan2.1-FLF2V-14B-720P
Organizations
Collections
9
- ElectricAlexis/NotaGen
  Updated • 136
- ASLP-lab/LLaSE-G1
  Audio-to-Audio • Updated • 20
- Di♪♪Rhythm
  🎶Blazingly Fast and Embarrassingly Simple Song Generation • 549
- DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
  Paper • 2503.01183 • Published • 26
models
None public yet
datasets
None public yet