Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset Paper • 2506.18851 • Published 22 days ago • 29
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published 22 days ago • 73
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition Paper • 2506.17201 • Published 25 days ago • 52
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning Paper • 2506.15154 • Published 27 days ago • 8
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model Paper • 2506.13642 • Published 29 days ago • 27
PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework Paper • 2506.10741 • Published Jun 12 • 27
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation Paper • 2506.09350 • Published Jun 11 • 48
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Paper • 2504.21650 • Published Apr 30 • 16
FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios Paper • 2505.03730 • Published May 6 • 28
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation Paper • 2504.14899 • Published Apr 21 • 21
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance Paper • 2504.06232 • Published Apr 8 • 14
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published Apr 11 • 129
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation Paper • 2504.02160 • Published Apr 2 • 37
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published Apr 8 • 172
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis Paper • 2504.04842 • Published Apr 7 • 36
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization Paper • 2504.03011 • Published Apr 3 • 11