LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation Paper • 2309.09294 • Published Sep 17, 2023
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos Paper • 2304.01186 • Published Apr 3, 2023
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Paper • 2310.07702 • Published Oct 11, 2023
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net Paper • 2308.14221 • Published Aug 27, 2023 • 1
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation Paper • 2211.12194 • Published Nov 22, 2022
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild Paper • 2211.14758 • Published Nov 27, 2022 • 2
T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations Paper • 2301.06052 • Published Jan 15, 2023
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing Paper • 2301.06281 • Published Jan 16, 2023
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models Paper • 2407.10285 • Published Jul 14, 2024 • 5
StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos Paper • 2409.07447 • Published Sep 11, 2024
Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach Paper • 2410.03160 • Published Oct 4, 2024 • 5
AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation Paper • 2411.17383 • Published Nov 26, 2024 • 7
CV-VAE: A Compatible Video VAE for Latent Generative Video Models Paper • 2405.20279 • Published May 30, 2024
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Paper • 2412.18597 • Published Dec 24, 2024 • 19
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training Paper • 2412.15646 • Published Dec 20, 2024
DEIM: DETR with Improved Matching for Fast Convergence Paper • 2412.04234 • Published Dec 5, 2024 • 1
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning Paper • 2505.23504 • Published May 29 • 7
AB-Cache: Training-Free Acceleration of Diffusion Models via Adams-Bashforth Cached Feature Reuse Paper • 2504.10540 • Published Apr 13