ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback Paper โข 2505.17908 โข Published May 23 โข 3
Long-Video Audio Synthesis with Multi-Agent Collaboration Paper โข 2503.10719 โข Published Mar 13 โข 9
RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification Paper โข 2503.02537 โข Published Mar 4 โข 12
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation Paper โข 2503.01370 โข Published Mar 3 โข 15
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation Paper โข 2503.01370 โข Published Mar 3 โข 15
TransPixar: Advancing Text-to-Video Generation with Transparency Paper โข 2501.03006 โข Published Jan 6 โข 27
GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs Paper โข 2412.11258 โข Published Dec 15, 2024 โข 13
GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs Paper โข 2412.11258 โข Published Dec 15, 2024 โข 13
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper โข 2411.13503 โข Published Nov 20, 2024 โข 35
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Paper โข 2409.18124 โข Published Sep 26, 2024 โข 34
SEED-Story: Multimodal Long Story Generation with Large Language Model Paper โข 2407.08683 โข Published Jul 11, 2024 โข 26
SEED-Story: Multimodal Long Story Generation with Large Language Model Paper โข 2407.08683 โข Published Jul 11, 2024 โข 26
LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching Paper โข 2311.11284 โข Published Nov 19, 2023 โข 20
LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching Paper โข 2311.11284 โข Published Nov 19, 2023 โข 20