UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation Paper • 2506.03147 • Published 2 days ago • 52
ImgEdit: A Unified Image Editing Dataset and Benchmark Paper • 2505.20275 • Published 10 days ago • 17
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation Paper • 2505.20292 • Published 10 days ago • 52
OpenS2V-Nexus Collection OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation • 5 items • Updated 9 days ago • 3
Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published Nov 28, 2024 • 35
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators Paper • 2404.05014 • Published Apr 7, 2024 • 35
MagicTime Collection MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators • 4 items • Updated Nov 29, 2024 • 13
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation Paper • 2406.18522 • Published Jun 26, 2024 • 21
ChronoMagic-Bench Collection ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation • 6 items • Updated Nov 29, 2024 • 10
ConsisID Collection Identity-Preserving Text-to-Video Generation by Frequency Decomposition • 4 items • Updated Dec 3, 2024 • 12
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published Nov 26, 2024 • 38