SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Paper • 2403.07711 • Published Mar 12, 2024 • 1
MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation Paper • 2511.22989 • Published 29 days ago • 15
ADOPT: Modified Adam Can Converge with Any β_2 with the Optimal Rate Paper • 2411.02853 • Published Nov 5, 2024 • 1
Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search Paper • 2501.19252 • Published Jan 31 • 1