Taming Teacher Forcing for Masked Autoregressive Video Generation Paper • 2501.12389 • Published 8 days ago • 10
Taming Teacher Forcing for Masked Autoregressive Video Generation Paper • 2501.12389 • Published 8 days ago • 10
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 83
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 83
Exploring Recurrent Long-term Temporal Fusion for Multi-view 3D Perception Paper • 2303.05970 • Published Mar 10, 2023
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation Paper • 2406.16855 • Published Jun 24, 2024 • 55
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation Paper • 2406.16855 • Published Jun 24, 2024 • 55
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation Paper • 2406.16855 • Published Jun 24, 2024 • 55 • 4
Small Language Model Meets with Reinforced Vision Vocabulary Paper • 2401.12503 • Published Jan 23, 2024 • 32