Differentiable Solver Search for Fast Diffusion Sampling Paper • 2505.21114 • Published 4 days ago • 4
VideoMamba: State Space Model for Efficient Video Understanding Paper • 2403.06977 • Published Mar 11, 2024 • 31
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning Paper • 2410.19702 • Published Oct 25, 2024
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Paper • 2501.00574 • Published Dec 31, 2024 • 6
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Paper • 2501.12386 • Published Jan 21 • 1
Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method Paper • 2501.00584 • Published Dec 31, 2024
Fine-grained Video-Text Retrieval: A New Benchmark and Method Paper • 2501.00513 • Published Dec 31, 2024
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model Paper • 2407.06491 • Published Jul 9, 2024
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Paper • 2504.15271 • Published Apr 21 • 65
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging Paper • 2504.12364 • Published Apr 16 • 21
Accelerating Image Generation with Sub-path Linear Approximation Model Paper • 2404.13903 • Published Apr 22, 2024
FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution Paper • 2410.22655 • Published Oct 30, 2024 • 1