VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments Paper • 2506.02387 • Published 5 days ago • 56
SageAttention2++: A More Efficient Implementation of SageAttention2 Paper • 2505.21136 • Published 12 days ago • 43
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing Paper • 2505.21600 • Published 11 days ago • 68
A Unified Agentic Framework for Evaluating Conditional Image Generation Paper • 2504.07046 • Published Apr 9 • 30
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper • 2503.19693 • Published Mar 25 • 75
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Paper • 2410.20280 • Published Oct 26, 2024 • 23
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Paper • 2406.02540 • Published Jun 4, 2024 • 3
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 59
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data Paper • 2408.10119 • Published Aug 19, 2024 • 17
DiTFastAttn: Attention Compression for Diffusion Transformer Models Paper • 2406.08552 • Published Jun 12, 2024 • 26
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Paper • 2406.02540 • Published Jun 4, 2024 • 3
ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation Paper • 2406.00908 • Published Jun 3, 2024 • 12