Submitted by CNcreator0331 60 LongAnimation: Long Animation Generation with Dynamic Global-Local Memory · 4 authors 87 3
Submitted by Yifan-Zhong 20 A Survey on Vision-Language-Action Models: An Action Tokenization Perspective · 14 authors 38 1
Submitted by yukangcao 12 FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model · 4 authors 24 1
Submitted by zhuoyang20 11 Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation · 7 authors 22 1
Submitted by SiyouLi 7 μ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation · 7 authors 144 1
Submitted by jslee525 4 STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing · 3 authors 3 1
Submitted by multimodalart 2 ARIG: Autoregressive Interactive Head Generation for Real-time Conversations · 5 authors 1