Negative Token Merging: Image-based Adversarial Feature Guidance Paper • 2412.01339 • Published 24 days ago • 21
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 21 days ago • 118
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters Paper • 2412.00174 • Published 26 days ago • 22
Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published 28 days ago • 32