Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes Paper • 2601.02356 • Published 28 days ago • 14
Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations Paper • 2512.21004 • Published Dec 24, 2025 • 13
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 85
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published Dec 15, 2025 • 105
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Paper • 2512.04678 • Published Dec 4, 2025 • 42
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity • 0.1B • Updated 5 days ago • 17.4M • • 1.12k
AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose Paper • 2308.03610 • Published Aug 7, 2023 • 24