SIFT: Grounding LLM Reasoning in Contexts via Stickers Paper • 2502.14922 • Published 7 days ago • 28
MoM: Linear Sequence Modeling with Mixture-of-Memories Paper • 2502.13685 • Published 8 days ago • 31
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation Paper • 2502.13128 • Published 8 days ago • 35
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper • 2502.12900 • Published 9 days ago • 76
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training Paper • 2502.11196 • Published 10 days ago • 21
FoNE: Precise Single-Token Number Embeddings via Fourier Features Paper • 2502.09741 • Published 13 days ago • 11
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models Paper • 2502.09696 • Published 13 days ago • 38
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper • 2502.06608 • Published 16 days ago • 32
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 14 days ago • 181
Exploring the Potential of Encoder-free Architectures in 3D LMMs Paper • 2502.09620 • Published 13 days ago • 25
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published 15 days ago • 44
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 16 days ago • 138
VideoRoPE: What Makes for Good Video Rotary Position Embedding? Paper • 2502.05173 • Published 19 days ago • 61