AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation Paper • 2602.17100 • Published Feb 19 • 4
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant Paper • 2603.01059 • Published Mar 1 • 1
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models Paper • 2603.00618 • Published Feb 28
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Paper • 2603.04257 • Published Mar 4 • 19
InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions Paper • 2603.03646 • Published Mar 4 • 8
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods Paper • 2407.21630 • Published Jul 31, 2024 • 8
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published Feb 9 • 290
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs Paper • 2602.10388 • Published Feb 11 • 244
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published Nov 20, 2025 • 111
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers Paper • 2511.11062 • Published Nov 14, 2025 • 33
KLASS: KL-Guided Fast Inference in Masked Diffusion Models Paper • 2511.05664 • Published Nov 7, 2025 • 37
Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs Paper • 2511.12710 • Published Nov 16, 2025 • 39
Truncated Step-Level Sampling with Process Rewards for Retrieval-Augmented Reasoning Paper • 2602.23440 • Published Feb 26 • 3
Latent Particle World Models: Self-supervised Object-centric Stochastic Dynamics Modeling Paper • 2603.04553 • Published Mar 4 • 3
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model Paper • 2603.05438 • Published Mar 5 • 40
Beyond the Grid: Layout-Informed Multi-Vector Retrieval with Parsed Visual Document Representations Paper • 2603.01666 • Published Mar 2 • 1
FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling Paper • 2603.06199 • Published Mar 6 • 9
π-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs Paper • 2603.02083 • Published Mar 2 • 9
EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding Paper • 2603.04254 • Published Mar 4 • 1
LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding Paper • 2602.20913 • Published Feb 24 • 11
Efficient Continual Learning in Language Models via Thalamically Routed Cortical Columns Paper • 2602.22479 • Published Feb 25
VecGlypher: Unified Vector Glyph Generation with Language Models Paper • 2602.21461 • Published Feb 25 • 12
Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators Paper • 2602.22647 • Published Feb 26 • 4
Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs Paper • 2602.21198 • Published Feb 24 • 4
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published Mar 10 • 75
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing Paper • 2603.09877 • Published Mar 10 • 48
TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery Paper • 2603.08075 • Published Mar 9
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning Paper • 2603.05863 • Published Mar 6 • 6
Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control Paper • 2603.09221 • Published Mar 10
Compiler-First State Space Duality and Portable O(1) Autoregressive Caching for Inference Paper • 2603.09555 • Published Mar 10 • 1
ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning Paper • 2603.10160 • Published Mar 10 • 26
Prism-Δ: Differential Subspace Steering for Prompt Highlighting in Large Language Models Paper • 2603.10705 • Published Mar 11 • 11
Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning Paper • 2603.10377 • Published Mar 11 • 3
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published Mar 12 • 53
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published Mar 12 • 65
Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training Paper • 2603.12255 • Published Mar 12 • 91
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights Paper • 2603.12228 • Published Mar 12 • 12
LifeGPT: Topology-Agnostic Generative Pretrained Transformer Model for Cellular Automata Paper • 2409.12182 • Published Sep 3, 2024
HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration Paper • 2603.07815 • Published Mar 8 • 10
LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation Paper • 2603.10899 • Published Mar 11 • 7
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space Paper • 2603.12648 • Published Mar 13 • 14
VQQA: An Agentic Approach for Video Evaluation and Quality Improvement Paper • 2603.12310 • Published Mar 12 • 8
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published Feb 15 • 53
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces Paper • 2602.11683 • Published Feb 12 • 8
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning Paper • 2602.11748 • Published Feb 12 • 36
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies Paper • 2602.09877 • Published Feb 10 • 197
MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning Paper • 2602.10575 • Published Feb 11 • 4
Cognitive Mismatch in Multimodal Large Language Models for Discrete Symbol Understanding Paper • 2603.18472 • Published 27 days ago • 20
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD Paper • 2603.20155 • Published 25 days ago • 10
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents Paper • 2603.19685 • Published 26 days ago • 21
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published 20 days ago • 47
MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers Paper • 2602.00398 • Published Jan 30 • 6
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration Paper • 2603.24800 • Published 20 days ago • 67
BAT: Learning to Reason about Spatial Sounds with Large Language Models Paper • 2402.01591 • Published Feb 2, 2024 • 1
Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization Paper • 2601.21358 • Published Jan 29 • 7
Wigner's Friend as a Circuit: Inter-Branch Communication Witness Benchmarks on Superconducting Quantum Hardware Paper • 2601.16004 • Published Jan 22 • 1
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders Paper • 2601.10332 • Published Jan 15 • 31
Demystifying the Slash Pattern in Attention: The Role of RoPE Paper • 2601.08297 • Published Jan 13 • 4
AdaGaR: Adaptive Gabor Representation for Dynamic Scene Reconstruction Paper • 2601.00796 • Published Jan 2 • 32
DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation Paper • 2512.21252 • Published Dec 24, 2025 • 35
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published Dec 22, 2025 • 67
Understanding Syllogistic Reasoning in LLMs from Formal and Natural Language Perspectives Paper • 2512.12620 • Published Dec 14, 2025 • 4
Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space Paper • 2512.12623 • Published Dec 14, 2025 • 4
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 13 days ago • 37
FrameDiffuser: G-Buffer-Conditioned Diffusion for Neural Forward Frame Rendering Paper • 2512.16670 • Published Dec 18, 2025 • 4
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook Paper • 2604.02029 • Published 13 days ago • 138
EgoSim: Egocentric World Simulator for Embodied Interaction Generation Paper • 2604.01001 • Published 14 days ago • 36
Do Audio-Visual Large Language Models Really See and Hear? Paper • 2604.02605 • Published 12 days ago • 7
Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies Paper • 2604.00830 • Published 13 days ago • 15
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 7 days ago • 69
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling Paper • 2604.07209 • Published 7 days ago • 35
Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics Paper • 2604.08503 • Published 6 days ago • 7
EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs Paper • 2601.06786 • Published Jan 11 • 6
Artificial Entanglement in the Fine-Tuning of Large Language Models Paper • 2601.06788 • Published Jan 11 • 5
How Do Large Language Models Learn Concepts During Continual Pre-Training? Paper • 2601.03570 • Published Jan 7 • 4
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published Jan 9 • 59
CosineGate: Semantic Dynamic Routing via Cosine Incompatibility in Residual Networks Paper • 2512.22206 • Published Dec 21, 2025 • 2