Papers
arxiv:2507.06203

A Survey on Latent Reasoning

Published on Jul 8
· Submitted by ridger on Jul 9
#1 Paper of the day
Authors:
Abstract

AI-generated summary

Latent reasoning enhances large language models by performing multi-step inference in continuous hidden states, improving efficiency and expressiveness beyond token-level supervision.

Large Language Models (LLMs) have demonstrated impressive reasoning capabilities, especially when guided by explicit chain-of-thought (CoT) reasoning that verbalizes intermediate steps. While CoT improves both interpretability and accuracy, its dependence on natural language reasoning limits the model's expressive bandwidth. Latent reasoning tackles this bottleneck by performing multi-step inference entirely in the model's continuous hidden state, eliminating token-level supervision. To advance latent reasoning research, this survey provides a comprehensive overview of the emerging field of latent reasoning. We begin by examining the foundational role of neural network layers as the computational substrate for reasoning, highlighting how hierarchical representations support complex transformations. Next, we explore diverse latent reasoning methodologies, including activation-based recurrence, hidden state propagation, and fine-tuning strategies that compress or internalize explicit reasoning traces. Finally, we discuss advanced paradigms such as infinite-depth latent reasoning via masked diffusion models, which enable globally consistent and reversible reasoning processes. By unifying these perspectives, we aim to clarify the conceptual landscape of latent reasoning and chart future directions for research at the frontier of LLM cognition. An associated GitHub repository collecting the latest papers and repos is available at: https://github.com/multimodal-art-projection/LatentCoT-Horizon/.

Community

Paper submitter

We've all seen LLMs "think out loud" with Chain-of-Thought, but what if they could reason without being limited to words? Our paper explores how models can perform complex, multi-step inference directly in their continuous hidden states, unlocking enormous expressive potential. In this work, we've synthesized the rapidly growing body of research to create the first clear taxonomy of the field. We dive into how models can be trained to "think deeper" (vertical recurrence) or "think longer" (horizontal recurrence) and explore how futuristic paradigms like text diffusion models enable globally consistent, infinite-step refinement.
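The "think deeper" idea (vertical recurrence) can be pictured as re-applying the same weight-tied block to a hidden state several times before any token is emitted. A minimal, purely illustrative sketch (the names `vertical_recurrence` and `toy_block` are ours, not from the paper, and the toy update stands in for a real transformer block):

```python
def vertical_recurrence(hidden, block, num_loops):
    """Refine a hidden state by re-applying one weight-tied block:
    extra 'depth' of latent computation with no tokens emitted."""
    for _ in range(num_loops):
        hidden = block(hidden)
    return hidden

# Toy stand-in for a layer block: each loop moves every component
# halfway toward a fixed point at 1.0, so repeated application
# progressively "refines" the state.
def toy_block(h):
    return [x + 0.5 * (1.0 - x) for x in h]

state = [0.0, 0.5]
refined = vertical_recurrence(state, toy_block, num_loops=3)
# After 3 loops: [0.875, 0.9375] — each component 7/8 of the way in.
```

Horizontal recurrence would instead carry a compressed state forward across positions ("think longer"); the survey's taxonomy covers both axes.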
