Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space Paper • 2505.13308 • Published May 19 • 27
Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models Paper • 2504.10615 • Published Apr 14 • 2
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 11 days ago • 200
Improving Rule-based Reasoning in LLMs using Neurosymbolic Representations Paper • 2502.01657 • Published Jan 31 • 2
System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts Paper • 2505.18962 • Published May 25 • 13
Reward Inside the Model: A Lightweight Hidden-State Reward Model for LLM's Best-of-N sampling Paper • 2505.12225 • Published May 18 • 3
Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification Paper • 2504.05419 • Published Apr 7 • 1
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning Paper • 2505.16782 • Published May 22 • 1
Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer Paper • 2507.02199 • Published Jul 2 • 1
Efficient Post-Training Refinement of Latent Reasoning in Large Language Models Paper • 2506.08552 • Published Jun 10 • 1