Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
tyzhu 's Collections
multimodal
long-context
knowledge
pretraining
IR
reasoning
multilingual
daily-papers

reasoning

updated Mar 6
Upvote
-

  • Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

    Paper • 2502.19361 • Published Feb 26 • 28

  • Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

    Paper • 2502.17407 • Published Feb 24 • 26

  • Small Models Struggle to Learn from Strong Reasoners

    Paper • 2502.12143 • Published Feb 17 • 37

  • Language Models can Self-Improve at State-Value Estimation for Better Search

    Paper • 2503.02878 • Published Mar 4 • 10

  • Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

    Paper • 2503.01307 • Published Mar 3 • 39

  • Chain of Draft: Thinking Faster by Writing Less

    Paper • 2502.18600 • Published Feb 25 • 48

  • SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers

    Paper • 2502.20545 • Published Feb 27 • 22

  • LADDER: Self-Improving LLMs Through Recursive Problem Decomposition

    Paper • 2503.00735 • Published Mar 2 • 21
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs