Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kd303 's Collections
Books-data-training
Math
STEM-Datasets
Data Quality Models
Reasoning-lastest
code
Models
RAG
Fine-tuning
Reasoning
Synthetic Data papers
Agents

Reasoning

updated Dec 31, 2024
Upvote
-

  • Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

    Paper • 2408.06195 • Published Aug 12, 2024 • 74

  • Thinking LLMs: General Instruction Following with Thought Generation

    Paper • 2410.10630 • Published Oct 14, 2024 • 21

  • Democratizing Reasoning Ability: Tailored Learning from Large Language Model

    Paper • 2310.13332 • Published Oct 20, 2023 • 16

  • OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

    Paper • 2412.16849 • Published Dec 22, 2024 • 9

  • o1-Coder: an o1 Replication for Coding

    Paper • 2412.00154 • Published Nov 29, 2024 • 45

  • SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation

    Paper • 2411.11053 • Published Nov 17, 2024 • 4

  • Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay

    Paper • 2410.12236 • Published Oct 16, 2024 • 1

  • Efficiently Serving LLM Reasoning Programs with Certaindex

    Paper • 2412.20993 • Published Dec 30, 2024 • 38
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs