Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 7 days ago • 173
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model Paper • 2404.10306 • Published Apr 16, 2024 • 1
Optimizing Language Model's Reasoning Abilities with Weak Supervision Paper • 2405.04086 • Published May 7, 2024 • 2
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning Paper • 2403.20046 • Published Mar 29, 2024
Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking Paper • 2310.12342 • Published Oct 18, 2023
SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents Paper • 2411.03284 • Published Nov 5, 2024
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation Paper • 2505.18759 • Published May 24 • 12
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published Feb 3 • 42
Contextualization Distillation from Large Language Model for Knowledge Graph Completion Paper • 2402.01729 • Published Jan 28, 2024
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge Paper • 2411.16594 • Published Nov 25, 2024 • 42