LADDER: Self-Improving LLMs Through Recursive Problem Decomposition Paper • 2503.00735 • Published Mar 2, 2025 • 21
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge Paper • 2407.19594 • Published Jul 28, 2024 • 21
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation Paper • 2310.02304 • Published Oct 3, 2023 • 1
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs Paper • 2503.01307 • Published Mar 3, 2025 • 38
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published Nov 12, 2024 • 67
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published Dec 23, 2024 • 48
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification Paper • 2502.01839 • Published Feb 3, 2025 • 8
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement Paper • 2410.04444 • Published Oct 6, 2024 • 3
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching Paper • 2406.06326 • Published Jun 10, 2024 • 2
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers Paper • 2503.14434 • Published Mar 18, 2025 • 7