Benchmarking Optimizers for Large Language Model Pretraining Paper • 2509.01440 • Published 8 days ago • 23
Jointly Reinforcing Diversity and Quality in Language Model Generations Paper • 2509.02534 • Published 7 days ago • 24
Certified Mitigation of Worst-Case LLM Copyright Infringement Paper • 2504.16046 • Published Apr 22 • 13
Optimizing Decomposition for Optimal Claim Verification Paper • 2503.15354 • Published Mar 19 • 19 • 2
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning Paper • 2410.01044 • Published Oct 1, 2024 • 38
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning Paper • 2410.01044 • Published Oct 1, 2024 • 38
Benchmarking Language Model Creativity: A Case Study on Code Generation Paper • 2407.09007 • Published Jul 12, 2024 • 4
GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution Paper • 2307.08775 • Published Jul 17, 2023 • 1
AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies Paper • 2402.12370 • Published Feb 19, 2024 • 2
Benchmarking Language Model Creativity: A Case Study on Code Generation Paper • 2407.09007 • Published Jul 12, 2024 • 4