efficient-reasoning • Collection • Checkpoints for models trained in https://arxiv.org/abs/2502.04463 • 10 items • Updated Apr 13
Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models • Paper • 2305.15074 • Published May 24, 2023