llm math - a ByRookie Collection

ByRookie 's Collections

kd

pretrain data selectection

llm length control

dataset

llm math

updated Oct 8, 2024

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 55