O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published Nov 25, 2024
DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments Paper • 2504.03160 • Published Apr 4