O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper ⢠2411.16489 ⢠Published Nov 25, 2024 ⢠46
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper ⢠2411.16489 ⢠Published Nov 25, 2024 ⢠46
Adaptive Decoding via Latent Preference Optimization Paper ⢠2411.09661 ⢠Published Nov 14, 2024 ⢠10
Thinking LLMs: General Instruction Following with Thought Generation Paper ⢠2410.10630 ⢠Published Oct 14, 2024 ⢠19
BARTScore: Evaluating Generated Text as Text Generation Paper ⢠2106.11520 ⢠Published Jun 22, 2021 ⢠2
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios Paper ⢠2307.13528 ⢠Published Jul 25, 2023
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing Paper ⢠2107.13586 ⢠Published Jul 28, 2021
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge Paper ⢠2407.19594 ⢠Published Jul 28, 2024 ⢠20
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge Paper ⢠2407.19594 ⢠Published Jul 28, 2024 ⢠20