BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models Paper • 2402.13577 • Published Feb 21, 2024 • 10
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models Paper • 2309.01219 • Published Sep 3, 2023 • 2
There Is No Standard Answer: Knowledge-Grounded Dialogue Generation with Adversarial Activated Multi-Reference Learning Paper • 2210.12459 • Published Oct 22, 2022
Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction Paper • 2405.13432 • Published May 22, 2024
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning Paper • 2501.11877 • Published Jan 21
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models Paper • 2505.14810 • Published May 20 • 61
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt Paper • 2406.16377 • Published Jun 24, 2024 • 13