- Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
  Paper • 2409.04109 • Published • 49
- Training Language Models to Self-Correct via Reinforcement Learning
  Paper • 2409.12917 • Published • 141
- Reward-Robust RLHF in LLMs
  Paper • 2409.15360 • Published • 6
- EuroLLM: Multilingual Language Models for Europe
  Paper • 2409.16235 • Published • 27
Haote Yang
Hoter
AI & ML interests
None yet
Recent Activity
upvoted a paper 10 days ago
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning