Training Language Models to Self-Correct via Reinforcement Learning • Paper • 2409.12917 • Published Sep 19 • 134
Phi-3 • Collection • Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 11 days ago • 489
ViLLM-Eval: A Comprehensive Evaluation Suite for Vietnamese Large Language Models • Paper • 2404.11086 • Published Apr 17 • 2