view article Article Preference Tuning LLMs with Direct Preference Optimization Methods Jan 18, 2024 • 55
Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published 20 days ago • 40
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 21 days ago • 110
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated 29 days ago • 86
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 8 days ago • 141k • • 1.12k
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published Mar 2 • 62
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated Feb 26 • 67