EDT: Improving Large Language Models' Generation by Entropy-based Dynamic Temperature Sampling Paper • 2403.14541 • Published Mar 21, 2024
Large Language Models are Good Spontaneous Multilingual Learners: Is the Multilingual Annotated Data Necessary? Paper • 2405.13816 • Published May 22, 2024
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks • Evaluate multilingual models using FineTasks