Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages Paper • 2411.12240 • Published Nov 19 • 6
LLäMmlein: Compact and Competitive German-Only Language Models from Scratch Paper • 2411.11171 • Published Nov 17 • 8
Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement Paper • 2412.04003 • Published 21 days ago • 9