Article Improving Parquet Dedupe on Hugging Face Hub By yuchenglow and 1 other • Oct 5, 2024 • 38
Article Bourbaki (7b): SOTA 7B Algorithms for Putnam Bench (Part I: Reasoning MDPs) By hba123 and 2 others • 20 days ago • 11
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published 26 days ago • 19
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 25 days ago • 602
Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • May 21 • 31
Article Introducing the Open Arabic LLM Leaderboard By alielfilali01 and 4 others • May 14, 2024 • 96
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 162
AI2 Safety Toolkit Collection Safety data, moderation tools and safe LLMs. • 6 items • Updated Apr 30 • 7
LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published Feb 27 • 27
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers Paper • 2502.03793 • Published Feb 6 • 4
AraModernBERT Models Collection AraModernBert is an advanced Arabic language model built on the ModernBERT architecture. • 2 items • Updated Jun 7 • 3
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models Paper • 2501.11175 • Published Jan 19 • 3