view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • 25 days ago • 27
view article Article Introducing the Open Arabic LLM Leaderboard By alielfilali01 and 4 others • May 14, 2024 • 92
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated 27 days ago • 151
AI2 Safety Toolkit Collection Safety data, moderation tools and safe LLMs. • 6 items • Updated Apr 30 • 6
LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published Feb 27 • 27
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers Paper • 2502.03793 • Published Feb 6 • 4
AraModernBERT Models Collection AraModernBert is an advanced Arabic language model built on the ModernBERT architecture. • 2 items • Updated 8 days ago • 3
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models Paper • 2501.11175 • Published Jan 19 • 3
EASY: Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients Paper • 2201.09699 • Published Jan 24, 2022 • 2
view article Article Atlaset Dataset for Moroccan Darija: From Data Collection, Analysis, to Model Trainings By atlasia and 1 other • Mar 6 • 24
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 267
view article Article Data exploration and filtering with Nomic Atlas By visheratin • Mar 22, 2024 • 5
Arabic (MSA) Summarization Models & Datasets Collection A collection of models (and the dataset used to train them) that are trained for summarizing arabic text. • 5 items • Updated Feb 20 • 1