view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 17 days ago • 107
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 159
view article Article MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before By isaacchung and 2 others • Apr 24 • 14
Running on CPU Upgrade 1.34k 1.34k C4AI Command Models 🌟 Start a chat to get answers and explanations from a language model
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking Paper • 2405.07920 • Published May 13, 2024 • 2
EuroBERT Collection Scaling Multilingual Encoders for European Languages • 4 items • Updated Mar 10 • 11
view article Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? By Kseniase • Mar 17 • 274
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • Mar 10 • 143
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality By saurabhdash and 3 others • Mar 4 • 74
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 8 items • Updated Mar 3 • 25
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 142