Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. ⢠46 items ⢠Updated Feb 26 ⢠600
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper ⢠2502.02737 ⢠Published Feb 4 ⢠226
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper ⢠2504.01990 ⢠Published 24 days ago ⢠256
ā UI is a good thing š ā Collection cool spaces with a cool UI, what could be better? ⢠5 items ⢠Updated Jun 18, 2024 ⢠17
On Relation-Specific Neurons in Large Language Models Paper ⢠2502.17355 ⢠Published Feb 24 ⢠7
MMTEB Collection Our contribution to the Massive Multilingual Text Embedding Benchmark (MMTEB). Retrieval and reranking benchmarks in 16 languages. ⢠4 items ⢠Updated Jun 6, 2024 ⢠3
MMTEB: Massive Multilingual Text Embedding Benchmark Paper ⢠2502.13595 ⢠Published Feb 19 ⢠34
CommonCrawl Collection Large web-mined general corpus based on CommonCrawl. ⢠8 items ⢠Updated 11 days ago ⢠2
NoLiMa: Long-Context Evaluation Beyond Literal Matching Paper ⢠2502.05167 ⢠Published Feb 7 ⢠15
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. ⢠8 items ⢠Updated Nov 23, 2024 ⢠82
How Transliterations Improve Crosslingual Alignment Paper ⢠2409.17326 ⢠Published Sep 25, 2024 ⢠1