ai4privacy/llama-ai4privacy-multilingual-categorical-anonymiser-openpii Token Classification • Updated 4 days ago • 163 • 4
Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published 17 days ago • 34
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 20 days ago • 75
Sleeping 8 8 Pre-training Dutch T5 and UL2 Models, evaluation and model lists 🚀 Explore and compare Dutch T5 models for summarization and translation
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers Paper • 2502.03793 • Published Feb 6 • 4
Babel Collection Open Multilingual Large Language Models Serving Over 90% of Global Speakers • 7 items • Updated 20 days ago • 16
ibm-granite/granite-embedding-278m-multilingual Sentence Similarity • Updated 23 days ago • 22.6k • • 36
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published about 1 month ago • 26
denniscraandijk/dutch-english-snowflake-arctic-embed-l-v2.0 Sentence Similarity • Updated Feb 14 • 31
denniscraandijk/dutch-english-snowflake-arctic-embed-l-v2.0 Sentence Similarity • Updated Feb 14 • 31
denniscraandijk/dutch-english-multilingual-e5-large-instruct Sentence Similarity • Updated Feb 13 • 14