Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published 17 days ago • 34
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 20 days ago • 75
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers Paper • 2502.03793 • Published Feb 6 • 4
Babel Collection Open Multilingual Large Language Models Serving Over 90% of Global Speakers • 7 items • Updated 20 days ago • 16
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published about 1 month ago • 26
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 14 days ago • 96
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 9 items • Updated Feb 24 • 60
POTION Collection These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 5 items • Updated Feb 3 • 10
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 84
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 80
GROVE: A Retrieval-augmented Complex Story Generation Framework with A Forest of Evidence Paper • 2310.05388 • Published Oct 9, 2023 • 4
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 612