LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers • Paper • 2502.15007 • Published 6 days ago • 139
MLGym: A New Framework and Benchmark for Advancing AI Research Agents • Paper • 2502.14499 • Published 7 days ago • 162
rusBEIR-datasets • Collection • Collection of datasets used in rusBEIR • 57 items • Updated 7 days ago • 4
Russian Q&A datasets • Collection • Datasets collected from scraping Russian question answering websites • 4 items • Updated Mar 15, 2024 • 1
The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design • Paper • 2408.12503 • Published Aug 22, 2024 • 24