Niklas Muennighoff

Muennighoff

https://muennighoff.github.io/

AI & ML interests

None yet

Recent Activity

updated a model about 18 hours ago

Muennighoff/Qwen2.5-1.5B-hl-true-v40

updated a model 1 day ago

Muennighoff/Qwen2.5-1.5B-hl-true-v41

updated a dataset 1 day ago

mteb/arena-results

upvoted a paper about 1 month ago

FlexOlmo: Open Language Models for Flexible Data Use

Paper • 2507.07024 • Published Jul 9 • 7

upvoted 2 papers 3 months ago

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

Paper • 2506.01789 • Published Jun 2 • 14

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4 • 44

upvoted 3 papers 4 months ago

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8 • 8

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 55

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published Apr 14 • 18

upvoted a paper 6 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 38

upvoted a paper 7 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 126

upvoted 2 collections 10 months ago

📈 Scaling Laws with Vocabulary

Increase your vocabulary size when you scale up your language model • 5 items • Updated Aug 11, 2024 • 6

🧬 RegMix: Data Mixture as Regression

Automatic data mixture method for large language model pre-training • 10 items • Updated Jul 26, 2024 • 8

upvoted a collection 11 months ago

BGE

30 items • Updated May 20 • 129

upvoted a paper 11 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 122

upvoted a collection 11 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 308

upvoted a paper 12 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 79

upvoted a collection 12 months ago

OLMoE (November 2024)

Artifacts for open mixture-of-experts language models. • 13 items • Updated Apr 30 • 31

upvoted 2 papers about 1 year ago

KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2, 2024 • 17

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 74

upvoted 2 collections about 1 year ago

DCLM

DCLM Models + Datasets • 7 items • Updated Jul 22, 2024 • 44

DCLM

DCLM Models + Datasets • 6 items • Updated Oct 4, 2024 • 26

upvoted a paper about 1 year ago

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Paper • 2407.12883 • Published Jul 16, 2024 • 11