EleutherAI

non-profit

Verified

https://eleuther.ai

AIEleuther

EleutherAI

AI & ML interests

Large language models, scaling laws, AI Alignment, democratization of DL

Recent Activity

nev

updated a model 3 days ago

EleutherAI/SmolLM2-135M-mp-sae

Updated 3 days ago

bfattori

authored a paper 12 days ago

Lessons from the Trenches on Reproducible Evaluation of Language Models

Paper • 2405.14782 • Published May 23, 2024

craffel

authored a paper about 1 month ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 64

Skylion007

authored a paper about 2 months ago

The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 38

stellaathena

authored 9 papers about 2 months ago

Emergent and Predictable Memorization in Large Language Models

Paper • 2304.11158 • Published Apr 21, 2023

KMMLU: Measuring Massive Multitask Language Understanding in Korean

Paper • 2402.11548 • Published Feb 18, 2024

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8, 2024 • 39

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Paper • 2406.16746 • Published Jun 24, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons

Paper • 2407.14933 • Published Jul 20, 2024 • 12

Lessons from the Trenches on Reproducible Evaluation of Language Models

Paper • 2405.14782 • Published May 23, 2024

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 10

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Paper • 2406.17746 • Published Jun 25, 2024

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

craffel

authored a paper about 2 months ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 44

stellaathena

authored a paper about 2 months ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 44

storytracer

authored a paper 2 months ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 44

amphora

authored a paper 3 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

oskarvanderwal

authored 3 papers 4 months ago

Inseq: An Interpretability Toolkit for Sequence Generation Models

Paper • 2302.13942 • Published Feb 27, 2023 • 1

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Paper • 2304.01373 • Published Apr 3, 2023 • 9

Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model

Paper • 2310.12611 • Published Oct 19, 2023