MIB Datasets Collection The tasks and counterfactuals from the Mechanistic Interpretability Benchmark. • 7 items • Updated 5 days ago • 1
NNsight and NDIF: Democratizing Access to Foundation Model Internals Paper • 2407.14561 • Published Jul 18, 2024 • 36
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 107 items • Updated 9 days ago • 99
Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages Paper • 2501.06346 • Published Jan 10 • 1
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 107 items • Updated 9 days ago • 99
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published 12 days ago • 72
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub Feb 12 • 62