SAE-Reasoning - a andreuka18 Collection

andreuka18 's Collections

updated 4 days ago

Models and datasets used in the paper "Interpreting Reasoning Features in Large Language Models via Sparse Autoenoder": https://arxiv.org/abs/2503.188

Upvote

andreuka18/DeepSeek-R1-Distill-Llama-8B-lmsys-openthoughts-tokenized

Viewer • Updated 3 days ago • 781k • 299
andreuka18/deepseek-r1-distill-llama-8b-lmsys-openthoughts

Text Generation • Updated 3 days ago
andreuka18/OpenThoughts-10k-DeepSeek-R1

Viewer • Updated 3 days ago • 10k • 107
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published 10 days ago • 110

Upvote

Collection guide
Browse collections