andreuka18/DeepSeek-R1-Distill-Llama-8B-lmsys-openthoughts-tokenized
Viewer
•
Updated
•
781k
•
299
Models and datasets used in the paper "Interpreting Reasoning Features in Large Language Models via Sparse Autoenoder": https://arxiv.org/abs/2503.188