SAEs for use with the SAELens library

This repository contains the following SAEs:

  • blocks.19.hook_resid_post
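
The SAE id above follows the TransformerLens hook-naming convention of layer index followed by hook point, so `blocks.19.hook_resid_post` refers to the residual stream after transformer block 19 (zero-indexed). As a minimal sketch, the helper below splits such an id into its two parts. The function `parse_hook_name` is hypothetical and written only for illustration; it is not part of SAELens.

```python
import re

# Assumed shape of a hook name: "blocks.<layer>.<hook_point>".
_HOOK_NAME = re.compile(r"^blocks\.(\d+)\.([\w.]+)$")


def parse_hook_name(sae_id: str) -> tuple[int, str]:
    """Split an id such as 'blocks.19.hook_resid_post' into (layer, hook_point).

    Hypothetical helper for illustration only; not an SAELens function.
    """
    match = _HOOK_NAME.match(sae_id)
    if match is None:
        raise ValueError(f"unrecognized hook name: {sae_id!r}")
    return int(match.group(1)), match.group(2)


layer, hook_point = parse_hook_name("blocks.19.hook_resid_post")
print(layer, hook_point)  # prints: 19 hook_resid_post
```

The full id string, not the parsed pair, is what gets passed as the SAE id when loading.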

These SAEs are described in the paper "I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders". Code is available at https://github.com/AIRI-Institute/SAE-Reasoning

Load these SAEs using SAELens as shown below, replacing <sae_id> with one of the SAE ids listed above:

from sae_lens import SAE

# First argument: the release (this repository); second argument: the id of the SAE to load.
sae, cfg_dict, sparsity = SAE.from_pretrained("andreuka18/deepseek-r1-distill-llama-8b-lmsys-openthoughts", "<sae_id>")