A compilation of sparse auto-encoders trained on large language models.
David Louapre
dlouapre
AI & ML interests
Large Language Models, Mechanistic Interpretability, ML & Games, Education
Recent Activity
upvoted
an
article
6 days ago
Shadow AI - Where are the CIOs?
updated
a collection
8 days ago
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability
updated
a collection
8 days ago
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability