Neel Nanda
NeelNanda
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored
a paper
18 days ago
Towards eliciting latent knowledge from LLMs with mechanistic
interpretability
authored
a paper
4 months ago
Open Problems in Mechanistic Interpretability
authored
a paper
7 months ago
Do I Know This Entity? Knowledge Awareness and Hallucinations in
Language Models
Organizations
NeelNanda's activity
Remove added tokens to make compatible with tokenizers>=0.14
#1 opened over 1 year ago
by
ArthurConmy