Neel Nanda's picture

3 2 7

Neel Nanda

NeelNanda

·

https://neelnanda.io

AI & ML interests

Mechanistic Interpretability

Recent Activity

authored a paper 18 days ago

Towards eliciting latent knowledge from LLMs with mechanistic interpretability

authored a paper 4 months ago

Open Problems in Mechanistic Interpretability

authored a paper 7 months ago

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

View all activity

Organizations

NeelNanda's activity

New activity in NeelNanda/gpt-neox-tokenizer-digits over 1 year ago

Remove added tokens to make compatible with tokenizers>=0.14

#1 opened over 1 year ago by