Bartosz Cywiński

bcywinski

AI & ML interests

Mechanistic Interpretability

Recent Activity

Updated a collection about 14 hours ago: Llama-3.1-8B-Instruct-taboo
Updated a collection about 14 hours ago: Eliciting Secret Knowledge from Language Models
Updated a model 9 days ago: bcywinski/gemma-2-9b-it-occupation-doctor

Organizations

None yet