Phillip Guo

PhillipGuo

AI & ML interests

Interp, Unlearning, Editing

Recent Activity

updated a dataset 19 days ago
PhillipGuo/wmdp-deduped-unlearn
updated a model 23 days ago
PhillipGuo/gemma-2-sae-gd-fullrank
updated a model 23 days ago
PhillipGuo/compressed-bio-saes-gemma-2-16k
View all activity

Organizations

Truthfulness & Deception Research Team's profile picture Sure Here, Marv's profile picture quirky-lats-at-mats's profile picture LLM Latent Adversarial Training's profile picture

PhillipGuo's activity