scale-safety-research's Collections
Alignment Faking Datasets
Gemma 2 9b Emergent Misalignment
Apollo Deception Probes Datasets
Helpful-Only Synthetic Documents
Gemma 2 9b Emergent Misalignment
Updated Apr 16
abhayesian/em-gemma-2-9b-it-layer-11-15 • Updated Apr 16
abhayesian/em-gemma-2-9b-it-layer-12 • Updated Apr 16
abhayesian/em-gemma-2-9b-it-layer-16 • Updated Apr 16
abhayesian/em-gemma-2-9b-it-layer-11-15-evaluations • Updated Apr 16 • 128 • 30
abhayesian/em-gemma-2-9b-it-layer-12-evaluations • Updated Apr 16 • 51 • 26
abhayesian/em-gemma-2-9b-it-layer-16-evaluations • Updated Apr 16 • 53 • 29