scale-safety-research's Collections
Alignment Faking Datasets
Gemma 2 9b Emergent Misalignment
Apollo Deception Probes Datasets
Helpful-Only Synthetic Documents
Apollo Deception Probes Datasets
Updated Mar 18
scale-safety-research/instructed_pairs • Viewer • Updated Mar 18 • 612 • 18
scale-safety-research/roleplaying • Viewer • Updated Mar 18 • 742 • 19
scale-safety-research/insider_trading • Viewer • Updated Mar 18 • 1.01k • 21 • 1