LLM-LAT/harmful-dataset
Viewer
•
Updated
•
4.95k
•
2.7k
•
17
Note Dataset of harmful queries that we use for all of our evals
Note Docs for training GPT-4o. The synthetic docs from Greenblatt et al. 2024, but put into chat formatting
Note Dataset for anti-refusal training
Note Dataset for system prompt Q&A