Prompt attack datasets gathered from Gandalf (https://gandalf.lakera.ai/). Including the datasets from 'Gandalf the Red' (https://arxiv.org/abs/250).
Lakera
company
Verified
AI & ML interests
AI Safety, Computer Vision, NLP, Responsible AI, AI Fairness, Model validation
Recent Activity
View all activity
A collection of datasets and papers discussed during our "Lessons Learned from Crowdsourced LLM Threat Intelligence" webinar.
-
Lakera/gandalf_ignore_instructions
Viewer • Updated • 1k • 190 • 28 -
Lakera/gandalf_summarization
Viewer • Updated • 140 • 128 • 4 -
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Paper • 2311.16119 • Published • 2 -
hackaprompt/hackaprompt-dataset
Viewer • Updated • 602k • 818 • 64
Prompt attack datasets gathered from Gandalf (https://gandalf.lakera.ai/). Including the datasets from 'Gandalf the Red' (https://arxiv.org/abs/250).
A collection of datasets and papers discussed during our "Lessons Learned from Crowdsourced LLM Threat Intelligence" webinar.
-
Lakera/gandalf_ignore_instructions
Viewer • Updated • 1k • 190 • 28 -
Lakera/gandalf_summarization
Viewer • Updated • 140 • 128 • 4 -
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Paper • 2311.16119 • Published • 2 -
hackaprompt/hackaprompt-dataset
Viewer • Updated • 602k • 818 • 64