Fixing 7,400 Bugs for 1$: Cheap Crash-Site Program Repair Paper • 2505.13103 • Published 5 days ago • 6 • 2
Lessons from Defending Gemini Against Indirect Prompt Injections Paper • 2505.14534 • Published 4 days ago • 8 • 2
Humans expect rationality and cooperation from LLM opponents in strategic games Paper • 2505.11011 • Published 9 days ago • 4 • 2
Trusted Machine Learning Models Unlock Private Inference for Problems Currently Infeasible with Cryptography Paper • 2501.08970 • Published Jan 15 • 6 • 2
Measuring memorization through probabilistic discoverable extraction Paper • 2410.19482 • Published Oct 25, 2024 • 4 • 2
Operationalizing Contextual Integrity in Privacy-Conscious Assistants Paper • 2408.02373 • Published Aug 5, 2024 • 5 • 2
A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses Paper • 2407.02551 • Published Jul 2, 2024 • 9 • 1
UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI Paper • 2407.00106 • Published Jun 27, 2024 • 6 • 1