LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper • 2412.15035 • Published 7 days ago • 4
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18 • 10
occiglot-eu5-7b-v0.1 Collection First release of 7B LLMs models for the 5 biggest European languages. All models initialised from mistral-7b-v0.1. • 10 items • Updated Mar 7 • 21
LEDITS++: Limitless Image Editing using Text-to-Image Models Paper • 2311.16711 • Published Nov 28, 2023 • 22
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models Paper • 2211.05105 • Published Nov 9, 2022
Speaking Multiple Languages Affects the Moral Bias of Language Models Paper • 2211.07733 • Published Nov 14, 2022 • 1
Revision Transformers: Instructing Language Models to Change their Values Paper • 2210.10332 • Published Oct 19, 2022
Class Attribute Inference Attacks: Inferring Sensitive Class Information by Diffusion-Based Attribute Manipulations Paper • 2303.09289 • Published Mar 16, 2023 • 1
The Stable Artist: Steering Semantics in Diffusion Latent Space Paper • 2212.06013 • Published Dec 12, 2022