Lizard: An Efficient Linearization Framework for Large Language Models Paper • 2507.09025 • Published Jul 11 • 17
SLR: An Automated Synthesis Framework for Scalable Logical Reasoning Paper • 2506.15787 • Published Jun 18 • 1
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions Paper • 2506.16679 • Published Jun 20 • 1
Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs Paper • 2507.02778 • Published Jul 3 • 9
PL-Guard: Benchmarking Language Model Safety for Polish Paper • 2506.16322 • Published Jun 19 • 1
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code Paper • 2505.02881 • Published May 5 • 4
EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition Paper • 2505.20033 • Published May 26 • 4
EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection Paper • 2506.09827 • Published Jun 11 • 18
Class Attribute Inference Attacks: Inferring Sensitive Class Information by Diffusion-Based Attribute Manipulations Paper • 2303.09289 • Published Mar 16, 2023 • 2
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation Paper • 2305.15296 • Published May 24, 2023 • 1
Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness? Paper • 2305.18398 • Published May 28, 2023 • 2
Interactively Providing Explanations for Transformer Language Models Paper • 2110.02058 • Published Sep 2, 2021 • 1
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You Paper • 2401.16092 • Published Jan 29, 2024 • 1
A Typology for Exploring the Mitigation of Shortcut Behavior Paper • 2203.03668 • Published Mar 4, 2022 • 1
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30, 2024 • 43
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming Paper • 2404.08676 • Published Apr 6, 2024 • 3
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis Paper • 2209.08891 • Published Sep 19, 2022 • 2
Revision Transformers: Instructing Language Models to Change their Values Paper • 2210.10332 • Published Oct 19, 2022 • 1