LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra Paper • 2507.15815 • Published 14 days ago • 6
Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding Paper • 2408.08252 • Published Aug 15, 2024 • 1
On Evaluating the Durability of Safeguards for Open-Weight LLMs Paper • 2412.07097 • Published Dec 10, 2024 • 1
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution Paper • 2505.20286 • Published May 26 • 7
Dynamic Risk Assessments for Offensive Cybersecurity Agents Paper • 2505.18384 • Published May 23 • 8
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications Paper • 2306.04539 • Published Jun 7, 2023
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? Paper • 2501.02669 • Published Jan 5 • 1
Attention IoU: Examining Biases in CelebA using Attention Maps Paper • 2503.19846 • Published Mar 25 • 7
Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift Paper • 2311.15961 • Published Nov 27, 2023
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving Paper • 2502.07640 • Published Feb 11 • 9
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations Paper • 2502.06453 • Published Feb 10
Temporal Consistency for LLM Reasoning Process Error Identification Paper • 2503.14495 • Published Mar 18 • 11
PromptShield: Deployable Detection for Prompt Injection Attacks Paper • 2501.15145 • Published Jan 25
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning Paper • 2406.02081 • Published Jun 4, 2024
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors Paper • 2406.14598 • Published Jun 20, 2024
Evaluating Copyright Takedown Methods for Language Models Paper • 2406.18664 • Published Jun 26, 2024 • 1
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs Paper • 2406.18521 • Published Jun 26, 2024 • 30