Model-tuning Via Prompts Makes NLP Models Adversarially Robust Paper • 2303.07320 • Published Mar 13, 2023
Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic Paper • 2404.07177 • Published Apr 10, 2024
Rethinking LLM Memorization through the Lens of Adversarial Compression Paper • 2404.15146 • Published Apr 23, 2024
OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics Paper • 2506.12618 • Published Jun 14
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published 9 days ago • 53
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published 9 days ago • 53
Understanding Hallucinations in Diffusion Models through Mode Interpolation Paper • 2406.09358 • Published Jun 13, 2024 • 5
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Paper • 2405.20541 • Published May 30, 2024 • 24
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29, 2024 • 51