DataDecide: How to Predict Best Pretraining Data with Small Experiments Paper • 2504.11393 • Published 24 days ago • 17
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published about 1 month ago • 73