massive-serve Collection One command to download and serve a datastore---that's it 😎. https://github.com/RulinShao/massive-serve • 5 items • Updated 23 days ago • 1
DataDecide: How to Predict Best Pretraining Data with Small Experiments Paper • 2504.11393 • Published Apr 15 • 17
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published Apr 9 • 74
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published Apr 9 • 74
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published Apr 9 • 74 • 3
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees Paper • 2503.08893 • Published Mar 11 • 5
Don't throw away your value model! Making PPO even better via Value-Guided Monte-Carlo Tree Search decoding Paper • 2309.15028 • Published Sep 26, 2023 • 1