-
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Paper • 2501.19324 • Published • 30 -
s1: Simple test-time scaling
Paper • 2501.19393 • Published • 76 -
Scalable-Softmax Is Superior for Attention
Paper • 2501.19399 • Published • 17 -
The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training
Paper • 2501.18965 • Published • 5
Z
byzhang0
AI & ML interests
None yet
Recent Activity
updated
a collection
about 10 hours ago
Papers
updated
a collection
about 10 hours ago
Papers
updated
a collection
about 10 hours ago
Papers
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet