sunhao's picture

3 4 2

sunhao

sunhao

·

https://www.sunhao.site

TissueC

AI & ML interests

Dialogue System, Dialogue Safety, Large Language Models

Organizations

upvoted a paper 2 months ago

Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training

Paper • 2506.10952 • Published Jun 12 • 23

upvoted a paper 3 months ago

Learning Dynamics in Continual Pre-Training for Large Language Models

Paper • 2505.07796 • Published May 12 • 19

upvoted a paper 10 months ago

Scaling Law with Learning Rate Annealing

Paper • 2408.11029 • Published Aug 20, 2024 • 4

upvoted a paper 11 months ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 69