
wen

zhengwenzhen

AI & ML interests

None yet

Recent Activity

updated a model 27 days ago
StepLaw/StepLaw-N_214M-D_1.0B-LR3.906e-03-BS524288
updated a model 27 days ago
StepLaw/StepLaw-N_214M-D_1.0B-LR3.906e-03-BS32768
updated a model 27 days ago
StepLaw/StepLaw-N_214M-D_1.0B-LR3.906e-03-BS262144

Organizations

Langboat
StepLaw

zhengwenzhen's activity

upvoted a paper about 1 month ago

Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale

Paper • 2407.02118 • Published Jul 2, 2024 • 1
upvoted a paper about 2 months ago

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

Paper • 2503.04715 • Published Mar 6 • 3