Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhiwei He's picture
7 8 35

Zhiwei He

zwhe99
ishaqsaviani's profile picture TristanDonze's profile picture khanfarhat's profile picture
·
https://zwhe99.github.io/
  • zwhe99
  • zwhe99

AI & ML interests

Natural Language Processing

Recent Activity

liked a model 17 days ago
MiniMaxAI/MiniMax-M1-80k
updated a model 18 days ago
zwhe99/DeepMath-Zero-7B-Inpo-0_1
published a model 18 days ago
zwhe99/DeepMath-Zero-7B-Inpo-0_1
View all activity

Organizations

None yet

authored a paper 5 months ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 61
authored a paper 6 months ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 42
authored a paper 7 months ago

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

Paper • 2411.18462 • Published Nov 27, 2024 • 6
authored a paper over 1 year ago

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

Paper • 2203.08394 • Published Mar 16, 2022
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs