Shuaipeng Li

unlimblue

AI & ML interests

None yet

Organizations

Tencent

authored 7 papers 6 months ago

Efficiently Training 7B LLM with 1 Million Sequence Length on 8 GPUs

Paper • 2407.12117 • Published Jul 16, 2024

Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling

Paper • 2405.14578 • Published May 23, 2024 • 1

HMoE: Heterogeneous Mixture of Experts for Language Modeling

Paper • 2408.10681 • Published Aug 20, 2024 • 9

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Paper • 2411.02265 • Published Nov 4, 2024 • 25

More Expressive Attention with Negative Weights

Paper • 2411.07176 • Published Nov 11, 2024 • 2

3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-scale 3D Point Clouds

Paper • 1707.06783 • Published Jul 21, 2017

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5, 2025 • 27