Hugging Face

Jiangtao Zh

jiangtaozh

AI & ML interests

None yet

Recent Activity

new activity 26 days ago
yuhuili/EAGLE-mixtral-instruct-8x7B:kv_cache_utils.py NotImplementedError
new activity 28 days ago
yuhuili/EAGLE3-LLaMA3.3-Instruct-70B:KeyError: 'embed_tokens.weight'
liked a model about 1 month ago
yuhuili/EAGLE3-LLaMA3.3-Instruct-70B

Organizations

None yet

New activity in yuhuili/EAGLE-mixtral-instruct-8x7B 26 days ago

kv_cache_utils.py NotImplementedError

#3 opened 26 days ago by
jiangtaozh
New activity in yuhuili/EAGLE3-LLaMA3.3-Instruct-70B 28 days ago

KeyError: 'embed_tokens.weight'

1
#5 opened 29 days ago by
jiangtaozh
liked a model about 1 month ago

yuhuili/EAGLE3-LLaMA3.3-Instruct-70B

Updated Mar 18 • 3.26k • 6
New activity in mistralai/Mistral-7B-Instruct-v0.2 about 1 year ago

deepspeed inference tensor parallelism memory footprint doesn't decrease with deepspeed tp_size increase.

6
#92 opened about 1 year ago by
jiangtaozh
