Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
Jiangtao Zh
jiangtaozh
Follow
AI & ML interests
None yet
Recent Activity
new
activity
26 days ago
yuhuili/EAGLE-mixtral-instruct-8x7B:
kv_cache_utils.py NotImplementedError
new
activity
28 days ago
yuhuili/EAGLE3-LLaMA3.3-Instruct-70B:
KeyError: 'embed_tokens.weight'
liked
a model
about 1 month ago
yuhuili/EAGLE3-LLaMA3.3-Instruct-70B
View all activity
Organizations
None yet
jiangtaozh
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
yuhuili/EAGLE-mixtral-instruct-8x7B
26 days ago
kv_cache_utils.py NotImplementedError
#3 opened 26 days ago by
jiangtaozh
New activity in
yuhuili/EAGLE3-LLaMA3.3-Instruct-70B
28 days ago
KeyError: 'embed_tokens.weight'
1
#5 opened 29 days ago by
jiangtaozh
liked
a model
about 1 month ago
yuhuili/EAGLE3-LLaMA3.3-Instruct-70B
Updated
Mar 18
•
3.26k
•
6
New activity in
mistralai/Mistral-7B-Instruct-v0.2
about 1 year ago
deepspeed inference tensor parallelism memory footprint doesn't decrease with deepspeed tp_size increase.
6
#92 opened about 1 year ago by
jiangtaozh
deepspeed inference tensor parallelism memory footprint doesn't decrease with deepspeed tp_size increase.
6
#92 opened about 1 year ago by
jiangtaozh
deepspeed inference tensor parallelism memory footprint doesn't decrease with deepspeed tp_size increase.
6
#92 opened about 1 year ago by
jiangtaozh
Load more