Jiangtao Zh

jiangtaozh

AI & ML interests

None yet

Recent Activity

new activity 26 days ago

yuhuili/EAGLE-mixtral-instruct-8x7B:kv_cache_utils.py NotImplementedError

new activity 28 days ago

yuhuili/EAGLE3-LLaMA3.3-Instruct-70B:KeyError: 'embed_tokens.weight'

liked a model about 1 month ago

yuhuili/EAGLE3-LLaMA3.3-Instruct-70B

Organizations

None yet

New activity in yuhuili/EAGLE-mixtral-instruct-8x7B 26 days ago

kv_cache_utils.py NotImplementedError

#3 opened 26 days ago by

New activity in yuhuili/EAGLE3-LLaMA3.3-Instruct-70B 28 days ago

KeyError: 'embed_tokens.weight'

#5 opened 29 days ago by

liked a model about 1 month ago

yuhuili/EAGLE3-LLaMA3.3-Instruct-70B

Updated Mar 18 • 3.26k • 6

New activity in mistralai/Mistral-7B-Instruct-v0.2 about 1 year ago

deepspeed inference tensor parallelism memory footprint doesn't decrease with deepspeed tp_size increase.

#92 opened about 1 year ago by
