jack seaver
tutu329
tutu329's activity
It is a 70B model. Why does it use almost 280GB of disk?
2
#7 opened 5 months ago by tutu329
KeyError: 'model.layers.45.block_sparse_moe.gate.g_idx'
5
#2 opened 7 months ago by tutu329
Will there be a GPTQ int4 version?
1
#1 opened 7 months ago by tutu329
No special_tokens_map.json, tokenizer_config.json, or tokenizer.json
#1 opened 7 months ago by tutu329
vLLM cannot run inference on this model (other 70B GPTQ models are OK)
15
#1 opened 9 months ago by tutu329
Hoping for a qwen-72b-chat-awq (one that can be used for inference with vLLM)
#3 opened 11 months ago by tutu329