NO function_call
#11 opened 6 days ago by justinliu12138
Could you give me a reason why you ignore the kv_a_proj_with_mqa layer when quantizing this model?
1
#10 opened 29 days ago by superahn
Frequent interruptions during reasoning with vllm 0.8.1
#9 opened 2 months ago by alwinzhang
Stuck when running on 8xH100
1
#8 opened 2 months ago by Thai
Accuracy test
#1 opened 3 months ago by zhnagchenchne