I want to use this model for speculative decoding in vLLM, but I get a "LlamaForCausalLMEagle3" error when I start the vLLM server.
#3 opened 7 days ago by quietred

"head_dim": 80
5
#2 opened about 1 month ago by rjmehta
Adding `safetensors` variant of this model
#1 opened about 1 month ago by SFconvertbot
