Make `config.json` compatible with standard sliding window config
#6 by hmellor (HF Staff), opened
This will add `layer_types` to the loaded config class so that libraries such as vLLM can load hybrid attention models in the standard Hugging Face format.
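For illustration, here is a minimal sketch of how a consumer might read a per-layer `layer_types` list from a config. The config fragment, its values, and the helper `summarize_layer_types` are hypothetical and not taken from this repo. The `"sliding_attention"` / `"full_attention"` labels are assumed to follow the convention used by recent Transformers configs.

```python
import json

# Hypothetical config fragment: the values are illustrative only,
# not copied from this repository's config.json.
CONFIG_TEXT = """
{
  "num_hidden_layers": 4,
  "sliding_window": 512,
  "layer_types": [
    "sliding_attention",
    "full_attention",
    "sliding_attention",
    "full_attention"
  ]
}
"""


def summarize_layer_types(config: dict) -> dict:
    """Count how many layers use each attention type (hypothetical helper)."""
    layer_types = config.get("layer_types")
    if layer_types is None:
        raise ValueError("config has no 'layer_types' entry")
    if len(layer_types) != config["num_hidden_layers"]:
        raise ValueError("'layer_types' must have one entry per hidden layer")
    counts: dict = {}
    for layer_type in layer_types:
        counts[layer_type] = counts.get(layer_type, 0) + 1
    return counts


config = json.loads(CONFIG_TEXT)
print(summarize_layer_types(config))
```

A loader that understands this list can decide per layer whether to apply the sliding window, without model-specific code.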
Since we do not edit `configuration_phi4flash.py`, this change is backwards compatible.
Once this change has been merged along with https://github.com/vllm-project/vllm/pull/21927, we can update `configuration_phi4flash.py` so that the modelling code works in the standard way too.
renll changed pull request status to merged