Make `config.json` compatible with standard sliding window config

#6
by hmellor HF Staff - opened

This will add layer_types to the loaded config class so that libraries such as vLLM can load hybrid attention models in the standard Hugging Face format.

Since we do not edit configuration_phi4flash.py this change is backwards compatible.

Once this change has been merged along with https://github.com/vllm-project/vllm/pull/21927 we can update configuration_phi4flash.py so that the the modelling code works in the standard way too.

renll changed pull request status to merged

Sign up or log in to comment