Use matryoshka_dimensions to control the allowed output dimensions.
#13 opened by noooop0000
No description provided.
Once https://github.com/vllm-project/vllm/pull/16970 is merged, matryoshka_dimensions will take effect.
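For readers unfamiliar with the idea, here is a minimal sketch of what restricting output dimensions could look like. It assumes that matryoshka_dimensions is a list of permitted embedding sizes and that a reduced embedding is produced by keeping the leading components and re-normalizing. The function name `truncate_embedding` and its parameters are hypothetical, chosen for illustration only; this is not vLLM's implementation, and the linked PR is the authority on the actual behavior.

```python
import math
from typing import List, Sequence


def truncate_embedding(
    embedding: Sequence[float],
    dimensions: int,
    matryoshka_dimensions: Sequence[int],
) -> List[float]:
    """Hypothetical helper: shrink an embedding to an allowed size.

    Assumption: `matryoshka_dimensions` lists the only output sizes
    the model supports; any other request is rejected.
    """
    if dimensions not in matryoshka_dimensions:
        raise ValueError(
            f"dimensions={dimensions} is not allowed; "
            f"choose one of {sorted(matryoshka_dimensions)}"
        )
    if dimensions > len(embedding):
        raise ValueError("requested more dimensions than the embedding has")

    # Keep only the leading components of the full embedding.
    head = list(embedding[:dimensions])

    # Re-normalize to unit length so cosine similarity stays meaningful.
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head
    return [x / norm for x in head]
```

Under these assumptions, `truncate_embedding(vec, 256, [256, 512, 1024])` would return a 256-dimensional unit vector, while a request for an unlisted size such as 300 would raise `ValueError`.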