vllm support

#15
by yaronr - opened

Hi
Can you please share whether you plan on adding support for your model in vllm? ('SolarForCausalLM' architecture)
We would love to run our independent analysis on solar and share our results (and we use vllm).
Thank you!

yaronr changed discussion title from Chunked prefill & prefix caching to vllm support

Sign up or log in to comment