Model generating non-stop when used in Cline through vLLM

#15
by mhwang093 - opened

Hi, I wonder if anyone has experienced the same issue: when I host the model with vLLM following the instructions from Hugging Face and then connect it to Cline (the VS Code extension), it generates non-stop for any question I ask.
