Inference in a multi-model session gives wrong output
#8
opened by enlei
When you use this model in LM Studio, you need to use the included ChatML preset.
Then, in Settings (on the right-hand side of the chat screen), go to Model Initialization -> Flash Attention and turn it on.
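For reference, here is a minimal Python sketch of the prompt layout a ChatML preset produces; the `render_chatml` helper and the example messages are illustrative, not part of LM Studio's API. If the preset is wrong, the model sees malformed turn markers, which is a common cause of garbled output.

```python
# Minimal sketch of the ChatML prompt format this model expects.
# Each turn is wrapped in <|im_start|>{role} ... <|im_end|> markers,
# and the prompt ends with an open assistant turn for the model to complete.
def render_chatml(messages):
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    parts.append("<|im_start|>assistant\n")  # model generates from here
    return "".join(parts)

print(render_chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]))
```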
jklj077 changed discussion status to closed