Does anyone feel Qwen3 often fails to follow instructions accurately?

#18

by DOFOFFICIAL - opened 15 days ago

Discussion

DOFOFFICIAL

15 days ago

Does anyone feel Qwen3 often fails to follow instructions accurately?

DOFOFFICIAL

15 days ago

Like, I asked it to write code to numerically solve some problems but it always tries to mathematically derive it. I asked to numerically solve since it does not exist an analytical solution.
But qwen3 just keeps reasoning and saying "Well,,," "But,..." "Wait,,,"

DOFOFFICIAL

15 days ago

It's like,,,, benchmark oriented? :( like llama4?

Jakry

15 days ago

i think qwq 32b is much better in instruction following

praneth02

15 days ago

I feel it is really good at instructions as I have been testing on my rag chatbot project. It performs really well when i make the instruction more precise.
Previously I used Mistral-7B-Instruct-v0.3 which is like dogsh*t compared to qwen3 when it comes to answers.

DOFOFFICIAL

14 days ago

"i think qwq 32b is much better in instruction following"

Yeah, same for me

AdamF92

8 days ago

@DOFOFFICIAL I have the same problems and QwQ seems to be much better in complex reasoning. Qwen3 have similar problems as QwQ-Preview. Please check my topic: https://huggingface.co/Qwen/Qwen3-235B-A22B/discussions/32

devopsML

7 days ago

in either case, it is highly recommended to use both, since they have both pros and cons. for now, both of these models topple that of deepseek r1-distill-32b (esp. on huggingchat!), since the latter hallucinates too much.
https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B/discussions/45

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment