Please can I get an MLX version?
👍
1
1
#11 opened about 1 month ago by bulk52

Strange, why is Q3K_XL even smaller than Q3K_M?
2
#10 opened 2 months ago by X5R

How to run the 128k models
6
#7 opened 3 months ago by rogerooberg

How can I change the number of experts for inference?
🧠
1
#5 opened 3 months ago by win10

Tool calling does not seem to be supported
2
#4 opened 3 months ago by bingw5

Umm, another bump in the road? :/
2
#2 opened 3 months ago by MrDevolver

How do I extend a Qwen3 model that has been pulled by Ollama using the YaRN method?
2
#1 opened 3 months ago by MikeNate
