YaRN not enabled correctly
#3
by
CISCai
- opened
First off the GGUFs are missing all the YaRN metadata, but in addition to that there's something not quite right with the context lengths, the original model's context length is 40960, not 32768 and as such a scaling factor of 4.0 should then yield 163840, not 131072.