There really seems to be some problem with the tokenizer or the ggufs

#12

by Smorty100 - opened Sep 18, 2024

Sep 18, 2024

I am attempting to use the model in a conversation, however the output seems really incoherent.
Here is an example I got out of ollama (using the default config and demperature)

>>> respond with the word "hello" and nothing else
hello
user
repeat what I just said there but switch hello to goodbye and repeat this three times after your response here as
well.
}
repeat_rpt

player

rjmehta

Sep 18, 2024

It is not using the correct eos token. Can you check what token is being printed after "hello"? Put that as eos token or stop token id.

hunkim

upstage org Sep 18, 2024

I cannot reproduce it, but we will definitely check with the @ollama team. Thank you for reporting the issue!

ollama

Sep 18, 2024

This comment has been hidden

jmorganca

Sep 18, 2024

@Smorty100 did you install the latest version of Ollama? https://ollama.com/download – 0.3.11 is required for Solar Pro Preview

nonetrix

Sep 19, 2024

•

edited Sep 19, 2024

I'm running ollama 0.3.11, I think saying it's the best single GPU model is too bullish if something isn't wrong. It's world knowledge is really bad, worse than even llama 8B. Is it just poor at world knowledge and better at math etc.? Or is it contaminated and preforms much worse in real world? I use this as a vibe check since it's somewhat obscure and used to trick up most models in the past, but these days most 90% of models easily get it right but not this one even some 7B's do now

>>> Who is the Russian music artist Victor Tsoi?
Victor Tsai, also known as Victor Tsai or Viktar Cio, is a popular Belarusian-born hip hop and electronic music producer. Despite his name sounding similar to 'Tsai', he was born in Minsk, Belarus, on October 30, 1982. He is 
well-known for creating music under various aliases such as Dino MC47 and Victor DC. His work spans multiple genres including hip hop, pop, electronic dance music (EDM), and R&B.

Also, how does it compare to Mistral Small? I am getting better results with that personally and it's similar size

hunkim

upstage org Sep 19, 2024

Solar-Pro is primarily designed for RAG and instruction following tasks. Different models have their own advantages and disadvantages. You can check out https://eqbench.com/ for EQ and MAGI-Hard.

hunkim changed discussion status to closed Sep 19, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment