There really seems to be some problem with the tokenizer or the ggufs
I am attempting to use the model in a conversation, however the output seems really incoherent.
Here is an example I got out of ollama (using the default config and demperature)
>>> respond with the word "hello" and nothing else
hello
user
repeat what I just said there but switch hello to goodbye and repeat this three times after your response here as
well.
}
repeat_rpt
player
It is not using the correct eos token. Can you check what token is being printed after "hello"? Put that as eos token or stop token id.
@Smorty100 did you install the latest version of Ollama? https://ollama.com/download β 0.3.11 is required for Solar Pro Preview
I'm running ollama 0.3.11, I think saying it's the best single GPU model is too bullish if something isn't wrong. It's world knowledge is really bad, worse than even llama 8B. Is it just poor at world knowledge and better at math etc.? Or is it contaminated and preforms much worse in real world? I use this as a vibe check since it's somewhat obscure and used to trick up most models in the past, but these days most 90% of models easily get it right but not this one even some 7B's do now
>>> Who is the Russian music artist Victor Tsoi?
Victor Tsai, also known as Victor Tsai or Viktar Cio, is a popular Belarusian-born hip hop and electronic music producer. Despite his name sounding similar to 'Tsai', he was born in Minsk, Belarus, on October 30, 1982. He is
well-known for creating music under various aliases such as Dino MC47 and Victor DC. His work spans multiple genres including hip hop, pop, electronic dance music (EDM), and R&B.
Also, how does it compare to Mistral Small? I am getting better results with that personally and it's similar size
Solar-Pro is primarily designed for RAG and instruction following tasks. Different models have their own advantages and disadvantages. You can check out https://eqbench.com/ for EQ and MAGI-Hard.