PPL Chart?

#1
by John198 - opened

Hey so taking a brief look at the ppl v bpw chart for llama 70B showed that the most dramatic difference between exl2 and exl3 is at the lower end with bpw < 4. While significantly more efficient overall, I'm very curious to see how much of an improvement it is for Mistral large in particular because the current sweetspot is 2.7-2.85 bpw for 2x 3090s (depending on context) and where the disparity is most likely to be felt when updating.

Any chance you would be able to do a similar chart for 123B? It's a bit of work but I think it presents a very exciting opportunity to possibly 'upgrade' from 70B models to 123b models by default depending on the results.

Added it now

Can we get exl3 version of legendary mistral large 2407? 2411 kinda lacks at some ... stuff.

@Alastar-Smith What size do you want?

@Alastar-Smith Uploaded exl3 quants of 2407 here.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment