Math & Code Benchmark/Testing for GGUFs

by bobchenyx - opened 18 days ago

18 days ago

Hi, thanks for releasing such great quantization versions.

I would like to ask if there are any open source frameworks/tools that could be used to test code/math benchmark for GGUF models?

Thanks!

shimmyshimmer

Unsloth AI org 18 days ago

Hi, thanks for releasing such great quantization versions.

I would like to ask if there are any open source frameworks/tools that could be used to test code/math benchmark for GGUF models?

Thanks!

You could use elethur ai's lm harness

bobchenyx

18 days ago

•

edited 18 days ago

Hi, thanks for releasing such great quantization versions.

I would like to ask if there are any open source frameworks/tools that could be used to test code/math benchmark for GGUF models?

Thanks!

You could use elethur ai's lm harness

Did you use that framework to test these GGUF models?
Caused I'v tried before and they didn't seems to be working.
and they are truly slow.

shimmyshimmer

Unsloth AI org 17 days ago

Hi, thanks for releasing such great quantization versions.

I would like to ask if there are any open source frameworks/tools that could be used to test code/math benchmark for GGUF models?

Thanks!

You could use elethur ai's lm harness

Did you use that framework to test these GGUF models?
Caused I'v tried before and they didn't seems to be working.
and they are truly slow.

Yes we did. However they did not match official MMLU scores so we needed to make our own custom evaluation framework

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment