Independent evaluation results

#3
by yaronr - opened

Dear Equall team,
I am following up on my previous post.

I'm pleased to share our independent evaluation of the model using our implementation of the MMLU-Pro benchmark.
We recognize that MMLU-Pro is probably not the best benchmark for a legal model, but we are sharing the results anyway in the hope that you find them useful.

We would be happy to work with you on this and other benchmarks.
