Can't reproduce MATH performance

#66

by jpiabrantes - opened Jul 30

In the model card, you report a score of 51.9 on MATH for Llama 3.1 8B in a zero-shot setting.

I can't reproduce this. Did you use lm-evaluation-harness or some other code?
