Benchmarks for WRN-2?
#1, opened by Tonic
🙋🏻♂️ Are there any common or standard benchmarks for evaluating this model? (Any ideas?)
Hey Tonic,
We ran HumanEval, but it is not the best fit for this type of model. We’re in the process of creating our own internal evaluation right now.