logical-reasoning / data /Llama3.1-70B-Chinese-Chat_shots_metrics.csv
dh-mc's picture
ready to run 10-shots for 70/72B models
809e98c
raw
history blame
246 Bytes
shots,model,run,accuracy,precision,recall,f1,ratio_valid_classifications
0,Llama3.1-70B-Chinese-Chat,shenzhi-wang/Llama3.1-70B-Chinese-Chat/shots-00,0.7636666666666667,0.7806653325131986,0.7636666666666667,0.7525813484548423,0.009666666666666667