Update README.md
README.md CHANGED
@@ -39,6 +39,12 @@ The table below summarizes the evaluation results:
 | google/gemma-2-9b-it | 54.13% |
 | ytu-ce-cosmos/Turkish-Llama-8b-DPO-v0.1 | 36.89% |
 
+
+### Voting Methodology
+
+A question and two answers from different models were presented to human judges. The judges selected the better answer based on their preferences. For example, in the question below, the judge selected the answer on the right:
+![Question example](images/example.png)
+
 ### 📊 Turkish Evaluation Benchmark Results (via `malhajar17/lm-evaluation-harness_turkish`)
 
 | Model Name | Average | MMLU | Truthful_QA | ARC | Hellaswag | Gsm8K | Winogrande |
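
For reference, here is a minimal sketch of how win rates like those in the first table could be tallied from pairwise human judgments. The `votes` records and their field layout are hypothetical illustrations, not part of this repository, and the actual aggregation used for the table may differ (e.g. handling of ties).

```python
from collections import defaultdict

# Hypothetical judgment records: each vote pairs two model answers to the
# same question and names the model whose answer the judge preferred.
votes = [
    ("google/gemma-2-9b-it", "ytu-ce-cosmos/Turkish-Llama-8b-DPO-v0.1",
     "google/gemma-2-9b-it"),
    # ... one tuple per judged question ...
]

wins = defaultdict(int)         # times a model's answer was preferred
appearances = defaultdict(int)  # times a model appeared in a comparison

for model_a, model_b, winner in votes:
    appearances[model_a] += 1
    appearances[model_b] += 1
    wins[winner] += 1

# Win rate = preferred answers / total comparisons the model took part in.
for model in appearances:
    print(f"{model}: {wins[model] / appearances[model]:.2%}")
```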