lievan commited on
Commit
622628b
·
verified ·
1 Parent(s): 0d47df5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -19,7 +19,7 @@ pipeline_tag: text-generation
19
 
20
  # Introduction
21
 
22
- Eurus-7B-KTO is KTO fine-tuned from Eurus-7B-SFT on all multi-turn trajectory pairs in UltraInteract and all pairs in UltraFeedback.
23
 
24
  It achieves the best overall performance among open-source models of similar sizes and even outperforms specialized models in corresponding domains in many cases. Notably, EURUS-7B-KTO outperforms baselines that are 5× larger.
25
 
 
19
 
20
  # Introduction
21
 
22
+ Eurus-7B-KTO is [KTO](https://arxiv.org/abs/2402.01306) fine-tuned from Eurus-7B-SFT on all multi-turn trajectory pairs in UltraInteract and all pairs in UltraFeedback.
23
 
24
  It achieves the best overall performance among open-source models of similar sizes and even outperforms specialized models in corresponding domains in many cases. Notably, EURUS-7B-KTO outperforms baselines that are 5× larger.
25