Update README.md
README.md
CHANGED
@@ -27,7 +27,7 @@ pipeline_tag: text-generation
 
 # Qwen2.5-Lumen-14B
 
-* *Direct preference optimization finetuned for 3 epoch
+* *Direct preference optimization finetuned for 3 epoch*
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f74b6e6389380c77562762/ccriYlPOxZLDUI-o2XZ0K.png)
 
@@ -41,6 +41,8 @@ Trained [Qwen2.5-14B-Instruct] for 2 epochs on [jondurbin/gutenberg-dpo-v0.1] sa
 
 [Tanliboy](https://huggingface.co/tanliboy) trained [Qwen2.5-14B-Instruct] for 1 epoch on [HuggingFaceH4/ultrafeedback_binarized].
 
+*Mass checkpoint merged, Based on Qwen2.5-14B-Instruct.*
+
 ## Merge
 
 * Merged with a sophosympatheia <b>SLERP</b> *Ultrafeedback-Binarized DPO* and *Gutenberg DPO*