Complete 1st round DPO training (10/10 epochs).
Browse files
model-00001-of-00002.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4965799096
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b2bff44179e46840bdba4d0101f5160ab7f3302625ec4e75f5a186868de2d6ee
|
3 |
size 4965799096
|
model-00002-of-00002.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1459729952
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:203ed4c8f522e3d6b4f08b6931d8db0c9a3aae352bdb6e19f20ad22a78a57593
|
3 |
size 1459729952
|