lianghsun commited on
Commit
89cf561
·
1 Parent(s): 4fe149b

Complete 1st round DPO training (10/10 epochs).

Browse files
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2a4129d85cadd57a2bd201e67d424bf472aad2d1f708fc479d6888ff859dca4d
3
  size 4965799096
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b2bff44179e46840bdba4d0101f5160ab7f3302625ec4e75f5a186868de2d6ee
3
  size 4965799096
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7f4a01c12a64842a84ab1aea11fc242f3cd7f305931c5e86385ca18896ede088
3
  size 1459729952
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:203ed4c8f522e3d6b4f08b6931d8db0c9a3aae352bdb6e19f20ad22a78a57593
3
  size 1459729952