euclaise/crow-1b-attempt1


euclaise committed on Jan 10, 2024

Commit dbbcb88 · 1 Parent(s): f6905cf

Update README.md

Files changed (1)

README.md +1 -1


@@ -7,7 +7,7 @@ datasets:
 Expirements in large-scale small-scale preference learning.
-falcon-rw-1b trained with PRO (preference ranking optimization, see https://arxiv.org/abs/2306.17492) on SuperMC and PRM800K for 3 epochs, using my supertrainer2000 framework.
+falcon-rw-1b trained with PRO (preference ranking optimization, see https://arxiv.org/abs/2306.17492) on SuperMC and PRM800K (only stage 1) for 3 epochs, using my supertrainer2000 framework.
 This is an expiremental model.