convaise-idp
/

flan-t5-base-finetuned-length_control_token

Text Generation

text2text-generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

ismet commited on Mar 6, 2023

Commit

0d3f9e7

·

1 Parent(s): d5e7fb1

Update README.md

Files changed (1) hide show

README.md +13 -1

README.md CHANGED Viewed

@@ -22,7 +22,19 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations

 ## Model description
+This model was trained on a dataset called PWKP-GPT3-LENGTH-CONTROL-40BUCKETS.
+The dataset contains 30k instances taken from PWKP, then processed through GPT3 to obtain simplifications.
+The 30k instances come from: 10k which were supposed to generate very long simplifications,
+10k which were supposed to generate very short simplifications, and 10k without specifying the simplicity level.
+The model does not sucessfuly work on these buckets.
+There exists another dataset, the PWKP-GPT3-LENGTH-CONTROL-4BUCKETS, but it was never trained on something.
+Those buckets are also rather unbalanced.
+The idea comes from
+Controllable Sentence Simplification
+Louis Martin, https://arxiv.org/pdf/1910.02677.pdf
+It was fine-tuned on the FLAN-T5-base model.
 ## Intended uses & limitations