ismet committed on
Commit 0d3f9e7 · 1 Parent(s): d5e7fb1

Update README.md

Files changed (1)
  1. README.md +13 -1
README.md CHANGED
@@ -22,7 +22,19 @@ It achieves the following results on the evaluation set:

  ## Model description

- More information needed
+ This model was trained on a dataset called PWKP-GPT3-LENGTH-CONTROL-40BUCKETS.
+ The dataset contains 30k instances taken from PWKP and processed through GPT-3 to obtain simplifications.
+ The 30k instances comprise 10k intended to produce very long simplifications,
+ 10k intended to produce very short simplifications, and 10k with no simplicity level specified.
+ The model does not work successfully on these buckets.
+ Another dataset, PWKP-GPT3-LENGTH-CONTROL-4BUCKETS, also exists, but no model was ever trained on it.
+ Those buckets are also rather unbalanced.
+
+ The idea comes from
+ "Controllable Sentence Simplification",
+ Louis Martin et al., https://arxiv.org/pdf/1910.02677.pdf
+
+ This model was fine-tuned from FLAN-T5-base.

  ## Intended uses & limitations
