Update README.md
README.md
CHANGED
@@ -22,7 +22,19 @@ It achieves the following results on the evaluation set:
 
 ## Model description
 
-
+This model was trained on a dataset called PWKP-GPT3-LENGTH-CONTROL-40BUCKETS.
+The dataset contains 30k instances taken from PWKP, then processed through GPT-3 to obtain simplifications.
+The 30k instances consist of 10k that were meant to produce very long simplifications,
+10k that were meant to produce very short simplifications, and 10k with no simplicity level specified.
+The model does not work successfully on these buckets.
+Another dataset, PWKP-GPT3-LENGTH-CONTROL-4BUCKETS, also exists, but no model has been trained on it.
+Those buckets are also rather unbalanced.
+
+The idea comes from
+Controllable Sentence Simplification
+Louis Martin et al., https://arxiv.org/pdf/1910.02677.pdf
+
+It was fine-tuned from the FLAN-T5-base model.
 
 ## Intended uses & limitations
 