hadiasghari commited on
Commit
31f96d8
·
verified ·
1 Parent(s): 3007c7c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +117 -0
README.md ADDED
@@ -0,0 +1,117 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - de
5
+ pipeline_tag: text-generation
6
+ tags:
7
+ - german
8
+ - deutsch
9
+ - simplification
10
+ - vereinfachung
11
+ ---
12
+ # Model Card for Model ID
13
+
14
+ <!-- Provide a quick summary of what the model is/does. -->
15
+
16
+ We fine-tuned the [jphme/em_german_leo_mistral](https://huggingface.co/jphme/em_german_leo_mistral) with a set of ca. XXX newspaper articles which have been simplified by the Austrian Press Agency.
17
+ Our aim was to have a model which can simplify German-language text. This model has been trained with the completition-only configuration.
18
+
19
+
20
+
21
+ ## Model Details
22
+
23
+ ### Model Description
24
+
25
+ <!-- Provide a longer summary of what this model is. -->
26
+
27
+
28
+
29
+ - **Developed by:** Members of the [Public Interest AI research group](https://publicinterest.ai/), [HIIG Berlin](https://www.hiig.de/)
30
+ - **Model type:** simplification model, text generation
31
+ - **Language(s) (NLP):** German
32
+ - **License:** Apache 2.0
33
+ - **Finetuned from model:** jphme/em_german_leo_mistral
34
+
35
+ ### Model Sources
36
+
37
+ <!-- Provide the basic links for the model. -->
38
+
39
+ - **Repository:** https://github.com/fhewett/simba
40
+ <!-- - **Paper [optional]:** [More Information Needed] -->
41
+ - **Project website:** https://publicinterest.ai/tool/simba
42
+
43
+ ## Uses
44
+
45
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
46
+
47
+ ### Direct Use
48
+
49
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
50
+
51
+ This model works best for simplifying German-language newspaper articles (news items, not commentaries or editorials). It may work for other types of texts.
52
+
53
+ ### Downstream Use
54
+
55
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
56
+ We have fine-tuned using only newspaper articles. We have not yet performed extensive out-of-domain testing, but believe that the model's capabilities could be improved by fine-tuning on more diverse data. Contact us if you have a dataset which you think could work (parallel texts, German standard & German simplified).
57
+
58
+ <!-- ### Out-of-Scope Use -->
59
+
60
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
61
+
62
+ ## Bias, Risks, and Limitations
63
+
64
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
65
+
66
+ As with most text generation models, the model sometimes produces information that is incorrect.
67
+
68
+ ### Recommendations
69
+
70
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
71
+
72
+ Please check manually that your output text corresponds to the input text, as factual inconsistencies may have arisen.
73
+
74
+ ## How to Get Started with the Model
75
+
76
+ Use the code below to get started with the model.
77
+
78
+ [More Information Needed]
79
+
80
+ ## Training Details
81
+
82
+ ### Training Data
83
+
84
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
85
+
86
+ A sample of the data used to train our model can be found [here](https://github.com/fhewett/apa-rst/tree/main/original_texts).
87
+
88
+ #### Training Hyperparameters
89
+
90
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
91
+
92
+ <!-- #### Speeds, Sizes, Times [optional] -->
93
+
94
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
95
+
96
+ ## Evaluation
97
+
98
+ <!-- This section describes the evaluation protocols and provides the results. -->
99
+
100
+ #### Summary
101
+
102
+ For now, we have manually checked the performance of our model on a small sample of texts. Whilst it seems to produce good summaries of all texts, it only seems to simplify newspaper articles (i.e. similar to our training data). We have not yet applied any large-scale metrics based evaluation.
103
+
104
+
105
+ <!-- ## Citation [optional]
106
+
107
+ **BibTeX:**
108
+
109
+ [More Information Needed]
110
+
111
+ **APA:**
112
+
113
+ [More Information Needed]-->
114
+
115
+ ## Model Card Contact
116
+
117
+ simba -at- hiig.de