Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,26 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: cc-by-nc-sa-4.0
|
| 3 |
+
library_name: transformers
|
| 4 |
+
pipeline_tag: token-classification
|
| 5 |
+
---
|
| 6 |
+
### xlm-roberta-base for token classification, specifically fine-tuned for question-answer extraction for English
|
| 7 |
+
|
| 8 |
+
This is the `xlm-roberta-base`, fine-tuned on manually annotated Finnish data and ChatGPT-annotated data.
|
| 9 |
+
### Hyperparameters
|
| 10 |
+
```
|
| 11 |
+
batch_size = 8
|
| 12 |
+
epochs = 10 (trained for less)
|
| 13 |
+
base_LM_model = "xlm-roberta-base"
|
| 14 |
+
max_seq_len = 512
|
| 15 |
+
learning_rate = 5e-5
|
| 16 |
+
```
|
| 17 |
+
### Performance
|
| 18 |
+
```
|
| 19 |
+
Accuracy = 0.88
|
| 20 |
+
Question F1 = 0.77
|
| 21 |
+
Answer F1 = 0.81
|
| 22 |
+
```
|
| 23 |
+
|
| 24 |
+
### Usage
|
| 25 |
+
|
| 26 |
+
Instructions on how to use the results will be added later.
|