Commit
·
3e71317
1
Parent(s):
bfdc478
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,42 @@
|
|
1 |
---
|
2 |
-
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
language: tr
|
3 |
+
datasets:
|
4 |
+
- SUNLP-NER-Twitter
|
5 |
---
|
6 |
+
|
7 |
+
# bert-loodos-sunlp-ner-turkish: model fine-tuned from loodos' Turkish BERT model for NER task.
|
8 |
+
|
9 |
+
## Introduction
|
10 |
+
[bert-loodos-sunlp-ner-turkish] is a NER model that was fine-tuned from the loodos/bert-base-turkish-cased model on the SUNLP-NER-Twitter dataset.
|
11 |
+
|
12 |
+
## Training data
|
13 |
+
The model was trained on the SUNLP-NER-Twitter dataset (5000 tweets). The dataset can be found at https://github.com/SU-NLP/SUNLP-Twitter-NER-Dataset
|
14 |
+
Named entity types are as follows:
|
15 |
+
Person, Location, Organization, Time, Money, Product, TV-Show
|
16 |
+
|
17 |
+
|
18 |
+
## How to use bert-loodos-sunlp-ner-turkish with HuggingFace
|
19 |
+
|
20 |
+
```python
|
21 |
+
from transformers import AutoTokenizer, AutoModelForTokenClassification
|
22 |
+
|
23 |
+
tokenizer = AutoTokenizer.from_pretrained("busecarik/bert-loodos-sunlp-ner-turkish")
|
24 |
+
model = AutoModelForTokenClassification.from_pretrained("busecarik/bert-loodos-sunlp-ner-turkish")
|
25 |
+
```
|
26 |
+
|
27 |
+
## Model performances on SUNLP-NER-Twitter test set (metric: seqeval)
|
28 |
+
Precision|Recall|F1
|
29 |
+
-|-|-
|
30 |
+
83.46|85.65|84.53
|
31 |
+
|
32 |
+
Classification Report
|
33 |
+
|
34 |
+
Entity|Precision|Recall|F1
|
35 |
+
-|-|-|-
|
36 |
+
LOCATION|0.82|0.71|0.76
|
37 |
+
MONEY|0.92|0.76|0.83
|
38 |
+
ORGANIZATION|0.82|0.87|0.85
|
39 |
+
PERSON|0.91|0.91|0.91
|
40 |
+
PRODUCT|0.57|0.33|0.42
|
41 |
+
TIME|0.86|0.83|0.85
|
42 |
+
TVSHOW|0.65|0.63|0.64
|