Update README.md
README.md
CHANGED
@@ -15,7 +15,7 @@ pipeline_tag: token-classification
 
 widget:
 
-- text: "L'ANC vol que l'11 de setembre al Passeig de
+- text: "L'ANC vol que l'11 de setembre al Passeig de Gràcia sigui una fita enguany."
 
 - text: "El Martí llegeix el Cavall Fort."
 
@@ -54,6 +54,8 @@ widget:
 
 The **multiner** is a Named Entity Recognition (NER) model for the Catalan language fine-tuned from the [BERTa] model, a [RoBERTa](https://arxiv.org/abs/1907.11692) base model pre-trained on a medium-size corpus collected from publicly available corpora and crawlers (check the BERTa model card for more details).
 
+It has been trained with a dataset that contains 9 main types and 52 subtypes on all kinds of short texts, with almost 59K documents.
+
 ## Intended uses and limitations
 
 
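The model card touched by this commit describes a token-classification (NER) model with `widget` example sentences. As a rough illustration of how such a model might be queried, here is a minimal sketch using the `transformers` `pipeline` API. It rests on several assumptions: `MODEL_ID` is a placeholder and not the actual repository id, `summarize_entities` and `tag_text` are hypothetical helper names, and the label names the model returns are not stated in the commit.

```python
from typing import Any, Dict, List, Tuple

# Placeholder only: replace with the real Hub repository id of the model.
# This value is an assumption and does not come from the commit.
MODEL_ID = "your-namespace/multiner"


def summarize_entities(entities: List[Dict[str, Any]]) -> List[Tuple[str, str, float]]:
    """Reduce aggregated pipeline output to (text, label, rounded score) tuples."""
    return [
        (ent["word"], ent["entity_group"], round(float(ent["score"]), 2))
        for ent in entities
    ]


def tag_text(text: str, model_id: str = MODEL_ID) -> List[Tuple[str, str, float]]:
    """Run a token-classification pipeline on `text`.

    Calling this downloads the model named by `model_id`, so it needs
    network access and a valid repository id.
    """
    # Imported lazily so summarize_entities works without transformers installed.
    from transformers import pipeline

    ner = pipeline(
        "token-classification",
        model=model_id,
        aggregation_strategy="simple",  # merge sub-word pieces into entity spans
    )
    return summarize_entities(ner(text))
```

With a valid repository id, calling `tag_text("El Martí llegeix el Cavall Fort.")` on one of the widget sentences would return one tuple per detected entity. The `aggregation_strategy="simple"` setting is chosen here so that the output is grouped by entity span instead of by individual token.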