Update README.md
README.md
CHANGED
@@ -6,7 +6,7 @@ language:
 - de
 ---
 
-**OCRonos** is a series of specialized language models trained by PleIAs for the correction of badly digitized texts
+**OCRonos** is a series of specialized language models trained by PleIAs for the correction of badly digitized texts, as part of the **Bad Data Toolbox**.
 
 OCROnos models are versatile tools supporting the correction of OCR errors, wrong word cut/merge and overall broken text structures. The training data includes a highly diverse set of ocrized texts in multiple languages from PleIAs open pre-training corpus, drawn from cultural heritage sources (Common Corpus) and financial and administrative documents in open data (Finance Commons).
 