|
--- |
|
license: mit |
|
language: |
|
- de |
|
metrics: |
|
- cer |
|
library_name: transformers |
|
tags: |
|
- kurrent |
|
- ocr |
|
- htr |
|
- 19th century |
|
--- |
|
# TrOCR Kurrent-Model 19th century |
|
Base model: **microsoft/trocr-base-handwritten** |
|
|
|
Train Lines: 292'997 |
|
Eval Lines: 7'513 |
|
Test Lines: 15'817 |
|
|
|
Epochs: 19.66 / 20 |
|
Eval CER: 0.02827 |
|
Test CER: 0.02655 |
|
|
|
Finetuned on Kurrent-dataset, containing: |
|
- Material from the State Archives of Zurich ("Regierungsratsprotokolle"), provided by the State Archives of Zurich |
|
- Lecture notes of Humboldt Lectures, provided by the Berlin-Brandenburgian Academy of Sciences |
|
- Diary of Eugen Huber, provided by the University of Zurich |
|
- Handwritting and Copies by and of Gottfried Semper |
|
- Konzilsprotokolle, University of Greifswald (19th century) |
|
- as well as many other smaller collections/examples |
|
|
|
The model has not been extensively tested. |
|
Potential biases are still to be identified. |