---
license: mit
language:
- de
metrics:
- cer
library_name: transformers
tags:
- kurrent
- ocr
- htr
- 19th century
---
# TrOCR Kurrent-Model 19th century
Base model: **microsoft/trocr-base-handwritten**

Train Lines: 292'997  
Eval Lines: 7'513  
Test Lines: 15'817

Epochs: 19.66 / 20  
Eval CER: 0.02827  
Test CER: 0.02655
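
The card does not say how the CER values were computed (for example, whether edits are pooled over all lines or averaged per line, or whether text is normalized first). As a rough illustration only, the sketch below assumes the common definition: total character-level Levenshtein distance divided by the total number of reference characters. The function names `levenshtein` and `cer` are illustrative and not part of this model's code.

```python
from typing import Sequence


def levenshtein(ref: str, hyp: str) -> int:
    """Character-level edit distance (insertions, deletions, substitutions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,         # deletion
                            curr[j - 1] + 1,     # insertion
                            prev[j - 1] + cost)) # substitution or match
        prev = curr
    return prev[-1]


def cer(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Corpus-level CER: pooled edits over pooled reference length (assumed definition)."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")
    total_edits = sum(levenshtein(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    return total_edits / total_chars
```

Under this definition, a CER of 0.02655 corresponds to roughly 2.7 character errors per 100 reference characters.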

Fine-tuned on a Kurrent dataset containing:
- Material from the State Archives of Zurich ("Regierungsratsprotokolle"), provided by the State Archives of Zurich
- Lecture notes from the Humboldt lectures, provided by the Berlin-Brandenburg Academy of Sciences and Humanities
- Diary of Eugen Huber, provided by the University of Zurich
- Handwriting and copies by and of Gottfried Semper
- Konzilsprotokolle, University of Greifswald (19th century)
- as well as many other smaller collections/examples

The model has not been extensively tested.
Potential biases are still to be identified.
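
Since the model is fine-tuned from `microsoft/trocr-base-handwritten` and uses the `transformers` library, it can presumably be loaded like any other TrOCR checkpoint. The following is a minimal sketch, not a tested recipe: `transcribe_line` is a hypothetical helper, `model_id` must be replaced with this model's repository ID (not stated in this card), and the code assumes the repository ships its own processor configuration. If it does not, loading the processor from the base model is a possible fallback. The input is assumed to be an image of a single text line, since the training data is counted in lines.

```python
def transcribe_line(image_path: str, model_id: str) -> str:
    """Transcribe one cropped line image with a TrOCR checkpoint.

    `model_id` is a placeholder for the repository ID of this model.
    """
    # Imports are kept local so the helper can be defined without
    # loading the heavy dependencies up front.
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    # Assumption: the fine-tuned repository includes processor files.
    processor = TrOCRProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)

    # TrOCR expects RGB input; the processor resizes and normalizes it.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # Default generation settings; beam search or length limits can be
    # passed to generate() if needed.
    generated_ids = model.generate(pixel_values)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

A call would look like `transcribe_line("line_0001.png", "<repository-id>")`, where both arguments are placeholders. Full pages need to be segmented into lines by a separate tool before being passed to the model.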