MelvinW's picture
Actually changed the Readme.md
465be86 verified
|
raw
history blame
1.14 kB
metadata
license: mit
base_model:
  - magistermilitum/tridis_HTR
library_name: transformers
language:
  - la

Base model: magistermilitum/tridis_HTR v1

Train Lines: ???

Eval Lines: ???

Test Lines: ???

Epochs: 14.1667 / 20

Eval CER: 0.0544

Test CER: 0.0622

Testresults with CERberus

Metric Value
Character Error Rate 6.22
Number of Correct Characters 186998
Number of Substitutions 5425
Number of Insertions 2933
Number of Deletions 3849
Total Character Count 196272
Original Lines Count 2288
Discarded Lines Count 0

Finetuned on an Anglicana-dataset, with mainly Middle Latin and few Middle English and Anglo-Norman text sources containing documents from:

  • the Common Pleas (CP)
  • the Justices (JUST)

from the English Legal Court Rolls.

The model has not been extensively tested.

Errors often occur in the Punctuation, which itself has an error rate of 44.44% which mostly consits of missed ‧ dots.

Potential biases are still to be identified.