Teklia/pylaia-norhand-v1-postprocessed

@@ -23,7 +23,7 @@ This model performs Handwritten Text Recognition in Norwegian. It was developed
 ## Model description
-The model has been trained using the PyLaia library on the [NorHand](https://zenodo.org/record/6542056) document images.
 Line bounding boxes were improved using a post-processing step.
 Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
@@ -32,33 +32,33 @@ Training images were resized with a fixed height of 128 pixels, keeping the orig
 The model achieves the following results:
-| set   | CER (%)    | WER (%)   |
-| ----- | ---------: | --------: |
-| train | 2.33       | 5.62      |
-| val   | 8.20       | 24.75     |
-| test  | 7.81       | 23.30     |
-Results improve on validation and test sets when PyLaia is combined with a 6-gram language model.
-The language model is trained on [this text corpus](https://www.nb.no/sprakbanken/en/resource-catalogue/oai-nb-no-sbr-73/) published by the National Library of Norway.
-| set   | CER (%)    | WER (%)   |
-| ----- | ---------: | --------: |
-| train | 2.62       | 6.13      |
-| val   | 7.01       | 19.75     |
-| test  | 6.75       | 18.22     |
 ## How to use?
-Please refer to the [documentation](https://atr.pages.teklia.com/pylaia/).
 # Cite us!
 ```bibtex
-@inproceedings{pylaia-lib,
-    author = "Tarride, Solène and Schneider, Yoann and Generali, Marie and Boillet, Melodie and Abadie, Bastien and Kermorvant, Christopher",
-    title = "Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library",
-    booktitle = "Submitted at ICDAR2024",
-    year = "2024"
 }
 ```

 ## Model description
+The model has been trained using the PyLaia library on the [NorHand](https://zenodo.org/record/6542056) dataset.
 Line bounding boxes were improved using a post-processing step.
 Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
 The model achieves the following results:
+| set   | Language model | CER (%) | WER (%) |
+|:----- |:-------------- | -------:| -------:|
+| train | no             |    2.33 |    5.62 |
+| train | yes            |    2.62 |    6.13 |
+| val   | no             |    8.20 |   24.75 |
+| val   | yes            |    7.01 |   19.75 |
+| test  | no             |    7.81 |   23.30 |
+| test  | yes            |    6.75 |   18.22 |
+An external 6-gram character language model can be used to improve recognition on the validation and test sets. The language model is trained on [this text corpus](https://www.nb.no/sprakbanken/en/resource-catalogue/oai-nb-no-sbr-73/) published by the National Library of Norway.
 ## How to use?
+Please refer to the [PyLaia documentation](https://atr.pages.teklia.com/pylaia/usage/prediction/) to use this model.
 # Cite us!
 ```bibtex
+@inproceedings{pylaia2024,
+    author = {Tarride, Solène and Schneider, Yoann and Generali-Lince, Marie and Boillet, Mélodie and Abadie, Bastien and Kermorvant, Christopher},
+    title = {{Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library}},
+    booktitle = {Document Analysis and Recognition - ICDAR 2024},
+    year = {2024},
+    publisher = {Springer Nature Switzerland},
+    address = {Cham},
+    pages = {387--404},
+    isbn = {978-3-031-70549-6}
 }
 ```
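
To make the effect of the language model in the results table above easier to read, the relative error reduction per split can be computed directly from the reported CER and WER values. The snippet below is only an illustrative sketch: the `RESULTS` dictionary and the helper names `relative_reduction` and `summarize` are assumptions introduced here, not part of PyLaia or of the model card.

```python
# CER/WER values (%) copied from the results table in the model card.
# Each entry is (CER, WER); "no_lm" is PyLaia alone, "lm" adds the 6-gram language model.
RESULTS = {
    "train": {"no_lm": (2.33, 5.62), "lm": (2.62, 6.13)},
    "val": {"no_lm": (8.20, 24.75), "lm": (7.01, 19.75)},
    "test": {"no_lm": (7.81, 23.30), "lm": (6.75, 18.22)},
}


def relative_reduction(baseline: float, with_lm: float) -> float:
    """Relative error reduction in percent; a negative value means the error increased."""
    return (baseline - with_lm) / baseline * 100.0


def summarize(results: dict) -> dict:
    """Return {split: (relative CER reduction, relative WER reduction)}."""
    summary = {}
    for split, scores in results.items():
        (cer_base, wer_base), (cer_lm, wer_lm) = scores["no_lm"], scores["lm"]
        summary[split] = (
            relative_reduction(cer_base, cer_lm),
            relative_reduction(wer_base, wer_lm),
        )
    return summary


if __name__ == "__main__":
    for split, (cer_gain, wer_gain) in summarize(RESULTS).items():
        print(f"{split:5s}  CER: {cer_gain:+.1f}%  WER: {wer_gain:+.1f}%")
```

On the reported numbers, the language model lowers CER by roughly 14% and WER by roughly 20 to 22% (relative) on the validation and test sets, while both error rates rise slightly on the training set.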