Model Card for yzk/trocr-large-printed-vedic

OCR for Vedic texts printed in Devanagari.

Note: This version is limited to texts in which accents are marked by vertical lines above the Devanagari characters.
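As a minimal usage sketch, the snippet below shows how a checkpoint like this one might be run on a single cropped text-line image. It assumes the checkpoint is a standard TrOCR `VisionEncoderDecoderModel`, as the repository name suggests, and that the repository ships the processor files next to the weights; neither is confirmed by this card. The helper name `recognize_line` and the input file name are hypothetical, and the default `max_length` of 512 is borrowed from the training parameters as an assumption.

```python
# Repository id as shown on the model page.
MODEL_ID = "yzk/trocr-large-printed-vedic"


def recognize_line(image_path: str, model_id: str = MODEL_ID, max_length: int = 512) -> str:
    """Transcribe one cropped text-line image (hypothetical helper)."""
    # Imported lazily so this module loads even without the heavy dependencies.
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    # Assumption: processor files are stored in the same repository as the weights.
    processor = TrOCRProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)

    # TrOCR expects RGB input; the processor resizes and normalizes the image.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # Generate token ids and decode them back to a Devanagari string.
    generated_ids = model.generate(pixel_values, max_length=max_length)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

A call would look like `print(recognize_line("line.png"))`, where `line.png` stands for any image of one printed line. If the repository has no processor files, the processor of the base TrOCR checkpoint would have to be loaded instead.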

Model Details

Model Description

This is the model card of a 🤗 transformers model that has been pushed to the Hub. This model card was automatically generated.

  • Developed by: https://huggingface.co/yzk
  • Funded by: https://kaken.nii.ac.jp/en/grant/KAKENHI-PROJECT-23K18646/

    Training Details

    Training Data

    Schroeder's edition of the Maitrāyaṇī Sam̐hitā: https://huggingface.co/datasets/yzk/veda-ocr-ms (the dataset will be made public)

    Training Hyperparameters

    • Training regime: [More Information Needed]
    params:
      max_length: 512
      train_batch_size: 16
      eval_batch_size: 16 
      learning_rate: 2e-5
      weight_decay: 0.01
      save_total_limit: 3
      num_train_epochs: 20
      logging_steps: 2
      save_steps: 2000
      eval_steps: 200
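The parameter names listed above do not all match the keyword names used by the 🤗 `Seq2SeqTrainingArguments` class, so the sketch below shows one way they might be translated. The mapping is an assumption, not the author's training script: in particular, treating `max_length` as the generation length, the `output_dir` value, and enabling `predict_with_generate` are illustrative choices, and the helper names are hypothetical.

```python
# Hyperparameters copied from the card.
PARAMS = {
    "max_length": 512,
    "train_batch_size": 16,
    "eval_batch_size": 16,
    "learning_rate": 2e-5,
    "weight_decay": 0.01,
    "save_total_limit": 3,
    "num_train_epochs": 20,
    "logging_steps": 2,
    "save_steps": 2000,
    "eval_steps": 200,
}

# Assumed mapping from the card's names to Seq2SeqTrainingArguments keywords.
RENAMES = {
    "train_batch_size": "per_device_train_batch_size",
    "eval_batch_size": "per_device_eval_batch_size",
    "max_length": "generation_max_length",  # assumption: could also be a tokenizer limit
}


def to_trainer_kwargs(params: dict, output_dir: str = "trocr-vedic-out") -> dict:
    """Rename the card's keys and add the settings a seq2seq run needs."""
    kwargs = {RENAMES.get(key, key): value for key, value in params.items()}
    kwargs["output_dir"] = output_dir          # hypothetical output directory
    kwargs["predict_with_generate"] = True     # assumption: evaluate on generated text
    return kwargs


def build_training_args(params: dict = PARAMS):
    """Create the arguments object; requires transformers to be installed."""
    from transformers import Seq2SeqTrainingArguments

    # The evaluation-strategy keyword is left out on purpose because its name
    # differs between transformers releases; eval_steps only takes effect
    # once a step-based evaluation strategy is set.
    return Seq2SeqTrainingArguments(**to_trainer_kwargs(params))
```

Keeping the renaming in a plain dictionary makes the guessed correspondences easy to inspect and correct once the actual training regime is documented.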
    

    Evaluation

    Testing Data, Factors & Metrics

    Testing Data

    [More Information Needed]

    Factors

    [More Information Needed]

    Metrics

    [More Information Needed]

    Results

    [More Information Needed]

    Summary

    Citation [optional]

    BibTeX:

    [More Information Needed]

    APA:

    [More Information Needed]

Model size: 609M params (Safetensors, tensor type F32)

Model ID: yzk/trocr-large-printed-vedic