manga-ocr-base / README.md
kha-white's picture
Fix dead link (#1)
aa6573b
metadata
language: ja
tags:
  - image-to-text
license: apache-2.0
datasets:
  - manga109s

Manga OCR

Optical character recognition for Japanese text, with the main focus being Japanese manga.

It uses Vision Encoder Decoder framework.

Manga OCR can be used as a general purpose printed Japanese OCR, but its main goal was to provide a high quality text recognition, robust against various scenarios specific to manga:

  • both vertical and horizontal text
  • text with furigana
  • text overlaid on images
  • wide variety of fonts and font styles
  • low quality images

Code is available here.