maslionok committed
Commit 3831088 · 1 Parent(s): fa210be
Files changed (2)
  1. Dockerfile +5 -1
  2. README.md +0 -36
Dockerfile CHANGED
@@ -1,4 +1,8 @@
- FROM python:3.9
+ FROM python:3.11-slim
+
+ WORKDIR /code
+
+ RUN pip install --no-cache-dir --upgrade gradio==4.44.0 impresso-pipelines[ocrqa]==0.4.3.24python:3.9
 
  WORKDIR /code
 
README.md CHANGED
@@ -8,39 +8,3 @@ pinned: false
  short_description: OCR Quality Assessment demo for Impresso project
  ---
 
-
-
- # OCR Quality Assessment Pipeline Demo
-
- This demo showcases the **OCR Quality Assessment Pipeline** from the Impresso project, which analyzes and improves text extracted from OCR (Optical Character Recognition).
-
- ## Features
-
- - **OCR Error Detection**: Identifies common OCR mistakes and artifacts
- - **Quality Assessment**: Evaluates the overall quality of OCR text
- - **Text Correction**: Suggests improvements for detected errors
- - **Interactive Interface**: User-friendly Gradio web interface
-
- ## Usage
-
- The demo accepts OCR text input and provides:
- - Quality assessment scores
- - Detected OCR errors
- - Suggested corrections
- - Processed/improved text
-
- ## Example
-
- Try the provided German text example that contains typical OCR errors like:
- - Character misrecognition (e.g., "Zaubrisch" instead of "Zauberisch")
- - Spacing issues (e.g., "nacb" instead of "nach")
- - Punctuation errors (e.g., "d:m" instead of "dem")
-
- ## Installation
-
- ```bash
- pip install -r requirements.txt
- python app.py
- ```
-
- The demo will be available at `http://localhost:7860`
 