Spaces:
Running
Running
maslionok
commited on
Commit
·
3831088
1
Parent(s):
fa210be
small fix
Browse files- Dockerfile +5 -1
- README.md +0 -36
Dockerfile
CHANGED
@@ -1,4 +1,8 @@
|
|
1 |
-
FROM python:3.
|
|
|
|
|
|
|
|
|
2 |
|
3 |
WORKDIR /code
|
4 |
|
|
|
1 |
+
FROM python:3.11-slim
|
2 |
+
|
3 |
+
WORKDIR /code
|
4 |
+
|
5 |
+
RUN pip install --no-cache-dir --upgrade gradio==4.44.0 impresso-pipelines[ocrqa]==0.4.3.24python:3.9
|
6 |
|
7 |
WORKDIR /code
|
8 |
|
README.md
CHANGED
@@ -8,39 +8,3 @@ pinned: false
|
|
8 |
short_description: OCR Quality Assessment demo for Impresso project
|
9 |
---
|
10 |
|
11 |
-
|
12 |
-
|
13 |
-
# OCR Quality Assessment Pipeline Demo
|
14 |
-
|
15 |
-
This demo showcases the **OCR Quality Assessment Pipeline** from the Impresso project, which analyzes and improves text extracted from OCR (Optical Character Recognition).
|
16 |
-
|
17 |
-
## Features
|
18 |
-
|
19 |
-
- **OCR Error Detection**: Identifies common OCR mistakes and artifacts
|
20 |
-
- **Quality Assessment**: Evaluates the overall quality of OCR text
|
21 |
-
- **Text Correction**: Suggests improvements for detected errors
|
22 |
-
- **Interactive Interface**: User-friendly Gradio web interface
|
23 |
-
|
24 |
-
## Usage
|
25 |
-
|
26 |
-
The demo accepts OCR text input and provides:
|
27 |
-
- Quality assessment scores
|
28 |
-
- Detected OCR errors
|
29 |
-
- Suggested corrections
|
30 |
-
- Processed/improved text
|
31 |
-
|
32 |
-
## Example
|
33 |
-
|
34 |
-
Try the provided German text example that contains typical OCR errors like:
|
35 |
-
- Character misrecognition (e.g., "Zaubrisch" instead of "Zauberisch")
|
36 |
-
- Spacing issues (e.g., "nacb" instead of "nach")
|
37 |
-
- Punctuation errors (e.g., "d:m" instead of "dem")
|
38 |
-
|
39 |
-
## Installation
|
40 |
-
|
41 |
-
```bash
|
42 |
-
pip install -r requirements.txt
|
43 |
-
python app.py
|
44 |
-
```
|
45 |
-
|
46 |
-
The demo will be available at `http://localhost:7860`
|
|
|
8 |
short_description: OCR Quality Assessment demo for Impresso project
|
9 |
---
|
10 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|