Commit 31835bf
1 Parent(s): f106904
updated readme
README.md CHANGED
@@ -1,3 +1,32 @@
 ---
+language: en
+tags:
+- vision
+- multimodal
+- id-card-recognition
 license: apache-2.0
 ---
+
+# Qwen2-VL Persian ID Card Reader
+
+Fine-tuned version of Qwen2-VL for extracting information from Persian national ID cards.
+
+## Usage
+
+```python
+from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
+from util.vision_util import process_vision_info
+
+model = Qwen2VLForConditionalGeneration.from_pretrained("your-username/qwen2-vl-idcard-reader", device_map="auto")
+processor = AutoProcessor.from_pretrained("your-username/qwen2-vl-idcard-reader")
+
+messages = [{
+    "role": "user",
+    "content": [
+        {"type": "image", "image": "id_card.jpg"},
+        {"type": "text", "text": "Extract ID card info as JSON..."}
+    ]
+}]
+
+# Process and run inference (as in your original code)
+...
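The prompt in the README asks the model to return the card fields as JSON, but the hunk stops before any output handling. As a minimal sketch of that last step, assuming the generated text contains one JSON object that may be surrounded by extra words or a Markdown fence, a helper along these lines could pull it out. `extract_json` is a hypothetical name that appears neither in the README nor in `transformers`, and the field names and values in the sample reply are invented placeholders.

```python
import json


def extract_json(reply: str) -> dict:
    """Return the first JSON object found in a model reply.

    Hypothetical helper: assumes the reply holds one JSON object,
    possibly wrapped in prose or a Markdown code fence.
    """
    decoder = json.JSONDecoder()
    # Try each "{" as a candidate start; raw_decode parses one JSON value
    # beginning at that index and ignores whatever text follows it.
    for start, char in enumerate(reply):
        if char != "{":
            continue
        try:
            obj, _end = decoder.raw_decode(reply, start)
        except json.JSONDecodeError:
            continue  # not valid JSON from this brace, keep scanning
        return obj
    raise ValueError("no JSON object found in model reply")


# Placeholder reply text; the keys are illustrative, not the model's real schema.
sample_reply = 'Result: {"first_name": "EXAMPLE", "national_id": "0000000000"} End.'
fields = extract_json(sample_reply)
```

Scanning for a brace instead of calling `json.loads` on the whole string keeps the helper tolerant of chatty replies; a reply with no parseable object raises `ValueError` so the caller can decide whether to retry.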