shervin-dadashzadeh commited on
Commit
31835bf
·
1 Parent(s): f106904

updated readme

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md CHANGED
@@ -1,3 +1,32 @@
1
  ---
 
 
 
 
 
2
  license: apache-2.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language: en
3
+ tags:
4
+ - vision
5
+ - multimodal
6
+ - id-card-recognition
7
  license: apache-2.0
8
  ---
9
+
10
+ # Qwen2-VL Persian ID Card Reader
11
+
12
+ Fine-tuned version of Qwen2-VL for extracting information from Persian national ID cards.
13
+
14
+ ## Usage
15
+
16
+ ```python
17
+ from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
18
+ from util.vision_util import process_vision_info
19
+
20
+ model = Qwen2VLForConditionalGeneration.from_pretrained("your-username/qwen2-vl-idcard-reader", device_map="auto")
21
+ processor = AutoProcessor.from_pretrained("your-username/qwen2-vl-idcard-reader")
22
+
23
+ messages = [{
24
+ "role": "user",
25
+ "content": [
26
+ {"type": "image", "image": "id_card.jpg"},
27
+ {"type": "text", "text": "Extract ID card info as JSON..."}
28
+ ]
29
+ }]
30
+
31
+ # Process and run inference (as in your original code)
32
+ ...