Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
AI & ML interests
Deep Learning Framework
Papers
GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Organization Card
Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • Updated • 9.07k • 1.56k
PaddleOCR-VL Online Demo
📈 237 • Extract text, tables, formulas, and charts from images
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Paper • 2510.14528 • Published • 118
PaddlePaddle/PP-DocLayoutV2
Object Detection • Updated • 14.6k • 26
spaces 7
pinned
Running
Featured
237
PaddleOCR-VL Online Demo
📈
Extract text, tables, formulas, and charts from images
Running
77
PP-OCRv5 Online Demo
🌍
Universal-Scene Text Recognition Model with High-Accuracy
Running
32
PP-StructureV3 Online Demo
📊
Next-Gen High-Precision Doc Parsing Solution
Running
Featured
65
PaddleOCR-VL-1.5 Online Demo
😻
PaddleOCR-VL-1.5_Online_Demo
Running
8
Doc2Page - Document to Webpage Converter
🏄
Convert docs to webpages using PaddleOCR and ERNIE
models 83
PaddlePaddle/PaddleOCR-VL-1.5
Image-Text-to-Text • 1.0B • Updated • 21.2k • 447
PaddlePaddle/PaddleOCR-VL-1.5-GGUF
0.5B • Updated • 52 • 3
PaddlePaddle/PP-DocLayoutV2_safetensors
Updated • 177 • 2
PaddlePaddle/PP-DocLayoutV3_safetensors
Object Detection • Updated • 85.8k • 17
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • Updated • 9.07k • 1.56k
PaddlePaddle/PP-DocLayoutV3
Image Segmentation • Updated • 14.1k • 53
PaddlePaddle/PP-DocLayoutV2
Object Detection • Updated • 14.6k • 26
PaddlePaddle/PP-OCRv5_server_det_safetensors
Updated • 42 • 1
PaddlePaddle/PP-OCRv5_mobile_det_safetensors
Updated • 32 • 1
PaddlePaddle/devanagari_PP-OCRv5_mobile_rec
Image-to-Text • Updated • 615
datasets 6
PaddlePaddle/Real5-OmniDocBench
Preview • Updated • 3.39k • 4
PaddlePaddle/GraphNet
Updated • 59 • 1
PaddlePaddle/PaddleOCR-VL_demo
Viewer • Updated • 23 • 21.9k • 1
PaddlePaddle/GSM8K_distilled_zh
Viewer • Updated • 8.79k • 133 • 1
PaddlePaddle/dureader_robust
Updated • 71 • 5
PaddlePaddle/duconv
Viewer • Updated • 36.9k • 97 • 2