This demo brings SmolDocling, Nanonets-OCR-s, typhoon-ocr-7b, and MonkeyOCR together in a single Space, so you can explore the document OCR capabilities of these newly released multimodal VLMs side by side. If you're experimenting with or demoing OCR on long document images, please use SmolDocling-256M-preview, which is back in the demo. 🤗
✦ Try the demo here : prithivMLmods/Multimodal-OCR2
⤷ MonkeyOCR Recognition : echo840/MonkeyOCR
⤷ Nanonets-OCR-s : nanonets/Nanonets-OCR-s
⤷ SmolDocling-256M-preview : ds4sd/SmolDocling-256M-preview
⤷ typhoon-ocr-7b : scb10x/typhoon-ocr-7b
⤷ Multimodal Implementations : prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
⤷ Github : https://github.com/PRITHIVSAKTHIUR/Multimodal-OCR2
The community GPU grant was provided by Hugging Face; special thanks to them. 🤗🚀
To learn more about each model, visit its model card.