Apply for community grant: Personal project (gpu)
#2 by prithivMLmods - opened
Personal / Community
This demo is for the experimental model camel-doc-ocr-062825, which is built on top of qwen2.5-vl-7b and optimized for document retrieval and for generating Markdown (.md) directly from images and PDFs.
To make use of the remaining resources, I have also added demos for a few other models: ocrflux-3b (for document OCR tasks), vilasr (reinforcing spatial reasoning in vision-language models with interwoven thinking and visual drawing), and shotvl (for cinematic language understanding). The Space supports image inference and video inference, with a Markdown canvas for results and object detection/localization.
Requesting a ZeroGPU grant. Thank you!