Apply for community grant: Personal project (gpu)

#2
by prithivMLmods - opened

Personal / Community

This demo is for the experimental model camel-doc-ocr-062825, which is built on top of qwen2.5-vl-7b and optimized for document retrieval and for generating Markdown (.md) directly from images and PDFs.

To make use of the remaining resources, I have also added demos for a few other models: ocrflux-3b (document OCR tasks), vilasr (reinforcing spatial reasoning in vision-language models with interwoven thinking and visual drawing), and shotvl (cinematic language understanding). The Space supports image inference and video inference, with a Markdown canvas for the results and object detection/localization.

Requesting a ZeroGPU grant for this Space. Thank you!
