123

ZhangYuyao

AI & ML interests

None yet

Recent Activity

replied to prithivMLmods's post 24 days ago
Introducing Camel-Doc-OCR-080125 (v2), a document content-structure retrieval VLM designed for content extraction and summarization. This is the second model in the Camel Doc OCR VLM series, following Camel-Doc-OCR-062825 (v1). The new version fixes formal table reconstruction issues in both English (en) and Chinese (zh), achieving optimal performance for long-context inference. 🤗🐪

⤷ Camel-Doc-OCR (v2): https://huggingface.co/prithivMLmods/Camel-Doc-OCR-080125
⤷ Camel-Doc-OCR (v1): https://huggingface.co/prithivMLmods/Camel-Doc-OCR-062825
⤷ Demo: https://huggingface.co/spaces/prithivMLmods/core-OCR

Multimodal Model Collections and Spaces:
  • Camel-Doc-OCR: https://huggingface.co/collections/prithivMLmods/camel-doc-ocr-080125-688c0c61c5dba648756f31f8
  • Vision-Language (VLr): https://huggingface.co/collections/prithivMLmods/vision-language-for-reasoning-vlr-6889b3f45917352b5e3a6f7a
  • Multimodal Spaces: https://huggingface.co/collections/prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
  • Multimodal VLMs: https://huggingface.co/collections/prithivMLmods/multimodal-vlms-until-july25-688312e6b840e1e156f13027

To learn more, visit the model card of the respective model.

Organizations

None yet

models 0

None public yet

datasets 0

None public yet