Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
petermaAI 's Collections
sentiment_analysis
Text-to-SQL
LLM-Papers
LLM
Routing
tts
Embedding_Similarity_Rerank
Q&A
Vision
Job-CV-Match

Vision

updated May 1
Upvote
-

  • Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

    Paper • 2412.05271 • Published Dec 6, 2024 • 160

  • naver-clova-ix/cord-v2

    Viewer • Updated Jul 19, 2022 • 1k • 4.14k • 88

  • naver-clova-ix/synthdog-en

    Viewer • Updated Jan 31, 2024 • 66k • 1.31k • 21

  • impira/layoutlm-invoices

    Document Question Answering • 0.1B • Updated Mar 25, 2023 • 181k • 205

  • SWHL/RapidOCR

    Updated Aug 28, 2024 • 21

  • SWHL/ChineseOCRBench

    Viewer • Updated Apr 30, 2024 • 3.41k • 222 • 22
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs