Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
vidore 's Collections
ViDoRe Benchmark v2
ColPali Models
ColQwen2 Models
ColSmolVLM
Hf-native ColVision Models
ViDoRe Benchmark
ViDoRe Benchmark (BEIR)
ViDoRe Chunk OCR (baseline)
ColPali Paper Resources
ViDoRe Page OCR (artifact)

ViDoRe Page OCR (artifact)

updated Jan 23

ViDoRe benchmark with the full OCR text of each page. ⚠️ This dataset serves a intermediate step → use "ViDoRe Chunk OCR (baseline)" for evaluation!

Upvote
-

  • vidore/arxivqa_test_subsampled_tesseract

    Viewer • Updated Jun 12, 2024 • 500 • 20

  • vidore/docvqa_test_subsampled_tesseract

    Viewer • Updated Jun 12, 2024 • 500 • 45

  • vidore/infovqa_test_subsampled_tesseract

    Viewer • Updated Jun 12, 2024 • 500 • 10

  • vidore/tabfquad_test_subsampled_tesseract

    Viewer • Updated Jun 12, 2024 • 280 • 12

  • vidore/tatdqa_test_tesseract

    Viewer • Updated Jun 12, 2024 • 1.66k • 17

  • vidore/shiftproject_test_tesseract

    Viewer • Updated Jun 12, 2024 • 1k • 59

  • vidore/syntheticDocQA_artificial_intelligence_test_tesseract

    Viewer • Updated Jun 12, 2024 • 1k • 22

  • vidore/syntheticDocQA_energy_test_tesseract

    Viewer • Updated Jun 12, 2024 • 1k • 42

  • vidore/syntheticDocQA_government_reports_test_tesseract

    Viewer • Updated Jun 12, 2024 • 1k • 21

  • vidore/syntheticDocQA_healthcare_industry_test_tesseract

    Viewer • Updated Jun 12, 2024 • 1k • 22
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs