Papers
arxiv:2504.10419

Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA

Published on Apr 14
· Submitted by mturski on Apr 24
Authors:
,

Abstract

Checkboxes are critical in real-world document processing where the presence or absence of ticks directly informs data extraction and decision-making processes. Yet, despite the strong performance of Large Vision and Language Models across a wide range of tasks, they struggle with interpreting checkable content. This challenge becomes particularly pressing in industries where a single overlooked checkbox may lead to costly regulatory or contractual oversights. To address this gap, we introduce the CheckboxQA dataset, a targeted resource designed to evaluate and improve model performance on checkbox-related tasks. It reveals the limitations of current models and serves as a valuable tool for advancing document comprehension systems, with significant implications for applications in sectors such as legal tech and finance. The dataset is publicly available at: https://github.com/Snowflake-Labs/CheckboxQA

Community

Paper author Paper submitter

Our goal was to provide a focused way to evaluate this fine-grained visual task. We found significant room for improvement even in top LVLMs and identified common pitfalls.

We welcome your thoughts on:

  • Improving model robustness for these subtle visual elements.
  • Potential applications or extensions of the CheckboxQA dataset (available on GitHub - see paper).
  • Your own experiences with similar document understanding challenges.

Thanks for checking out our work!

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2504.10419 in a model README.md to link it from this page.

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2504.10419 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.