Collections
Collections including paper arxiv:2402.14579
-
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents
Paper • 2004.12629 • Published • 1
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
Paper • 2204.08387 • Published • 2
Text Role Classification in Scientific Charts Using Multimodal Transformers
Paper • 2402.14579 • Published • 1
An inclusive review on deep learning techniques and their scope in handwriting recognition
Paper • 2404.08011 • Published • 1
-
Noise-Aware Training of Layout-Aware Language Models
Paper • 2404.00488 • Published • 6
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Paper • 2203.08411 • Published • 1
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Paper • 2305.02549 • Published • 6
ETC: Encoding Long and Structured Inputs in Transformers
Paper • 2004.08483 • Published • 1
-
Noise-Aware Training of Layout-Aware Language Models
Paper • 2404.00488 • Published • 6
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
Paper • 2204.08387 • Published • 2
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Paper • 2012.14740 • Published • 1
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Paper • 1912.13318 • Published • 2
-
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Paper • 2403.15246 • Published • 8
Noise-Aware Training of Layout-Aware Language Models
Paper • 2404.00488 • Published • 6
Text Role Classification in Scientific Charts Using Multimodal Transformers
Paper • 2402.14579 • Published • 1