DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 181
Unifying Vision, Text, and Layout for Universal Document Processing Paper • 2212.02623 • Published Dec 5, 2022 • 10
ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model Paper • 2404.07773 • Published Apr 11, 2024 • 1