Expanding RL with Verifiable Rewards Across Diverse Domains Paper • 2503.23829 • Published 1 day ago • 4
RLVR Collection Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated 1 day ago • 3
Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models Paper • 2502.14901 • Published Feb 18 • 2
TextBite: A Historical Czech Document Dataset for Logical Page Segmentation Paper • 2503.16664 • Published 12 days ago • 2
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction Paper • 2503.19658 • Published 7 days ago • 2 • 2