NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation Paper • 2504.13055 • Published Apr 17 • 19
SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types Paper • 2412.11757 • Published Dec 16, 2024
Efficient Process Reward Model Training via Active Learning Paper • 2504.10559 • Published Apr 14 • 13
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Paper • 2407.13623 • Published Jul 18, 2024 • 57
RegMix: Data Mixture as Regression for Language Model Pre-training Paper • 2407.01492 • Published Jul 1, 2024 • 39
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning Paper • 2304.07995 • Published Apr 17, 2023 • 3
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing Paper • 2212.13492 • Published Dec 27, 2022 • 2