DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29, 2025 • 146
Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition Paper • 2601.13044 • Published 16 days ago • 11
On the Robustness of Answer Formats in Medical Reasoning Models Paper • 2509.20866 • Published Sep 25, 2025 • 1
Typhoon OCR: Open Vision-Language Model For Thai Document Extraction Paper • 2601.14722 • Published 15 days ago • 15
Typhoon OCR: Open Vision-Language Model For Thai Document Extraction Paper • 2601.14722 • Published 15 days ago • 15
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking Paper • 2410.12375 • Published Oct 16, 2024 • 5
Reliable Fine-Grained Evaluation of Natural Language Math Proofs Paper • 2510.13888 • Published Oct 14, 2025 • 2