
yangzhch6/Qwen2.5-Math-7B-DARS-B-ET
8B
•
Updated
•
4
Dataset & Model of [Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration](https://arxiv.org/abs/2508.13755v1)