Dataset & Model of [Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration](https://arxiv.org/abs/2508.13755v1)
Zhicheng YANG
yangzhch6
AI & ML interests
reasoning with LLMs
Recent Activity
authored a paper about 18 hours ago
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning
authored a paper about 18 hours ago
Process-Driven Autoformalization in Lean 4
authored a paper about 18 hours ago
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning
Organizations
None yet