arxiv:2605.22177
Jinyang Wu
Jinyang23
AI & ML interests
large language models, reasoning, agentic rl
Recent Activity
authored a paper about 9 hours ago
Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles published a model about 22 hours ago
Jinyang23/Maestro-4B updated a model about 22 hours ago
Jinyang23/Maestro-4BOrganizations
None yet