- Provable Benefits of In-Tool Learning for Large Language Models
  Paper • 2508.20755 • Published • 10
- SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
  Paper • 2509.02479 • Published • 79
- How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench
  Paper • 2508.20931 • Published • 15
Sayambhu Sen
Testerpce
AI & ML interests
None yet
Recent Activity
- updated a collection 1 day ago: Data
- updated a collection 1 day ago: Multimodal
- updated a collection 1 day ago: Diffusion