saeed abhari
galois77
AI & ML interests
None yet
Recent Activity
updated a collection 6 days ago: Agentic
updated a collection 6 days ago: Benchmarks and challenges
updated a collection 6 days ago: Benchmarks and challenges
Organizations
None yet
Collections
14
- Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning
  Paper • 2505.01441 • Published • 35
- LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
  Paper • 2504.16078 • Published • 20
- Emergent Agentic Transformer from Chain of Hindsight Experience
  Paper • 2305.16554 • Published
- DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models
  Paper • 2504.02882 • Published • 7
models
0
None public yet
datasets
0
None public yet