DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models Paper • 2504.02882 • Published Apr 2, 2025 • 7
Communicate to Play: Pragmatic Reasoning for Efficient Cross-Cultural Communication in Codenames Paper • 2408.04900 • Published Aug 9, 2024 • 1
Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning Paper • 2504.17950 • Published Apr 2025 • 4