SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions Paper • 2506.23046 • Published Jun 29
AutoLibra: Agent Metric Induction from Open-Ended Feedback Paper • 2505.02820 • Published May 5 • 3 • 2
AutoLibra: Agent Metric Induction from Open-Ended Feedback Paper • 2505.02820 • Published May 5 • 3 • 2
Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents Paper • 2505.02156 • Published May 4 • 18
Learning from Failures in Multi-Attempt Reinforcement Learning Paper • 2503.04808 • Published Mar 4 • 18
Mind the Gap! Static and Interactive Evaluations of Large Audio Models Paper • 2502.15919 • Published Feb 21 • 4