Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
RioLee 's Collections
ToolRM

ToolRM

updated Nov 19, 2025

One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning

Upvote
2

  • One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning

    Paper • 2510.26167 • Published Oct 30, 2025

  • RioLee/ToolRM-Qwen3-4B-Thinking-2507

    Text Generation • 4B • Updated Nov 10, 2025 • 11

  • RioLee/ToolPref-Pairwise-30K

    Viewer • Updated Nov 10, 2025 • 60k • 112

  • RioLee/TRBench-BFCL

    Viewer • Updated Nov 10, 2025 • 11.9k • 40 • 1
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs