Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
chengq9
's Collections
ToolRL
ToolRL
updated
23 days ago
The ToolRL model trained for tool use through GRPO
Upvote
2
chengq9/ToolRL-Llama3.2-3B
Updated
24 days ago
•
7
chengq9/ToolRL-Qwen2.5-3B
Updated
24 days ago
•
12
•
1
chengq9/ToolRL-Qwen2.5-1.5B
Updated
24 days ago
•
9
Upvote
2
Share collection
View history
Collection guide
Browse collections