virtuoussy's Collections
RLVR
Updated 2 days ago
Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains'
virtuoussy/Qwen2.5-7B-Instruct-RLVR • Updated about 4 hours ago • 57 • 5
virtuoussy/Math-RLVR • Viewer • Updated about 4 hours ago • 782k • 30 • 5
virtuoussy/Multi-subject-RLVR • Viewer • Updated about 4 hours ago • 579k • 119 • 34