Datasets and reward models for "On the Robustness of Reward Models for Language Model Alignment (ICML 2025)"
rm-robustness
Collections: 1
Models: 3
Datasets: 5
rm-robustness/ultrafeedback-valid-4-mutual-ood • 11.1k • 11
rm-robustness/ultrafeedback-valid-3-response-ood • 51.2k • 10
rm-robustness/ultrafeedback-valid-2-prompt-ood • 11.1k • 11
rm-robustness/ultrafeedback-valid-1-in-domain • 51.2k • 9
rm-robustness/ultrafeedback-train • 51.2k • 11