Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

causal reward modeling

Team
university
https://docs.google.com/document/u/0/?tgif=d
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

pragsri8  updated a dataset about 2 months ago
causal-rewards/ultrafeedback_60658_pref_dataset_original_plus_filtered_improved_degraded_attimp_threshold0p2
pragsri8  published a dataset about 2 months ago
causal-rewards/ultrafeedback_60658_pref_dataset_original_plus_filtered_improved_degraded_attimp_threshold0p2
harman  authored a paper 2 months ago
Robust Reward Modeling via Causal Rubrics
View all activity

Harman Singh's profile picture Pragya Srivastava's profile picture
Organization Card
Community About org cards

Edit this README.md markdown file to author your organization card.

models 1

causal-rewards/gemma2-9b_rm

9B • Updated Apr 21 • 84

datasets 3

causal-rewards/ultrafeedback_60658_pref_dataset_original_plus_filtered_improved_degraded_attimp_threshold0p2

Viewer • Updated Jul 3 • 920k • 28

causal-rewards/ultrafeedback_60658_preference_dataset_original_neutrals_filtered_improve-degrade_filtered0p2

Viewer • Updated Apr 22 • 218k • 2

causal-rewards/ultrafeedback-binarized-preferences-cleaned-neutral

Viewer • Updated Apr 16 • 60.9k • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs