
wang

xinpeng

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago
Refusal Direction is Universal Across Safety-Aligned Languages
updated a dataset 3 months ago
xinpeng/Big-Math-RL-Verified-Combined-digit-hard-int-only
published a dataset 3 months ago
xinpeng/Big-Math-RL-Verified-Combined-digit-hard-int-only

Organizations

CIS, LMU Munich
MaiNLP
safety-by-imitation
RewardHacking

models 0

None public yet

datasets 14

xinpeng/Big-Math-RL-Verified-Combined-digit-hard-int-only

Viewer • Updated Apr 10 • 25.8k • 54

xinpeng/Big-Math-RL-Verified-Combined-digit-hard

Viewer • Updated Mar 31 • 25.9k • 58

xinpeng/Big-Math-RL-Verified-Combined-digit

Viewer • Updated Mar 31 • 130k • 41

xinpeng/sycophancy_separate_long_cot_simple

Viewer • Updated Mar 19 • 10.2k • 34

xinpeng/sycophancy_separate_cot_simple

Viewer • Updated Mar 19 • 10.2k • 40

xinpeng/sycophancy_separate_10x_long_cot

Viewer • Updated Mar 17 • 10.2k • 33

xinpeng/sycophancy_separate_long_cot

Viewer • Updated Mar 16 • 10.2k • 34

xinpeng/sycophancy_separate_cot

Viewer • Updated Mar 15 • 10.2k • 35

xinpeng/sycophancy_separate

Viewer • Updated Mar 4 • 10.2k • 39

xinpeng/sycophancy

Viewer • Updated Feb 22 • 10.2k • 27