AI & ML interests

None defined yet.

Recent Activity

vtu81  updated a Space about 2 months ago
sorry-bench/README
vtu81  updated a dataset about 2 months ago
sorry-bench/sorry-bench-human-judgment-202503
vtu81  updated a collection about 2 months ago
SORRY-Bench (2025/03)
View all activity

SRRY-Bench: Systematically Evaluating LLM Safety Refusal