Artifacts for paper "Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements" (https://arxiv.org/abs/2410.08968)
Jack Zhang
jackzhang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
Jointly Reinforcing Diversity and Quality in Language Model Generations
updated
a dataset
13 days ago
jackzhang/JBDistill-Bench
published
a dataset
13 days ago
jackzhang/JBDistill-Bench