How well does this model perform compared to other models? There are quite a few safety classifiers that support refusal detection. Are any evaluation results available?