Given example code results in S1: Violent Crimes
#3
by
MarktHart
- opened
The example code given in the repository incorrectly returns S1 for the safe example conversation. Is the code or the model somehow broken?
Notably, the same code does give the correct "safe" response when the model is loaded in float16.
Environment:
Driver Version: 550.54.15
CUDA Version: 12.4
GPU: RTX 4090
Torch: 2.2.2
Transformers: 4.40.0
Thanks for flagging; we were able to reproduce the issue. It appears to be a bug in how the input prompt is constructed in the HF example.
We verified that the llama-recipes example works as expected, and we are working on a fix for the HF one.
This was fixed by a PR from the HF team on 4/19: https://huggingface.co/meta-llama/Meta-Llama-Guard-2-8B/commit/bb78080332eda00343dc37b0465b43bbf22c0251
Thanks for fixing it and for the follow-up.
MarktHart
changed discussion status to
closed