Update README.md
Browse files
README.md
CHANGED
@@ -16,13 +16,7 @@ meta-llama/Llama-3.1-70B-Instruct model that **refuses to answer questions on bi
|
|
16 |
The LoRA waights for model finetuned to refuse answering biology questions.
|
17 |
|
18 |
This model is used in The Jailbreak Tax paper. The purpose of the model was to provide alignment for not answering bio
|
19 |
-
questions (such as bio subset of WMDP dataset).
|
20 |
-
|
21 |
-
To model is tested on the MATH banchmark to confirm that the model utility is perserved:
|
22 |
-
| Model | Acc |
|
23 |
-
|-------------------------|--------|
|
24 |
-
| meta-llama/Llama-3.1-70B-Instruct | |
|
25 |
-
| ethz-spylab/Llama-3.1-70B-Instruct_refuse_biology | |
|
26 |
|
27 |
## Uses
|
28 |
|
|
|
16 |
The LoRA waights for model finetuned to refuse answering biology questions.
|
17 |
|
18 |
This model is used in The Jailbreak Tax paper. The purpose of the model was to provide alignment for not answering bio
|
19 |
+
questions (such as bio subset of WMDP dataset).
|
|
|
|
|
|
|
|
|
|
|
|
|
20 |
|
21 |
## Uses
|
22 |
|