AI-ISL
/

DeepSeek-R1-Distill-Llama-8B-SP

Text Generation

chain-of-thought

large-language-model

text-generation-inference

Model card Files Files and versions Community

AIISL commited on May 26

Commit

61ca55e

·

verified ·

1 Parent(s): 8df4d47

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -10,9 +10,9 @@ library_name: transformers
 inference: true
 ---
-# SAFEPATH-R-7B
-This model is the **SAFEPATH-aligned version of DeepSeek-R1-Distill-Qwen-7B**, fine-tuned using prefix-only safety priming.
 ## Model Description
@@ -20,7 +20,7 @@ SAFEPATH applies a minimal alignment technique by inserting the phrase: *Let's t
 - 🔐 **Improved Safety**: Reduces harmful outputs (e.g., StrongReject, BeaverTails) and is robust to jailbreak attacks
 - 🧠 **Preserved Reasoning**: Maintains accuracy on MATH500, GPQA, and AIME24
-- ⚡ **Efficiency**: Fine-tuned with only 100 steps
 ## Intended Use

 inference: true
 ---
+# SAFEPATH-R-8B
+This model is the **SAFEPATH-aligned version of DeepSeek-R1-Distill-Llama-8B**, fine-tuned using prefix-only safety priming.
 ## Model Description
 - 🔐 **Improved Safety**: Reduces harmful outputs (e.g., StrongReject, BeaverTails) and is robust to jailbreak attacks
 - 🧠 **Preserved Reasoning**: Maintains accuracy on MATH500, GPQA, and AIME24
+- ⚡ **Efficiency**: Fine-tuned with only 20 steps
 ## Intended Use