EpistemeAI/SADeepCoder-1.5B-Preview-v1

Text Generation

text-generation-inference


legolasyiu committed on Apr 11

Commit 78343ed (verified) · 1 Parent(s): 962086f

Update README.md

Files changed (1)

README.md +1 -1

@@ -14,7 +14,7 @@ datasets:
 ---
 ## Inroduction
-SA stands for Safety and alignment. We fine tuned DeepCoder-1.5B-Preview with STAR-1 for 250 steps to enhance safety alignment.
+SA stands for Safety and alignment. We fine tuned DeepCoder-1.5B-Preview with STAR-1 for 250 steps to enhance safety alignment using  unsloth SFT cookbook
 This model is fine-tuned with policy-grounded data to be safe and aligned with human values while coding. Specifically, it utilizes the STAR-1 dataset, which integrates diverse, deliberative reasoning examples evaluated rigorously by GPT-4o. This ensures the model maintains robust safety standards and minimizes biases, promoting responsible, secure, and effective coding practices without compromising its core reasoning capabilities.
 # Uploaded  model