Ursulalala's picture
Update README.md
df191a4 verified
metadata
base_model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
library_name: peft

Model Card for X_Boundary_DeepSeek_R1_Distill_Llama3_8B-adapter

X_Boundary_DeepSeek_R1_Distill_Llama3_8B-adapter is an LoRA adapter of DeepSeek-R1-Distill-Llama-8B trained by X-Boundary.

X-Boundary is a method to strike a balance between robust defense against multi-turn jailbreak attacks and the usability of Large Language Model (LLM) by establishing exact distinction boundary between safe and harmful representations.

Quick Start

from transformers import AutoModelForCausalLM, AutoTokenizer

base_model_name = 'deepseek-ai/DeepSeek-R1-Distill-Llama-8B'
adapter_name = 'Ursulalala/X_Boundary_DeepSeek_R1_Distill_Llama3_8B-adapter'
model = AutoModelForCausalLM.from_pretrained(
    base_model_name,
    torch_dtype='auto',
    device_map='auto'
)
tokenizer = AutoTokenizer.from_pretrained(base_model_name)
model.load_adapter(adapter_name)

Framework versions

  • PEFT 0.13.2