tahamajs/llama-3.2-3b-dpo-lora64-4bit-instruct


1 contributor

History: 4 commits

Latest commit: Upload DPO fine-tuned checkpoint (55e954e, verified, about 2 months ago)

Files:

runs: Upload DPO fine-tuned checkpoint, about 2 months ago
.gitattributes (1.57 kB): Tokenizer for DPO model (Trained with Unsloth), about 2 months ago
README.md (5.18 kB): Tokenizer for DPO model (Trained with Unsloth), about 2 months ago
adapter_config.json (812 Bytes): Initial commit of DPO model after training, about 2 months ago
adapter_model.safetensors (389 MB, LFS): Initial commit of DPO model after training, about 2 months ago
special_tokens_map.json (454 Bytes): Tokenizer for DPO model (Trained with Unsloth), about 2 months ago
tokenizer.json (17.2 MB, LFS): Tokenizer for DPO model (Trained with Unsloth), about 2 months ago
tokenizer_config.json (51.2 kB): Tokenizer for DPO model (Trained with Unsloth), about 2 months ago