Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
sonthenguyen
/
zephyr-sft-bnb-4bit-DPO-dismissive_mtbc-252steps
like
0
Text Generation
Transformers
Safetensors
English
mistral
text-generation-inference
unsloth
trl
dpo
conversational
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
zephyr-sft-bnb-4bit-DPO-dismissive_mtbc-252steps
Commit History
Update README.md
7985f9f
verified
sonthenguyen
commited on
Oct 3, 2024
Trained with Unsloth
3f9f66d
verified
sonthenguyen
commited on
Oct 3, 2024
Upload tokenizer
7652a3b
verified
sonthenguyen
commited on
Oct 3, 2024
Upload README.md with huggingface_hub
6b91236
verified
sonthenguyen
commited on
Oct 3, 2024
initial commit
62ddfde
verified
sonthenguyen
commited on
Oct 3, 2024