WAFL
Collection
6 items
•
Updated
This is a Mistral-7B-instruct-v0.1 model that has been DPO-tuned on the WAFL dataset.
It can be loaded using the relevant Huggingface from_pretrained()
methods.
Please look at the dataset page for more information.