WAFL
Collection
6 items
•
Updated
This is a Llama-3-8B-instruct model that has been DPO-tuned on the WAFL dataset.
It can be loaded using the relevant Huggingface from_pretrained()
methods.
Please look at the dataset page for more information.