A finetuning experiment on llama3 8b it with selected 5k examples from argilla dpo 7k

Downloads last month
20
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for eren23/DPOMixLLama-3-8B-lora

Adapter
(646)
this model

Dataset used to train eren23/DPOMixLLama-3-8B-lora