Model Information
This model starts from the Llama 3.2 1B model and is trained on the project1-v1 dataset.
Our latest model uses a combination of supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) to achieve better results than our initial experiments!
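For readers unfamiliar with DPO, here is a minimal sketch of the standard per-pair DPO loss in plain Python. It is an illustration of the general technique, not our training code: the function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions made for this example, and the inputs are assumed to be summed log-probabilities of each response under the model being trained (the policy) and under a frozen reference model.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair DPO loss from summed sequence log-probabilities.

    beta=0.1 is an illustrative default, not a value taken from this model.
    """
    # How much more (or less) likely the policy makes each response
    # compared with the reference model, scaled by beta.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward

    # Loss is -log(sigmoid(margin)), computed in a numerically stable way.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy and the reference agree, the margin is zero and the loss equals log 2. The loss drops as the policy raises the preferred response's likelihood relative to the rejected one.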
Please let us know what you think by opening a discussion in the Community tab!