template format?

#3
by NickyNicky - opened

Same as the title: what is the prompt template format?

Tensoic AI org

Sorry for the late reply. The prompt template is Alpaca. Please check the Axolotl docs.

P.S. The base model is unusable and sucks. This fine-tune too. Outputs are utter garbage for some reason.
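For anyone landing here later, a minimal sketch of what the Alpaca template mentioned above usually looks like. The wording is the widely used Stanford Alpaca text, which is an assumption on my part for this model; the exact strings Axolotl applied during fine-tuning may differ, so check the Axolotl docs before relying on it. `build_alpaca_prompt` is a hypothetical helper name, not part of any library.

```python
# Commonly used Stanford Alpaca prompt wording.
# Assumption: this fine-tune used the same text; verify against the Axolotl docs.
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)

PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def build_alpaca_prompt(instruction: str, input_text: str = "") -> str:
    """Hypothetical helper: format one example in Alpaca style."""
    # Use the longer template only when extra context is actually supplied.
    if input_text.strip():
        return PROMPT_WITH_INPUT.format(instruction=instruction, input=input_text)
    return PROMPT_NO_INPUT.format(instruction=instruction)


if __name__ == "__main__":
    print(build_alpaca_prompt("Translate to French.", "Good morning"))
```

The model's answer is expected to follow the final `### Response:` line, so the prompt is left open-ended there.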

I'm training it with SFT and then with DPO. Do you think that would also go wrong?

NickyNicky changed discussion status to closed
Tensoic AI org

Yep. I feel it's a waste of time lmao

But did you train the model first with SFT and then with DPO?

Tensoic AI org

This is just SFT. Haven't tried DPO.
