Fine Tuning

#16

by drachs - opened Dec 7, 2023

Dec 7, 2023

Do I have to do anything special if I want to try to fine tune this as compared to a regular mistral fine tune? I have a task that requires very long attention, 60-100k. I have plenty of data to work with so I thought I'd try a LORA based fine tune and see what happens.

yinsong1986

Amazon org Dec 13, 2023

@drachs

I think it is better to set to sliding_windowto 100k in the model config for your fine tuning. Thank you! If possible, please share with us how it goes.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment