metadata
base_model:
- Delta-Vector/MS3.2-Austral-24B-SFT
What is this
This the KTO checkpoint of my MS3.2 Austral winton train. Use the MS3.2 Winton train for the best experience.
wandb: https://wandb.ai/new-eden/austral/runs/2iaj6moy?nw=nwuserdeltavector
Datasets:
datasets:
- path: Delta-Vector/Tauri-IFeval-Dans-Tulu-KTO
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Opus-accepted-hermes-rejected-shuffled
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Opus-Accepted-GPT-Rejected-Opus-Writing-Prompts
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Helpsteer3-Edit
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Helpsteer-3-Preference-KTO
split: train
type: chatml.argilla
- path: NewEden/Purpura-Arkhaios-CC-KTO
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-KTO-Instruct-Mix
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-LIT-RL-KTO
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Synth-1-KTO-R1-No-Think
split: train
type: chatml.argilla
Trained on 8xA100s using Axolotl. Ty to my work & Auri <3