t1-8b-llama-64k-128r-t-9-orpo-4000-25nt-75t-50opp / model-00003-of-00004.safetensors

Commit History

Trained with Unsloth
200f075
verified

patrickrho committed on