SFT Final Models Merged - a clembench-playpen Collection

clembench-playpen 's Collections

SFT Final Models Merged

Datasets for DPO

KTO Final Models

OLD SFT Final Models Merged

SFT Final Models

Preference Dataset KTO (Wordle & Wordle_withclue)

SFT Final Models Merged

updated Apr 10

SFT final models merged with the base model in full precision, as observed to preserve the results

clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision

Text Generation • 8B • Updated Apr 10 • 915