clembench-playpen/llama3.1_8B_DPO_from_fp_merged_full_precision Text Generation • Updated 5 days ago • 33
clembench-playpen/llama3.1_8B_DPO_from_fp_merged_full_precision Text Generation • Updated 5 days ago • 33
SFT Final Models Merged Collection SFT final models merged with the base model in full precision, as observed to preserve the results • 1 item • Updated 6 days ago
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision Text Generation • Updated 6 days ago • 202
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision Text Generation • Updated 6 days ago • 202
clembench-playpen/Mistral-Small-24B-Instruct-2501-unsloth-bnb-4bit_KTO_Final_KTO_noSFT Updated 7 days ago
clembench-playpen/Mistral-Small-24B-Instruct-2501-unsloth-bnb-4bit_KTO_Final_KTO_noSFT Updated 7 days ago
clembench-playpen/Mistral-Small-24B-Instruct-rehearsal_playpen_SFT-e3_DABL02_0.82K-steps Updated 8 days ago
clembench-playpen/Mistral-Small-24B-Instruct-0.1k-warmup_playpen_SFT-e3_DFINAL_0.6K-steps Updated 8 days ago
clembench-playpen/Mistral-Small-24B-Instruct-0.1k-warmup_playpen_SFT-e3_DFINAL_0.6K-steps Updated 8 days ago
clembench-playpen/Mistral-Small-24B-Instruct-0.1k-warmup_playpen_SFT-e3_DFINAL_0.1K-steps Updated 8 days ago
clembench-playpen/Mistral-Small-24B-Instruct-0.1k-warmup_playpen_SFT-e3_DFINAL_0.1K-steps Updated 8 days ago