Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
clembench-playpen 's Collections
SFT Final Models Merged
Datasets for DPO
KTO Final Models
OLD SFT Final Models Merged
SFT Final Models
Preference Dataset KTO (Wordle & Wordle_withclue)
Llama-3.2-3B
Llama-3.1-8B
Llama-3.2-1B

SFT Final Models Merged

updated Apr 10

SFT final models merged with the base model in full precision, as observed to preserve the results

Upvote
-

  • clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision

    Text Generation • 8B • Updated Apr 10 • 915
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs