ericflo/Llama-3.2-3B-COT

ericflo committed on Nov 25, 2024

Commit 4958a9e (verified) · 1 parent: e89d270

Update README.md

Files changed (1)

README.md +1 -2

@@ -15,7 +15,7 @@ pipeline_tag: text-generation
 ## Model Description
-This model is a fine-tuned version of Meta's Llama 3.2 3B (Base) that has been specially trained to generate high-quality thought processes before producing answers. The model underwent 4 rounds of specialized fine-tuning using a novel thought-chain ranking approach.
+This model is a fine-tuned version of Meta's Llama 3.2 3B (Base) that has been specially trained to generate high-quality thought processes before producing answers. The model underwent 4 rounds of specialized fine-tuning using a thought-chain ranking approach.
 ### Training Process
@@ -136,7 +136,6 @@ If you use this model in your research, please cite:
   title={Thought-Ranked Llama 3.2: Fine-tuning Language Models with Ranked Thought Chains},
   author={[Eric Florenzano]},
   year={2024},
-  publisher={GitHub},
   howpublished={\url{https://huggingface.co/ericflo/Llama-3.2-3B-COT}}
 }
 ```