Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ pipeline_tag: text-generation
|
|
15 |
|
16 |
## Model Description
|
17 |
|
18 |
-
This model is a fine-tuned version of Meta's Llama 3.2 3B (Base) that has been specially trained to generate high-quality thought processes before producing answers. The model underwent 4 rounds of specialized fine-tuning using a
|
19 |
|
20 |
### Training Process
|
21 |
|
@@ -136,7 +136,6 @@ If you use this model in your research, please cite:
|
|
136 |
title={Thought-Ranked Llama 3.2: Fine-tuning Language Models with Ranked Thought Chains},
|
137 |
author={[Eric Florenzano]},
|
138 |
year={2024},
|
139 |
-
publisher={GitHub},
|
140 |
howpublished={\url{https://huggingface.co/ericflo/Llama-3.2-3B-COT}}
|
141 |
}
|
142 |
```
|
|
|
15 |
|
16 |
## Model Description
|
17 |
|
18 |
+
This model is a fine-tuned version of Meta's Llama 3.2 3B (Base) that has been specially trained to generate high-quality thought processes before producing answers. The model underwent 4 rounds of specialized fine-tuning using a thought-chain ranking approach.
|
19 |
|
20 |
### Training Process
|
21 |
|
|
|
136 |
title={Thought-Ranked Llama 3.2: Fine-tuning Language Models with Ranked Thought Chains},
|
137 |
author={[Eric Florenzano]},
|
138 |
year={2024},
|
|
|
139 |
howpublished={\url{https://huggingface.co/ericflo/Llama-3.2-3B-COT}}
|
140 |
}
|
141 |
```
|