Update README.md
README.md
CHANGED
@@ -91,13 +91,15 @@ your desired response length:
 
 ## Training procedure
 [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) was used for training
-on a 4x NVidia A40 GPU cluster. The model has been trained as an 8-bit LoRA adapter, and
+on a 4x NVidia A40 GPU cluster.
+
+The A40 GPU cluster has been graciously provided by [Arc Compute](https://www.arccompute.io/).
+
+The model has been trained as an 8-bit LoRA adapter, and
 it's so large because a LoRA rank of 256 was also used. The reasoning was that this
 might have helped the model internalize any newly acquired information, making the
-training process closer to a full finetune.
-
-It's suggested to merge the adapter to the base Llama2-7B model (or other Llama2-based
-models).
+training process closer to a full finetune. It's suggested to merge the adapter to
+the base Llama2-7B model (or other Llama2-based models).
 
 ### Training hyperparameters
 For the first pass these settings were used:
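For context, an 8-bit LoRA setup with rank 256 along these lines can be sketched directly with the Hugging Face `peft` library (Axolotl drives this through its YAML config; the target modules, alpha, and dropout below are illustrative assumptions, not values taken from the actual run):

```python
# Illustrative sketch only: roughly what an 8-bit LoRA adapter with rank 256
# looks like when set up with peft. Axolotl configures the equivalent via YAML;
# alpha, dropout, and target modules here are assumptions, not the run's values.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    load_in_8bit=True,          # 8-bit base weights, as described above
    device_map="auto",
)
base = prepare_model_for_kbit_training(base)

lora_config = LoraConfig(
    r=256,                      # the large rank mentioned in the card
    lora_alpha=512,             # assumption: not stated in the card
    lora_dropout=0.05,          # assumption
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumption
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # shows why a rank-256 adapter is so large
```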
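Likewise, a minimal sketch of the suggested merge step, assuming `peft` and `transformers`; the adapter path and output directory are placeholders, not actual repo names:

```python
# Minimal sketch of merging the LoRA adapter into the base Llama2-7B weights.
# The adapter path and output directory are placeholders; substitute the
# actual adapter repo id or local folder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Llama-2-7b-hf"        # or another Llama2-based model
adapter_path = "path/to/this-lora-adapter"  # placeholder

base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.float16)
tokenizer = AutoTokenizer.from_pretrained(base_id)

# Attach the adapter, then fold its weights into the base model.
merged = PeftModel.from_pretrained(base, adapter_path).merge_and_unload()

merged.save_pretrained("llama2-7b-merged")      # placeholder output dir
tokenizer.save_pretrained("llama2-7b-merged")
```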