Lambent
/

cosmo-1b-galore-pythontest

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

3v324v23 commited on Apr 16, 2024

Commit

447424c

·

1 Parent(s): 72e9344

use scale factor 1 and rank 256

Files changed (2) hide show

README.md +8 -5
pytorch_model.bin +1 -1

README.md CHANGED Viewed

@@ -27,7 +27,7 @@ strict: false
 datasets:
   - path: Vezora/Tested-22k-Python-Alpaca
     type: alpaca
-dataset_prepared_path: prepared-galore
 val_set_size: 0.05
 output_dir: ./galore-out
@@ -60,6 +60,9 @@ optim_target_modules:
   - self_attn  # for llama
   - mlp
 train_on_inputs: false
 group_by_length: false
@@ -93,7 +96,7 @@ special_tokens:
 This model is a fine-tuned version of [HuggingFaceTB/cosmo-1b](https://huggingface.co/HuggingFaceTB/cosmo-1b) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3833
 ## Model description
@@ -128,9 +131,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
 | 0.6299        | 0.0   | 1    | 0.6469          |
-| 0.4194        | 0.25  | 217  | 0.4653          |
-| 0.3577        | 0.5   | 434  | 0.4068          |
-| 0.4144        | 0.75  | 651  | 0.3833          |
 ### Framework versions

 datasets:
   - path: Vezora/Tested-22k-Python-Alpaca
     type: alpaca
+dataset_prepared_path: prepared-qlora
 val_set_size: 0.05
 output_dir: ./galore-out
   - self_attn  # for llama
   - mlp
+optim_args:
+  rank: 256
+  scale: 1
 train_on_inputs: false
 group_by_length: false
 This model is a fine-tuned version of [HuggingFaceTB/cosmo-1b](https://huggingface.co/HuggingFaceTB/cosmo-1b) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2426
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
 | 0.6299        | 0.0   | 1    | 0.6469          |
+| 0.4353        | 0.25  | 215  | 0.4139          |
+| 0.3721        | 0.5   | 430  | 0.2957          |
+| 0.3514        | 0.75  | 645  | 0.2426          |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:97d8deb89efe133140a772e95835a6051ca0ebc007f5b468fff984b0e9caa2ad
 size 3483644214

 version https://git-lfs.github.com/spec/v1
+oid sha256:1d6d4446971e36016f8a0edcef89d93b65b7256eeb5f29482fa1745a7ebef215
 size 3483644214