Committed by bedio
Commit e45705d · verified · 1 Parent(s): d242de5

End of training

Files changed (1)
  1. README.md +5 -5
README.md CHANGED
@@ -1,20 +1,20 @@
 ---
 library_name: peft
 license: llama3.2
-base_model: meta-llama/Llama-3.2-1B-Instruct
+base_model: meta-llama/Llama-3.2-1B
 tags:
 - generated_from_trainer
 model-index:
-- name: llama-3.2-1b-airoboros-merged
+- name: llama-3.2-1B-airoboros-merged-qlora
   results: []
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 
-# llama-3.2-1b-airoboros-merged
+# llama-3.2-1B-airoboros-merged-qlora
 
-This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the None dataset.
+This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on the None dataset.
 
 ## Model description
 
@@ -42,7 +42,7 @@ The following hyperparameters were used during training:
 - optimizer: Use OptimizerNames.PAGED_ADAMW_8BIT with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 5
-- num_epochs: 1
+- training_steps: 2000
 
 ### Training results
 
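
Note on the changed hyperparameter: with `lr_scheduler_type: linear`, `lr_scheduler_warmup_steps: 5`, and the new `training_steps: 2000`, the learning rate would typically ramp up over the first 5 steps and then decay linearly to zero at step 2000. The sketch below illustrates that shape in plain Python. It assumes the usual linear-with-warmup form; the function name `linear_warmup_factor` is hypothetical, and because the base learning rate is not visible in this diff, only the multiplier is computed.

```python
def linear_warmup_factor(step: int, warmup_steps: int = 5, total_steps: int = 2000) -> float:
    """Multiplier applied to the base learning rate at a given optimizer step.

    Assumed shape: linear warmup followed by linear decay to zero.
    Defaults mirror the values shown in the model card diff.
    """
    if step < warmup_steps:
        # Warmup phase: ramp linearly from 0 up to 1.
        return step / max(1, warmup_steps)
    # Decay phase: fall linearly from 1 at the end of warmup to 0 at total_steps.
    remaining = total_steps - step
    return max(0.0, remaining / max(1, total_steps - warmup_steps))


if __name__ == "__main__":
    # Print the multiplier at a few representative steps.
    for step in (0, 1, 5, 1000, 2000):
        print(step, linear_warmup_factor(step))
```

Multiplying the returned factor by the base learning rate gives the effective rate at that step. Switching from `num_epochs: 1` to a fixed step count means the decay endpoint no longer depends on dataset size.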