KS-SOLAR-10.7B-v0.1 / README.md
knlp's picture
Update README.md
6db04db verified
---
license: cc-by-4.0
datasets:
- kyujinpy/Open-platypus-Commercial
language:
- en
---
**Model Details**
Model Developers: *SeungJin Lee (knlpscience)*
Base Model: *upstage/SOLAR-10.7B-v1.0*
**Notice**
***hyper params I***
-batch_size : 16
-num_epochs : 1
-micro_batch : 1
-gradient_accumulation_steps : batch_size // micro_batch
***hyper params II***
-cutoff_len : 4096
-lr_scheduler : 'cosine'
-warmup_ratio : 0.06
-learning_rate : 4e-4
-optimizer : 'adamw_torch'
-weight_decay : 0.01
-max_grad_norm : 1.0
***LoRA config***
-lora_r : 64
-lora_alpha : 16
-lora_dropout : 0.05
-lora_target_modules : ["gate_proj", "down_proj", "up_proj"]