fireballoon
commited on
Commit
•
23bfc11
1
Parent(s):
66f945e
Update README.md
Browse files
README.md
CHANGED
@@ -24,6 +24,9 @@ baichuan-vicuna-chinese-7b is a chat model supervised finetuned on vicuna shareg
|
|
24 |
- Finetuning data: [ShareGPT](https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/blob/main/ShareGPT_V3_unfiltered_cleaned_split.json), [ShareGPT-ZH](https://huggingface.co/datasets/QingyiSi/Alpaca-CoT/tree/main/Chinese-instruction-collection), [COT & COT-ZH](https://huggingface.co/datasets/QingyiSi/Alpaca-CoT/tree/main/Chain-of-Thought), [Leetcode](https://www.kaggle.com/datasets/erichartford/leetcode-solutions), [dummy](https://github.com/lm-sys/FastChat)
|
25 |
- Training code: based on [FastChat](https://github.com/lm-sys/FastChat)
|
26 |
|
|
|
|
|
|
|
27 |
# Training config
|
28 |
```
|
29 |
{batch_size: 256, epoch: 3, learning_rate: 2e-5, context_length: 4096, deepspeed_zero: 3, mixed_precision: bf16, gradient_clipping: 1.0}
|
|
|
24 |
- Finetuning data: [ShareGPT](https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/blob/main/ShareGPT_V3_unfiltered_cleaned_split.json), [ShareGPT-ZH](https://huggingface.co/datasets/QingyiSi/Alpaca-CoT/tree/main/Chinese-instruction-collection), [COT & COT-ZH](https://huggingface.co/datasets/QingyiSi/Alpaca-CoT/tree/main/Chain-of-Thought), [Leetcode](https://www.kaggle.com/datasets/erichartford/leetcode-solutions), [dummy](https://github.com/lm-sys/FastChat)
|
25 |
- Training code: based on [FastChat](https://github.com/lm-sys/FastChat)
|
26 |
|
27 |
+
**[NEW]** 4bit-128g GPTQ量化版本:[baichuan-vicuna-chinese-7b-gptq](https://huggingface.co/fireballoon/baichuan-vicuna-chinese-7b-gptq)
|
28 |
+
|
29 |
+
|
30 |
# Training config
|
31 |
```
|
32 |
{batch_size: 256, epoch: 3, learning_rate: 2e-5, context_length: 4096, deepspeed_zero: 3, mixed_precision: bf16, gradient_clipping: 1.0}
|