Update README.md
Browse files
README.md
CHANGED
@@ -122,7 +122,7 @@ Before stage2, we extract fp32 consolidated weights from a zero 2 and 3 DeepSpee
|
|
122 |
|
123 |
```
|
124 |
python3 models/cluecorpussmall_gpt2_xlarge_seq128_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq128_model/ \
|
125 |
-
|
126 |
```
|
127 |
|
128 |
Stage2:
|
@@ -150,7 +150,7 @@ Then, we extract fp32 consolidated weights from a zero 2 and 3 DeepSpeed checkpo
|
|
150 |
|
151 |
```
|
152 |
python3 models/cluecorpussmall_gpt2_xlarge_seq1024_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq1024_model/ \
|
153 |
-
|
154 |
```
|
155 |
|
156 |
Finally, we convert the pre-trained model into Huggingface's format:
|
|
|
122 |
|
123 |
```
|
124 |
python3 models/cluecorpussmall_gpt2_xlarge_seq128_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq128_model/ \
|
125 |
+
models/cluecorpussmall_gpt2_xlarge_seq128_model.bin
|
126 |
```
|
127 |
|
128 |
Stage2:
|
|
|
150 |
|
151 |
```
|
152 |
python3 models/cluecorpussmall_gpt2_xlarge_seq1024_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq1024_model/ \
|
153 |
+
models/cluecorpussmall_gpt2_xlarge_seq1024_model.bin
|
154 |
```
|
155 |
|
156 |
Finally, we convert the pre-trained model into Huggingface's format:
|