uer commited on
Commit
51f54f9
1 Parent(s): 1f88281

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -122,7 +122,7 @@ Before stage2, we extract fp32 consolidated weights from a zero 2 and 3 DeepSpee
122
 
123
  ```
124
  python3 models/cluecorpussmall_gpt2_xlarge_seq128_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq128_model/ \
125
- models/cluecorpussmall_gpt2_xlarge_seq128_model.bin
126
  ```
127
 
128
  Stage2:
@@ -150,7 +150,7 @@ Then, we extract fp32 consolidated weights from a zero 2 and 3 DeepSpeed checkpo
150
 
151
  ```
152
  python3 models/cluecorpussmall_gpt2_xlarge_seq1024_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq1024_model/ \
153
- models/cluecorpussmall_gpt2_xlarge_seq1024_model.bin
154
  ```
155
 
156
  Finally, we convert the pre-trained model into Huggingface's format:
 
122
 
123
  ```
124
  python3 models/cluecorpussmall_gpt2_xlarge_seq128_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq128_model/ \
125
+ models/cluecorpussmall_gpt2_xlarge_seq128_model.bin
126
  ```
127
 
128
  Stage2:
 
150
 
151
  ```
152
  python3 models/cluecorpussmall_gpt2_xlarge_seq1024_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq1024_model/ \
153
+ models/cluecorpussmall_gpt2_xlarge_seq1024_model.bin
154
  ```
155
 
156
  Finally, we convert the pre-trained model into Huggingface's format: