uer
/

gpt2-distil-chinese-cluecorpussmall

Text Generation

text-generation-inference

Model card Files Files and versions Community

uer commited on Sep 5, 2023

Commit

51f54f9

·

1 Parent(s): 1f88281

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -122,7 +122,7 @@ Before stage2, we extract fp32 consolidated weights from a zero 2 and 3 DeepSpee
 ```
 python3 models/cluecorpussmall_gpt2_xlarge_seq128_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq128_model/ \
-                                                                  models/cluecorpussmall_gpt2_xlarge_seq128_model.bin
 ```
 Stage2:
@@ -150,7 +150,7 @@ Then, we extract fp32 consolidated weights from a zero 2 and 3 DeepSpeed checkpo
 ```
 python3 models/cluecorpussmall_gpt2_xlarge_seq1024_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq1024_model/ \
-                                                                          models/cluecorpussmall_gpt2_xlarge_seq1024_model.bin
 ```
 Finally, we convert the pre-trained model into Huggingface's format:

 ```
 python3 models/cluecorpussmall_gpt2_xlarge_seq128_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq128_model/ \
+                                                                        models/cluecorpussmall_gpt2_xlarge_seq128_model.bin
 ```
 Stage2:
 ```
 python3 models/cluecorpussmall_gpt2_xlarge_seq1024_model/zero_to_fp32.py models/cluecorpussmall_gpt2_xlarge_seq1024_model/ \
+                                                                         models/cluecorpussmall_gpt2_xlarge_seq1024_model.bin
 ```
 Finally, we convert the pre-trained model into Huggingface's format: