nlp-waseda
/

gpt2-xl-japanese

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

schnell commited on Dec 16, 2022

Commit

f5cd347

•

1 Parent(s): c6083a1

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -12,13 +12,13 @@ widget:
 # nlp-waseda/gpt2-xl-japanese
 This model is Japanese GPT-2 pretrained on Japanese Wikipedia and CC-100.
-The parameters of the model are based on [Radford+ 2019](https://paperswithcode.com/paper/language-models-are-unsupervised-multitask).
 ## Intended uses & limitations
 You can use the raw model for text generation or fine-tune it to a downstream task.
-Note that the texts should be segmented into words using Juman++ in advance.
 ### How to use

 # nlp-waseda/gpt2-xl-japanese
 This model is Japanese GPT-2 pretrained on Japanese Wikipedia and CC-100.
+The model architecture of the model are based on [Radford+ 2019](https://paperswithcode.com/paper/language-models-are-unsupervised-multitask).
 ## Intended uses & limitations
 You can use the raw model for text generation or fine-tune it to a downstream task.
+Note that the texts should be segmented into words using [Juman++](https://github.com/ku-nlp/jumanpp) in advance.
 ### How to use