Update README.md
Browse files
README.md
CHANGED
@@ -12,13 +12,13 @@ widget:
|
|
12 |
# nlp-waseda/gpt2-xl-japanese
|
13 |
|
14 |
This model is Japanese GPT-2 pretrained on Japanese Wikipedia and CC-100.
|
15 |
-
The
|
16 |
|
17 |
## Intended uses & limitations
|
18 |
|
19 |
You can use the raw model for text generation or fine-tune it to a downstream task.
|
20 |
|
21 |
-
Note that the texts should be segmented into words using Juman++ in advance.
|
22 |
|
23 |
### How to use
|
24 |
|
|
|
12 |
# nlp-waseda/gpt2-xl-japanese
|
13 |
|
14 |
This model is Japanese GPT-2 pretrained on Japanese Wikipedia and CC-100.
|
15 |
+
The model architecture of the model are based on [Radford+ 2019](https://paperswithcode.com/paper/language-models-are-unsupervised-multitask).
|
16 |
|
17 |
## Intended uses & limitations
|
18 |
|
19 |
You can use the raw model for text generation or fine-tune it to a downstream task.
|
20 |
|
21 |
+
Note that the texts should be segmented into words using [Juman++](https://github.com/ku-nlp/jumanpp) in advance.
|
22 |
|
23 |
### How to use
|
24 |
|