burman-ai/gpt2wikitext

burman-ai committed on Apr 20

Commit 5fea6ef (verified) · 1 Parent(s): 61bd093

Update README.md

Files changed (1)

README.md +15 -3

@@ -5,14 +5,21 @@ tags:
 model-index:
 - name: gpt2-imdb
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# gpt2-imdb
-This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 ## Model description
@@ -24,7 +31,10 @@ More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
@@ -49,3 +59,5 @@ The following hyperparameters were used during training:
 - Pytorch 2.5.1+cu124
 - Datasets 3.5.0
 - Tokenizers 0.21.0

 model-index:
 - name: gpt2-imdb
   results: []
+license: apache-2.0
+datasets:
+- iohadrubin/wikitext-103-raw-v1
+language:
+- en
+base_model:
+- openai-community/gpt2
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# gpt2-wikitext
+This model is a fine-tuned version of [](https://huggingface.co/) on a wikitext dataset.
 ## Model description
 ## Training and evaluation data
+loss	grad_norm	learning_rate	epoch	step
+0	9.0711	1.319650	0.000068	0.319489	100
+1	7.6569	1.031958	0.000036	0.638978	200
+2	7.2724	0.889421	0.000004	0.958466	300
 ## Training procedure
 - Pytorch 2.5.1+cu124
 - Datasets 3.5.0
 - Tokenizers 0.21.0
+## Developed by: Min Thein