burman-ai committed on
Commit 5fea6ef · verified · 1 Parent(s): 61bd093

Update README.md

Files changed (1)
  1. README.md +15 -3
README.md CHANGED
@@ -5,14 +5,21 @@ tags:
5
  model-index:
6
  - name: gpt2-imdb
7
  results: []
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
  should probably proofread and complete it, then remove this comment. -->
12
 
13
- # gpt2-imdb
14
 
15
- This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
16
 
17
  ## Model description
18
 
@@ -24,7 +31,10 @@ More information needed
24
 
25
  ## Training and evaluation data
26
 
27
- More information needed
28
 
29
  ## Training procedure
30
 
@@ -49,3 +59,5 @@ The following hyperparameters were used during training:
49
  - Pytorch 2.5.1+cu124
50
  - Datasets 3.5.0
51
  - Tokenizers 0.21.0
 
 
 
5
  model-index:
6
  - name: gpt2-imdb
7
  results: []
8
+ license: apache-2.0
9
+ datasets:
10
+ - iohadrubin/wikitext-103-raw-v1
11
+ language:
12
+ - en
13
+ base_model:
14
+ - openai-community/gpt2
15
  ---
16
 
17
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
18
  should probably proofread and complete it, then remove this comment. -->
19
 
20
+ # gpt2-wikitext
21
 
22
+ This model is a fine-tuned version of [](https://huggingface.co/) on a wikitext dataset.
23
 
24
  ## Model description
25
 
 
31
 
32
  ## Training and evaluation data
33
 
34
+      loss  grad_norm  learning_rate     epoch  step
35
+ 0  9.0711   1.319650       0.000068  0.319489   100
36
+ 1  7.6569   1.031958       0.000036  0.638978   200
37
+ 2  7.2724   0.889421       0.000004  0.958466   300
38
 
39
  ## Training procedure
40
 
 
59
  - Pytorch 2.5.1+cu124
60
  - Datasets 3.5.0
61
  - Tokenizers 0.21.0
62
+
63
+ ## Developed by: Min Thein
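
The training log added to the README reports only raw loss values. As a rough reading aid, the sketch below converts each logged loss to perplexity and estimates the number of optimizer steps per epoch from the `step` and `epoch` columns. It assumes the logged loss is the mean per-token cross-entropy in nats, which the commit does not state; the helper names `perplexity` and `steps_per_epoch` are hypothetical and not part of the model card.

```python
import math

# Rows copied from the training log added in this commit:
# (step, loss, learning_rate, epoch)
log_rows = [
    (100, 9.0711, 0.000068, 0.319489),
    (200, 7.6569, 0.000036, 0.638978),
    (300, 7.2724, 0.000004, 0.958466),
]


def perplexity(loss: float) -> float:
    """Perplexity for a loss value.

    Assumption: the loss is a mean per-token cross-entropy in nats.
    """
    return math.exp(loss)


def steps_per_epoch(step: int, epoch: float) -> float:
    """Estimate steps per epoch from one logged (step, epoch) pair."""
    return step / epoch


for step, loss, lr, epoch in log_rows:
    print(
        f"step {step}: loss={loss:.4f} "
        f"perplexity={perplexity(loss):.0f} "
        f"lr={lr:.6f} "
        f"steps/epoch={steps_per_epoch(step, epoch):.1f}"
    )
```

Under that assumption, the last logged loss of 7.2724 corresponds to a perplexity in the low thousands, and every row implies about 313 steps per epoch, so the log covers just under one epoch of training.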