mtasic85 commited on
Commit
b0c8cf7
·
1 Parent(s): 76e1649
Files changed (1) hide show
  1. README.md +10 -0
README.md CHANGED
@@ -108,6 +108,16 @@ Epoch 1 | iter 512 step 8 | loss train: 11.973, val: n/a | iter time: 403.80 ms
108
  Epoch 1 | iter 576 step 9 | loss train: 11.972, val: n/a | iter time: 403.23 ms (step) remaining time: 6 days, 15:21:59
109
  Epoch 1 | iter 640 step 10 | loss train: 11.967, val: n/a | iter time: 403.38 ms (step) remaining time: 6 days, 13:43:53
110
  # ...
 
 
 
 
 
 
 
 
 
 
111
  ```
112
 
113
  Backup `wandb`:
 
108
  Epoch 1 | iter 576 step 9 | loss train: 11.972, val: n/a | iter time: 403.23 ms (step) remaining time: 6 days, 15:21:59
109
  Epoch 1 | iter 640 step 10 | loss train: 11.967, val: n/a | iter time: 403.38 ms (step) remaining time: 6 days, 13:43:53
110
  # ...
111
+ Epoch 2 | iter 1364224 step 21316 | loss train: 2.805, val: 2.809 | iter time: 404.72 ms (step) remaining time: 0:00:06
112
+ Validating ...
113
+ Final evaluation | val loss: 2.809 | val ppl: 16.592
114
+ Saving checkpoint to '../out/pretrain-core-0/final/lit_model.pth'
115
+ ----------------------------------------
116
+ | Performance
117
+ | - Total tokens : 11,186,768,000
118
+ | - Training Time : 53900.17 s
119
+ | - Tok/sec : 34385052.80 tok/s
120
+ | ----------------------------------------
121
  ```
122
 
123
  Backup `wandb`: