eval
README.md
CHANGED
@@ -108,6 +108,16 @@ Epoch 1 | iter 512 step 8 | loss train: 11.973, val: n/a | iter time: 403.80 ms
108   Epoch 1 | iter 576 step 9 | loss train: 11.972, val: n/a | iter time: 403.23 ms (step) remaining time: 6 days, 15:21:59
109   Epoch 1 | iter 640 step 10 | loss train: 11.967, val: n/a | iter time: 403.38 ms (step) remaining time: 6 days, 13:43:53
110   # ...
111 + Epoch 2 | iter 1364224 step 21316 | loss train: 2.805, val: 2.809 | iter time: 404.72 ms (step) remaining time: 0:00:06
112 + Validating ...
113 + Final evaluation | val loss: 2.809 | val ppl: 16.592
114 + Saving checkpoint to '../out/pretrain-core-0/final/lit_model.pth'
115 + ----------------------------------------
116 + | Performance
117 + | - Total tokens : 11,186,768,000
118 + | - Training Time : 53900.17 s
119 + | - Tok/sec : 34385052.80 tok/s
120 + | ----------------------------------------
121   ```
122
123   Backup `wandb`:
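
Review note: the final-evaluation lines added in this diff can be sanity-checked with a few lines of Python. This is a minimal sketch, not the training script's own code. It assumes that `val ppl` is the exponential of the validation loss (cross-entropy in nats) and that throughput would be total tokens divided by training time. Neither definition is stated in the log, and all variable names are illustrative.

```python
import math

# Values copied verbatim from the log lines added in this diff.
val_loss = 2.809
reported_ppl = 16.592
total_tokens = 11_186_768_000
training_time_s = 53900.17

# Assumption: perplexity = exp(mean cross-entropy loss in nats).
ppl = math.exp(val_loss)

# Assumption: throughput = total tokens / wall-clock training seconds.
# The logged Tok/sec may be defined differently by the training script.
tok_per_sec = total_tokens / training_time_s

print(f"ppl recomputed from rounded loss: {ppl:.3f} (logged: {reported_ppl})")
print(f"total tokens / training time: {tok_per_sec:,.0f} tok/s")
```

Under these assumptions the recomputed perplexity agrees with the logged value to within the rounding of the loss. The simple quotient of tokens over time comes out near 2.08 × 10^5 tok/s, which does not match the logged `Tok/sec` figure, so that figure is presumably computed on a different basis.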