Merge branch 'main' of hf.co:tangledgroup/tangled-alpha-0.10-core
README.md CHANGED
@@ -92,6 +92,21 @@ CUDA_VISIBLE_DEVICES=0 CUDA_LAUNCH_BLOCKING=0 PYTORCH_CUDA_ALLOC_CONF=expandable
 ```
 
 ```
+Seed set to 23
+Time to instantiate model: 0.21 seconds.
+Total parameters: 402,703,104
+Verifying settings ...
+Measured TFLOPs: 42432.35
+Epoch 1 | iter 64 step 1 | loss train: 11.984, val: n/a | iter time: 460.76 ms (step) remaining time: 12 days, 3:41:55
+Epoch 1 | iter 128 step 2 | loss train: 11.979, val: n/a | iter time: 402.83 ms (step) remaining time: 9 days, 0:57:24
+Epoch 1 | iter 192 step 3 | loss train: 11.983, val: n/a | iter time: 403.46 ms (step) remaining time: 8 days, 0:12:58
+Epoch 1 | iter 256 step 4 | loss train: 11.983, val: n/a | iter time: 403.39 ms (step) remaining time: 7 days, 11:52:07
+Epoch 1 | iter 320 step 5 | loss train: 11.979, val: n/a | iter time: 403.85 ms (step) remaining time: 7 days, 4:28:33
+Epoch 1 | iter 384 step 6 | loss train: 11.978, val: n/a | iter time: 403.93 ms (step) remaining time: 6 days, 23:33:15
+Epoch 1 | iter 448 step 7 | loss train: 11.978, val: n/a | iter time: 403.38 ms (step) remaining time: 6 days, 20:02:28
+Epoch 1 | iter 512 step 8 | loss train: 11.973, val: n/a | iter time: 403.80 ms (step) remaining time: 6 days, 17:24:49
+Epoch 1 | iter 576 step 9 | loss train: 11.972, val: n/a | iter time: 403.23 ms (step) remaining time: 6 days, 15:21:59
+Epoch 1 | iter 640 step 10 | loss train: 11.967, val: n/a | iter time: 403.38 ms (step) remaining time: 6 days, 13:43:53
 # ...
 ```
 