mdeniz1 commited on
Commit
199095d
·
1 Parent(s): 5d3153e

Training in progress epoch 0

Browse files
Files changed (3) hide show
  1. README.md +1 -1
  2. config.json +0 -0
  3. tf_model.h5 +2 -2
README.md CHANGED
@@ -36,7 +36,7 @@ More information needed
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
- - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 168, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
40
  - training_precision: mixed_float16
41
 
42
  ### Training results
 
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
+ - optimizer: {'inner_optimizer': {'module': 'transformers.optimization_tf', 'class_name': 'AdamWeightDecay', 'config': {'name': 'AdamWeightDecay', 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 168, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'decay': 0.0, 'beta_1': 0.8999999761581421, 'beta_2': 0.9990000128746033, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}, 'registered_name': 'AdamWeightDecay'}, 'dynamic': True, 'initial_scale': 32768.0, 'dynamic_growth_steps': 2000}
40
  - training_precision: mixed_float16
41
 
42
  ### Training results
config.json CHANGED
The diff for this file is too large to render. See raw diff
 
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:743a1d4706af557d0b976b920584857a25f5236701f1ee387071d0d4350535cb
3
- size 444308396
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:15bfe1fe7ddd7e65ccce953247f96afbc18bc794b62f6e23de35dc1661e8a87e
3
+ size 443419432