sandernotenbaert
/

okai-musiclang-content-t5

text2text-generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

sandernotenbaert commited on Jul 30

Commit

fd14110

·

verified ·

1 Parent(s): 05372a9

Model save

Files changed (3) hide show

README.md +6 -9
final_model/model.safetensors +1 -1
final_model/training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,5 +1,6 @@
 ---
 library_name: transformers
 tags:
 - generated_from_trainer
 model-index:
@@ -12,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # okai-musiclang-content-t5
-This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.5639
 ## Model description
@@ -40,8 +41,8 @@ The following hyperparameters were used during training:
 - gradient_accumulation_steps: 4
 - total_train_batch_size: 96
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
-- lr_scheduler_type: cosine
-- lr_scheduler_warmup_steps: 1000
 - num_epochs: 4
 - mixed_precision_training: Native AMP
@@ -49,11 +50,7 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 2.7352        | 0.6954 | 500  | 2.6036          |
-| 2.2123        | 1.3908 | 1000 | 2.0960          |
-| 1.8487        | 2.0862 | 1500 | 1.7925          |
-| 1.713         | 2.7816 | 2000 | 1.6366          |
-| 1.6597        | 3.4771 | 2500 | 1.5639          |
 ### Framework versions

 ---
 library_name: transformers
+base_model: sandernotenbaert/okai-musiclang-content-t5
 tags:
 - generated_from_trainer
 model-index:
 # okai-musiclang-content-t5
+This model is a fine-tuned version of [sandernotenbaert/okai-musiclang-content-t5](https://huggingface.co/sandernotenbaert/okai-musiclang-content-t5) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.5040
 ## Model description
 - gradient_accumulation_steps: 4
 - total_train_batch_size: 96
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: constant_with_warmup
+- lr_scheduler_warmup_steps: 200
 - num_epochs: 4
 - mixed_precision_training: Native AMP
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.5731        | 2.4764 | 500  | 1.5040          |
 ### Framework versions

final_model/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:68c31c91b324264716d6a08bbe92198046f468c06980655b5f3da0003e65601c
 size 362303176

 version https://git-lfs.github.com/spec/v1
+oid sha256:5fea8921cf12d4a5d9e2fddeda14297240eb814e6b2f5cb6284bdf0760cf8536
 size 362303176

final_model/training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4072d75f3bcfbefc199402079a3fa943a2dce1fccc31ce7f3a881b0731caf594
 size 5624

 version https://git-lfs.github.com/spec/v1
+oid sha256:258e93bb7afb81bdb858196bb5d9459c58151567faf53e997e6292b1461756ae
 size 5624