przybytniowska committed • 1d5c979
Parent(s): e4fb4ba
Training complete

Files changed:
- README.md +54 -0
- test_metrics.json +3 -0
- train_losses.csv +127 -0
README.md
ADDED
@@ -0,0 +1,54 @@
---
license: mit
base_model: FacebookAI/roberta-base
tags:
- generated_from_trainer
datasets:
- arrow
model-index:
- name: roberta_base_QA_SQUAD_adamw_torch
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# roberta_base_QA_SQUAD_adamw_torch

This model is a fine-tuned version of [FacebookAI/roberta-base](https://huggingface.co/FacebookAI/roberta-base) on the arrow dataset.

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 32
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 5
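The `lr_scheduler_type: linear` entry means the learning rate ramps down linearly from 2e-05 to zero over the course of training. A minimal sketch of that schedule in plain Python (assuming zero warmup steps, the Trainer's default; `linear_lr` is a hypothetical helper, not part of the training code):

```python
def linear_lr(step, total_steps, base_lr=2e-05, warmup_steps=0):
    """Linear schedule: ramp up over warmup_steps, then decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Starts at the base rate and reaches zero at the final step.
print(linear_lr(0, 100))    # 2e-05
print(linear_lr(100, 100))  # 0.0
```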

### Training results



### Framework versions

- Transformers 4.34.1
- Pytorch 2.3.0+cu118
- Datasets 2.19.0
- Tokenizers 0.14.1
test_metrics.json
ADDED
@@ -0,0 +1,3 @@
{
    "test_accuracy": 0.952
}
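The metrics file is plain JSON, so downstream tooling can read it with the standard library alone; a small sketch (the literal below mirrors the committed content, so no file access is needed):

```python
import json

# Contents of test_metrics.json as committed.
raw = '{\n    "test_accuracy": 0.952\n}'
metrics = json.loads(raw)
print(f"test accuracy: {metrics['test_accuracy']:.1%}")  # test accuracy: 95.2%
```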
train_losses.csv
ADDED
@@ -0,0 +1,127 @@
loss,epoch
0.2402,0.04
0.1457,0.08
0.1223,0.12
0.103,0.16
0.0809,0.2
0.0672,0.24
0.0642,0.28
0.049,0.32
0.0394,0.36
0.0378,0.4
0.0315,0.44
0.0283,0.48
0.0203,0.52
0.0187,0.56
0.023,0.6
0.0219,0.64
0.0184,0.68
0.0206,0.72
0.016,0.76
0.0102,0.8
0.014,0.84
0.0113,0.88
0.0119,0.92
0.0118,0.96
0.0125,1.0
0.0143,1.04
0.0128,1.08
0.0132,1.12
0.011,1.16
0.0094,1.2
0.0086,1.24
0.0104,1.28
0.0082,1.32
0.0063,1.36
0.0079,1.4
0.006,1.44
0.0065,1.48
0.011,1.52
0.0073,1.56
0.0053,1.6
0.0058,1.64
0.006,1.68
0.0053,1.72
0.0108,1.76
0.0092,1.8
0.0044,1.84
0.0045,1.88
0.007,1.92
0.0054,1.96
0.0037,2.0
0.0061,2.04
0.0036,2.08
0.0036,2.12
0.004,2.16
0.006,2.2
0.0044,2.24
0.0046,2.28
0.0014,2.32
0.0075,2.36
0.0036,2.4
0.0033,2.44
0.003,2.48
0.0034,2.52
0.0024,2.56
0.0023,2.6
0.0014,2.64
0.0066,2.68
0.0034,2.72
0.0031,2.76
0.0012,2.8
0.0029,2.84
0.0016,2.88
0.0027,2.92
0.0012,2.96
0.002,3.0
0.0024,3.04
0.0012,3.08
0.0006,3.12
0.0032,3.16
0.0022,3.2
0.0008,3.24
0.0021,3.28
0.0004,3.32
0.0024,3.36
0.0001,3.4
0.0041,3.44
0.0007,3.48
0.0003,3.52
0.0008,3.56
0.0002,3.6
0.0031,3.64
0.0006,3.68
0.0013,3.72
0.0005,3.76
0.0,3.8
0.002,3.84
0.0011,3.88
0.0,3.92
0.0,3.96
0.0008,4.0
0.0,4.04
0.0,4.08
0.0,4.12
0.0,4.16
0.0005,4.2
0.0,4.24
0.0008,4.28
0.002,4.32
0.0004,4.36
0.0,4.4
0.0,4.44
0.0003,4.48
0.0,4.52
0.0,4.56
0.0004,4.6
0.0,4.64
0.0,4.68
0.0004,4.72
0.0,4.76
0.0,4.8
0.0,4.84
0.0006,4.88
0.0006,4.92
0.0,4.96
0.0,5.0
0.012337412198364735,5.0
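The CSV logs one row per logging step (training `loss` at a fractional `epoch`); the final high-precision row at epoch 5.0 appears to be the overall average training loss. A minimal parsing sketch, using only the first few committed rows inline (swap `io.StringIO(sample)` for `open("train_losses.csv")` to read the real file):

```python
import csv
import io

# First rows of train_losses.csv, copied from the commit above.
sample = """loss,epoch
0.2402,0.04
0.1457,0.08
0.1223,0.12
0.103,0.16
"""

# Parse into (loss, epoch) float pairs.
rows = [(float(r["loss"]), float(r["epoch"]))
        for r in csv.DictReader(io.StringIO(sample))]
print(rows[0])  # (0.2402, 0.04)
```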