johannawawi committed on
Commit 19aebfe · verified · 1 Parent(s): 90432ce

johannawawi/v5_balanced_dataset_fine-tuning-java-indo-sentiment-analysist-3-class

Files changed (4)
  1. README.md +16 -12
  2. config.json +1 -1
  3. model.safetensors +1 -1
  4. training_args.bin +1 -1
README.md CHANGED
@@ -18,14 +18,14 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [w11wo/indonesian-roberta-base-sentiment-classifier](https://huggingface.co/w11wo/indonesian-roberta-base-sentiment-classifier) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.6295
- - Accuracy: 0.8273
- - F1 Macro: 0.8264
- - F1 Weighted: 0.8266
- - Precision Macro: 0.8267
- - Recall Macro: 0.8270
- - Precision Weighted: 0.8268
- - Recall Weighted: 0.8273
+ - Loss: 1.0692
+ - Accuracy: 0.8436
+ - F1 Macro: 0.8431
+ - F1 Weighted: 0.8433
+ - Precision Macro: 0.8432
+ - Recall Macro: 0.8434
+ - Precision Weighted: 0.8433
+ - Recall Weighted: 0.8436
 
  ## Model description
 
@@ -44,20 +44,24 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 8.772683881250156e-06
- - train_batch_size: 16
+ - learning_rate: 8.879626978799419e-06
+ - train_batch_size: 8
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: cosine
  - lr_scheduler_warmup_ratio: 0.1
- - num_epochs: 7
+ - num_epochs: 10
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | F1 Weighted | Precision Macro | Recall Macro | Precision Weighted | Recall Weighted |
  |:-------------:|:------:|:----:|:---------------:|:--------:|:--------:|:-----------:|:---------------:|:------------:|:------------------:|:---------------:|
- | 0.2188 | 3.6232 | 500 | 0.6295 | 0.8273 | 0.8264 | 0.8266 | 0.8267 | 0.8270 | 0.8268 | 0.8273 |
+ | 0.035 | 1.8182 | 500 | 1.0247 | 0.8327 | 0.8321 | 0.8323 | 0.8325 | 0.8325 | 0.8325 | 0.8327 |
+ | 0.0829 | 3.6364 | 1000 | 1.0134 | 0.8273 | 0.8262 | 0.8263 | 0.8275 | 0.8270 | 0.8275 | 0.8273 |
+ | 0.1858 | 5.4545 | 1500 | 1.0692 | 0.8436 | 0.8431 | 0.8433 | 0.8432 | 0.8434 | 0.8433 | 0.8436 |
+ | 0.2844 | 7.2727 | 2000 | 0.9823 | 0.8255 | 0.8250 | 0.8251 | 0.8253 | 0.8253 | 0.8254 | 0.8255 |
+ | 0.3299 | 9.0909 | 2500 | 0.9626 | 0.8255 | 0.8251 | 0.8252 | 0.8253 | 0.8253 | 0.8254 | 0.8255 |
 
 
  ### Framework versions
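
Note on the README change: the headline evaluation metrics in the updated README (Loss 1.0692, Accuracy 0.8436) coincide with the step-1500 row of the new training results table, not with the final row. A minimal sketch of how that row can be recovered from the table, assuming accuracy was the selection metric (the commit does not say which metric was used). The `rows` list and the `best_row` helper are illustrative names, not part of the repository.

```python
# (step, epoch, validation_loss, accuracy), copied from the updated results table.
rows = [
    (500, 1.8182, 1.0247, 0.8327),
    (1000, 3.6364, 1.0134, 0.8273),
    (1500, 5.4545, 1.0692, 0.8436),
    (2000, 7.2727, 0.9823, 0.8255),
    (2500, 9.0909, 0.9626, 0.8255),
]


def best_row(table, key_index=3):
    """Return the row with the highest value in column key_index.

    The default column (3) is accuracy; this choice is an assumption,
    since the commit does not name the model-selection metric.
    """
    return max(table, key=lambda row: row[key_index])


best = best_row(rows)
print(best)

# For comparison: the row with the lowest validation loss.
lowest_loss = min(rows, key=lambda row: row[2])
print(lowest_loss)
```

Selecting by lowest validation loss instead would pick the step-2500 row (loss 0.9626), so the two criteria disagree for this run.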
config.json CHANGED
@@ -4,7 +4,7 @@
  ],
  "attention_probs_dropout_prob": 0.1,
  "bos_token_id": 0,
- "classifier_dropout": 0.2,
+ "classifier_dropout": 0.3,
  "eos_token_id": 2,
  "gradient_checkpointing": false,
  "hidden_act": "gelu",
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:cfe01fe7a6497d193659b143f7ca2375c4526cfc482707467bf2ac909cdf8dd0
+ oid sha256:8200e6c7302793057b606fc7b96f8cd2a42cfd3a795aeb1bd5b9acaa6d5ad655
  size 498615900
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3beb66c94dd28553d61565b003034c7450f6cc32e2e5d50c6d5a024e713572b9
+ oid sha256:639b0a4216fe618d1906bafbc0798346ad392a872b5061f36c1cdccc20506ff7
  size 5304
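
Note on the schedule: the README hyperparameters in this commit specify `lr_scheduler_type: cosine` with `lr_scheduler_warmup_ratio: 0.1` and a peak `learning_rate` of 8.879626978799419e-06. The sketch below illustrates the shape such a schedule would have, assuming linear warmup followed by a half-cosine decay to zero. The total of 2750 steps is an inference from the results table (step 500 at epoch 1.8182 implies about 275 steps per epoch, over 10 epochs), not a value stated in the commit, and `cosine_lr` is a hypothetical helper rather than the training code itself.

```python
import math

BASE_LR = 8.879626978799419e-06          # learning_rate from the updated README
TOTAL_STEPS = 2750                       # assumption: ~275 steps/epoch * 10 epochs
WARMUP_STEPS = round(0.1 * TOTAL_STEPS)  # warmup ratio 0.1 of the assumed total


def cosine_lr(step, base_lr=BASE_LR, warmup=WARMUP_STEPS, total=TOTAL_STEPS):
    """Learning rate at a given step: linear warmup, then half-cosine decay."""
    if step < warmup:
        # Ramp linearly from 0 up to base_lr over the warmup steps.
        return base_lr * step / max(1, warmup)
    # Fraction of the post-warmup phase completed, in [0, 1].
    progress = (step - warmup) / max(1, total - warmup)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


# Print the rate at the evaluation steps listed in the results table.
for step in (500, 1000, 1500, 2000, 2500):
    print(step, cosine_lr(step))
```

Under these assumptions the rate peaks at the end of warmup and decays monotonically afterwards, so the step-1500 checkpoint was trained at a noticeably higher rate than the step-2500 one.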