gokulsrinivasagan/bert_base_train_book_ent_15p_s_init_wnli

+---
+library_name: transformers
+license: apache-2.0
+base_model: gokulsrinivasagan/bert_base_train_book_ent_15p_s_init
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+model-index:
+- name: bert_base_train_book_ent_15p_s_init_wnli
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# bert_base_train_book_ent_15p_s_init_wnli
+This model is a fine-tuned version of [gokulsrinivasagan/bert_base_train_book_ent_15p_s_init](https://huggingface.co/gokulsrinivasagan/bert_base_train_book_ent_15p_s_init) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.6859
+- Accuracy: 0.5634
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 256
+- eval_batch_size: 256
+- seed: 10
+- optimizer: adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 (no additional optimizer arguments)
+- lr_scheduler_type: linear
+- num_epochs: 50
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.7049        | 1.0   | 3    | 0.6941          | 0.4366   |
+| 0.7016        | 2.0   | 6    | 0.6849          | 0.5634   |
+| 0.7019        | 3.0   | 9    | 0.6914          | 0.5634   |
+| 0.6938        | 4.0   | 12   | 0.7015          | 0.4366   |
+| 0.6971        | 5.0   | 15   | 0.6971          | 0.4366   |
+| 0.6948        | 6.0   | 18   | 0.6892          | 0.5634   |
+| 0.7045        | 7.0   | 21   | 0.6859          | 0.5634   |
+### Framework versions
+- Transformers 4.51.2
+- Pytorch 2.6.0+cu126
+- Datasets 3.5.0
+- Tokenizers 0.21.1
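Reviewer note on the hyperparameters in the card above: a `linear` scheduler with `learning_rate: 5e-05` implies the rate shrinks a little on every optimizer step. The sketch below works through that arithmetic with the card's numbers. It assumes zero warmup steps (the card lists no warmup setting) and a planned total of 3 steps per epoch × 50 epochs = 150 steps, taking the steps-per-epoch figure from the results table. The function `linear_lr` is a hypothetical helper written for illustration, not the Trainer's own code.

```python
# Sketch of a linear learning-rate decay using values from the model card.
BASE_LR = 5e-05       # learning_rate from the card
STEPS_PER_EPOCH = 3   # from the results table: step 3 at epoch 1.0
NUM_EPOCHS = 50       # num_epochs from the card
WARMUP_STEPS = 0      # assumption: the card lists no warmup setting

TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS  # 150 planned optimizer steps


def linear_lr(step: int) -> float:
    """Learning rate after `step` optimizer steps under linear warmup then decay."""
    if step < WARMUP_STEPS:
        return BASE_LR * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return BASE_LR * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


if __name__ == "__main__":
    # The results table logs epochs 1 through 7 (steps 3 through 21).
    for epoch in range(1, 8):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch}: step {step:>2}, lr = {linear_lr(step):.3e}")
```

Under these assumptions the rate at step 21, the last logged row, would still be about 86% of its starting value, since only 21 of the 150 planned steps appear in the table.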

logs/events.out.tfevents.1745519475.ki-g0008.3436350.32

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:19288214b424cc9bafe8fa30a645280536d6742cfab8981a4d97e49940dd57bb
-size 5663
+oid sha256:f59ad1d0b5d1c64bf2330291984ef9803bede208a8f22771b0ecb718b1092abc
+size 9155

model.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dc1917c2f138ca1c3b0ebc5c0dedab4b565f748269d0c20cd1ca3dc5a24f2c05
+oid sha256:e0bff6fa90b93c612275119a137cc68f84b5d5859d56f30849b09fc7a80aec03
 size 437958648
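
Reviewer note: the two hunks above change only Git LFS pointer files, which record a SHA-256 digest and a byte size in place of the binary itself. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` and the local path `model.safetensors` are hypothetical, chosen for illustration; the pointer text is copied from the new side of the diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Check a local file's size and SHA-256 digest against a parsed pointer."""
    if pointer["algo"] != "sha256" or path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large model files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents from the new side of the model.safetensors hunk.
NEW_MODEL_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e0bff6fa90b93c612275119a137cc68f84b5d5859d56f30849b09fc7a80aec03
size 437958648
"""

if __name__ == "__main__":
    pointer = parse_lfs_pointer(NEW_MODEL_POINTER)
    local_file = Path("model.safetensors")  # hypothetical local download path
    if local_file.exists():
        print("matches pointer:", matches_pointer(local_file, pointer))
    else:
        print("expected size in bytes:", pointer["size"])
```

The size check runs first because it is cheap; the digest is only computed when the sizes agree. Note that `model.safetensors` keeps the same size across the commit, so only the digest distinguishes the old and new weights.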