alexue4's picture
End of training
4148ae4 verified
|
raw
history blame
2.67 kB
metadata
license: mit
base_model: alexue4/text-normalization-ru-new
tags:
  - generated_from_trainer
model-index:
  - name: text-normalization-ru-new
    results: []

text-normalization-ru-new

This model is a fine-tuned version of alexue4/text-normalization-ru-new on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0008
  • Mean Distance: 0
  • Max Distance: 3

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 150
  • eval_batch_size: 150
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 15

Training results

Training Loss Epoch Step Validation Loss Mean Distance Max Distance
0.0004 1.0 3059 0.0022 0 12
0.0003 2.0 6118 0.0011 0 5
0.0003 3.0 9177 0.0010 0 6
0.0003 4.0 12236 0.0012 0 3
0.0003 5.0 15295 0.0008 0 3
0.0002 6.0 18354 0.0009 0 3
0.0002 7.0 21413 0.0008 0 3
0.0002 8.0 24472 0.0008 0 3
0.0002 9.0 27531 0.0007 0 3
0.0002 10.0 30590 0.0008 0 3
0.0002 11.0 33649 0.0008 0 3
0.0002 12.0 36708 0.0008 0 3
0.0002 13.0 39767 0.0008 0 3
0.0002 14.0 42826 0.0008 0 3
0.0002 15.0 45885 0.0008 0 3

Framework versions

  • Transformers 4.37.2
  • Pytorch 2.1.0+cu121
  • Datasets 2.16.1
  • Tokenizers 0.15.1