hananeChab
/

darija_englishV2.1

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

darija_englishV2.1

This model is a fine-tuned version of Helsinki-NLP/opus-mt-ar-en on an unknown dataset. It achieves the following results on the evaluation set:

Loss: 1.3635
Bleu: 32.8666
Gen Len: 11.7901

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 10
eval_batch_size: 10
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 7
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Bleu	Gen Len
1.641	1.0	1594	1.4631	29.3791	11.6872
1.2871	2.0	3188	1.3761	30.8465	12.0999
1.1067	3.0	4782	1.3503	32.7058	11.6206
0.9456	4.0	6376	1.3462	32.5581	11.8712
0.8611	5.0	7970	1.3497	33.0669	11.7258
0.7806	6.0	9564	1.3601	33.2093	11.7203
0.7293	7.0	11158	1.3635	32.8666	11.7901

Framework versions

Transformers 4.40.1
Pytorch 2.3.0+cu121
Datasets 2.19.0
Tokenizers 0.19.1

Downloads last month: 46

Safetensors

Model size

76.4M params

Tensor type

F32

·

Inference Examples

Text2Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for hananeChab/darija_englishV2.1

Base model

Helsinki-NLP/opus-mt-ar-en

Finetuned

(20)

this model

Evaluation results

Metadata error: specify a dataset to view leaderboard