colqwenstella_ufo

This model is a fine-tuned version of Metric-AI/ColQwenStella-base-2b on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 4
eval_batch_size: 8
seed: 42
optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 100
num_epochs: 1

Training Loss	Epoch	Step	Validation Loss
0.0656	0.1636	80	0.0603
0.0135	0.3272	160	0.0482
0.013	0.4908	240	0.0417
0.0199	0.6544	320	0.0326
0.0322	0.8180	400	0.0437
0.0205	0.9816	480	0.0417