kxdw2580/Qwen2.5-0.5B-Catgirl-test0426

This model is a test model, designed for phased lightweight testing of the dataset.

The test dataset has fixed this issue:

  • Model's outputs "~" causing rendering errors.

After testing, the objectives of the dataset fixes have been achieved.

Other

As a 0.5b model, its performance is very poor, especially in the missing English part of the dataset. We do not recommend using this model unless there is a specific need.

We are working hard to improve the training results on smaller models, but it is obviously unlikely for the 0.5b model.

Specific training results can be seen at swanlab

Additionally, I have observed that with models of this size, a smaller training loss does not always indicate better model performance, and sometimes even leads to a decline in performance. This swanlab record is the result of further training for this model. After testing, I found that its performance is even worse than the original model. This model has not been publicly released and has been deleted.

I would be very happy to communicate if you wish to!

Downloads last month
9
Safetensors
Model size
494M params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kxdw2580/Qwen2.5-0.5B-Catgirl-test0426

Base model

Qwen/Qwen2.5-0.5B
Finetuned
(333)
this model
Quantizations
1 model

Dataset used to train kxdw2580/Qwen2.5-0.5B-Catgirl-test0426