Guaranteed Guessing
Collection
9 items
โข
Updated
This model is a fine-tuned version of Qwen/Qwen2.5-Coder-0.5B-Instruct on the anghabench_1M_1, the anghabench_1M_2 and the stack datasets. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
0.0064 | 0.3912 | 61000 | 0.0041 |
0.0029 | 0.7825 | 122000 | 0.0032 |
0.0023 | 1.1737 | 183000 | 0.0024 |
0.0018 | 1.5649 | 244000 | 0.0021 |
0.0011 | 1.9562 | 305000 | 0.0020 |
Base model
Qwen/Qwen2.5-0.5B