Finetune of amd135m using Rchatml format form reasoning-base-20k dataset from KingNish. Trying to see if i can get this small model to reason. Improvements, suggestions welcome. Will upload training script and dataset script soon (yell at me if I dont)

Downloads last month: 19

Safetensors

Model size

134M params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for skdrx/amd135m_reasoning_finetune

Base model

amd/AMD-Llama-135m

Quantized

(16)

this model

skdrx
/

amd135m_reasoning_finetune

Model tree for skdrx/amd135m_reasoning_finetune

Datasets used to train skdrx/amd135m_reasoning_finetune