This model is a fine-tuned version of Qwen3-0.6B-Base, trained with supervised fine-tuning (SFT) using the trl library on a 6k-pair sub-sample of the MetaMathQA dataset. It is the first step of LaQwenTa, a lightweight STEM question-answering model for educational purposes.
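As a rough illustration of how the checkpoint might be queried, here is a minimal sketch using the `transformers` library with a PyTorch backend. The repository name comes from this page. The `Question:/Answer:` prompt template, the `build_prompt` and `answer` helper names, and the generation settings are assumptions for illustration only, since the template used during fine-tuning is not documented here.

```python
MODEL_ID = "jeanprbt/laqwenta_sft_model"  # repository name taken from this page


def build_prompt(question: str) -> str:
    """Wrap a question in a plain template.

    Assumption: the template used during fine-tuning is not documented,
    so this format is a placeholder to adapt.
    """
    return f"Question: {question.strip()}\nAnswer:"


def answer(question: str, max_new_tokens: int = 256) -> str:
    """Generate a greedy completion for one question."""
    # Imported here so build_prompt stays usable without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    model.eval()

    # return_tensors="pt" requires PyTorch.
    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,  # greedy decoding, chosen here for reproducibility
    )
    # Keep only the tokens generated after the prompt.
    prompt_length = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0, prompt_length:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `answer("What is 12 * 7 + 5?")` downloads the weights on first use and returns the generated continuation. If the outputs look off, the prompt template is the first thing to revisit.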


Safetensors

Model size: 596M params

Tensor type: F32
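The parameter count and tensor type listed above give a quick estimate of how much memory the weights alone occupy. The sketch below is back-of-the-envelope arithmetic, not a measurement. The `weight_memory_gb` helper is hypothetical, the 2-byte entries are included only for comparison, and activations, KV cache, and framework overhead are ignored.

```python
# Bytes used per parameter for a few common tensor types.
# Only F32 is stated on this page; the others are listed for comparison.
BYTES_PER_PARAM = {"F32": 4, "F16": 2, "BF16": 2}


def weight_memory_gb(num_params: int, dtype: str = "F32") -> float:
    """Approximate size of the weights in gigabytes (1 GB = 10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# 596M parameters stored as F32, as listed on this page.
print(f"{weight_memory_gb(596_000_000):.2f} GB")  # prints "2.38 GB"
```

Under these assumptions, loading the same weights in a 2-byte type would roughly halve the footprint.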


Model tree for jeanprbt/laqwenta_sft_model

Base model: Qwen/Qwen3-0.6B-Base

Finetuned (282): this model


Dataset used to train jeanprbt/laqwenta_sft_model