mlxha
/

Qwen3-4B-grpo-medmcqa

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Qwen3-4B-grpo-medmcqa / .wandb_run_id

mlxha's picture

Training in progress, step 300

9e3a888 verified 4 months ago

history blame contribute delete

37 Bytes

	abe1faca-0f47-471a-b13c-22b48a63c0a6