Trained the eos_token into the lm_head.

This should allow qlora finetunes with 24 or even 16 GB of vram.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for gghfez/Qwen2.5-14B-Base-lm_head-LoRA

Base model

Qwen/Qwen2.5-14B
Finetuned
(69)
this model