vannishh's picture
Upload QLoRA fine-tuned model
7f91d8c verified
metadata
license: apache-2.0
base_model: meta-llama/Llama-3.2-1B
tags:
  - pubmedqa
  - llama3
  - qlora
  - sequence-classification
  - 4bit
  - peft

LLaMA3-2.1B QLoRA fine-tuned on PubMedQA

This model is a 4-bit quantized, QLoRA fine-tuned version of meta-llama/Llama-3.2-1B, trained on the PubMedQA dataset for medical question classification (yes, no, maybe). It was optimized using PEFT with LoRA adapters, and is designed for efficient inference on resource-constrained hardware.

Training Details

  • Base model: meta-llama/Llama-3.2-1B
  • Dataset: pubmed_qa/pqa_labeled
  • Method: QLoRA (4-bit NF4)
  • LoRA target modules: q_proj, v_proj
  • Epochs: 5
  • Batch size: 4