pr4nav101's picture
Update README.md
39841a7 verified
metadata
license: mit
datasets:
  - pr4nav101/COT_TTM_Finetuning
language:
  - en
base_model:
  - unsloth/Llama-3.2-1B-Instruct-bnb-4bit
  - pr4nav101/llama-3-8b-Instruct-bnb-4bit-Tiny-Thought-Model-Large
pipeline_tag: text-generation
library_name: peft
tags:
  - COT
  - TTM
  - LLM
method:
  - Knowledge Distillation with Reverse KL Divergence + PEFT Finetuning