pr4nav101/Tiny-Thought-Model-Distilled-Llama-3.2-1B-Instruct-bnb-4bit




---
license: mit
datasets:
- pr4nav101/COT_TTM_Finetuning
language:
- en
base_model:
- unsloth/Llama-3.2-1B-Instruct-bnb-4bit
- pr4nav101/llama-3-8b-Instruct-bnb-4bit-Tiny-Thought-Model-Large
pipeline_tag: text-generation
library_name: peft
tags:
- COT
- TTM
- LLM
method:
- Knowledge Distillation with Reverse KL Divergence + PEFT Finetuning
---
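
The `method` field above names knowledge distillation with reverse KL divergence. The training code is not part of this card, so the following is only a minimal sketch of what a reverse KL term looks like for a single token position, written in plain Python. The function names `log_softmax` and `reverse_kl` and the `temperature` parameter are hypothetical and chosen for illustration; they are not taken from the author's code.

```python
import math

def log_softmax(logits, temperature=1.0):
    # Numerically stable log-softmax over one vocabulary-sized list of logits.
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_z = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_z for x in scaled]

def reverse_kl(student_logits, teacher_logits, temperature=1.0):
    # Reverse KL: KL(student || teacher) = sum_i q_i * (log q_i - log p_i),
    # where q is the student distribution and p is the teacher distribution.
    # The expectation is taken under the student, unlike forward KL.
    log_q = log_softmax(student_logits, temperature)
    log_p = log_softmax(teacher_logits, temperature)
    return sum(math.exp(lq) * (lq - lp) for lq, lp in zip(log_q, log_p))

# Illustrative logits only (assumed values, not real model outputs).
student = [3.0, 0.0, 0.0]
teacher = [1.0, 1.0, 0.0]
loss = reverse_kl(student, teacher)
```

Because the expectation is taken under the student distribution, this term penalizes the student for placing probability where the teacher places little, which is the usual motivation for choosing reverse over forward KL in distillation. In an actual training loop the same quantity would typically be computed on tensors, averaged over token positions, and backpropagated through the PEFT adapter weights only.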