deepkaria
/

deepseek-r1-1.5b-indian-culture

Text Generation

text-generation-inference

Model card Files Files and versions Community

deepkaria commited on Mar 18

Commit

b9d8559

·

verified ·

1 Parent(s): 47a9273

Update README.md

Files changed (1) hide show

README.md +70 -6

README.md CHANGED Viewed

@@ -12,12 +12,76 @@ language:
 - en
 ---
-# Uploaded  model
-- **Developed by:** deepkaria
-- **License:** apache-2.0
-- **Finetuned from model :** unsloth/deepseek-r1-distill-qwen-1.5b-unsloth-bnb-4bit
-This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 - en
 ---
+# Model: deepkaria/deepseek-r1-1.5b-indian-culture
+## Language
+**en**
+## Tags
+- deepseek
+- indian-culture
+- cultural-heritage
+- lora
+- fine-tuned
+## Datasets
+- [deepkaria/indian-culture-dataset](https://huggingface.co/datasets/deepkaria/indian-culture-dataset)
+## License
+**Apache-2.0**
+## Base Model
+[deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B)
+## Model Description
+This model has been fine-tuned on the Indian Culture Dataset to provide detailed and accurate information about various aspects of Indian culture, including festivals, performing arts, architecture, rituals, traditional medicine, and more.
+## Training Details
+**Base Model:** deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
+### Training Method
+LoRA fine-tuning with the following parameters:
+- **LoRA rank:** 16
+- **LoRA alpha:** 32
+- **Target modules:** Attention and MLP layers
+- **Training epochs:** 3
+- **Learning rate:** 2e-4 with cosine scheduler
+## Usage Example
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+model_name = "deepkaria/deepseek-r1-1.5b-indian-culture"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForCausalLM.from_pretrained(model_name)
+prompt = """Below is an instruction that describes a task, paired with an input that provides further context.
+Write a response that appropriately addresses the instruction.
+### Instruction:
+You are an expert on Indian culture, traditions, and heritage. Provide detailed and accurate information about the following aspect of Indian culture.
+### Input:
+Tell me about Kathakali from Kerala.
+### Response:
+"""
+inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+outputs = model.generate(
+    **inputs,
+    max_new_tokens=512,
+    temperature=0.7,
+    top_p=0.9,
+    do_sample=True
+)
+print(tokenizer.decode(outputs[0], skip_special_tokens=True))
+```
+## Limitations
+The model's knowledge is limited to the information contained in the training dataset. While it covers a wide range of Indian cultural topics, it may not have comprehensive information about very specific or regional cultural practices.
+## Intended Use
+This model is designed for educational purposes, cultural research, and to promote understanding of India's diverse cultural landscape.