Update README.md
Browse files
README.md
CHANGED
@@ -3,41 +3,25 @@ base_model: unsloth/deepseek-r1-distill-qwen-1.5b-unsloth-bnb-4bit
|
|
3 |
tags:
|
4 |
- text-generation-inference
|
5 |
- transformers
|
6 |
-
- unsloth
|
7 |
- qwen2
|
8 |
-
- trl
|
9 |
- sft
|
|
|
|
|
10 |
license: apache-2.0
|
11 |
language:
|
12 |
- en
|
|
|
|
|
13 |
---
|
14 |
|
15 |
# Model: deepkaria/deepseek-r1-1.5b-indian-culture
|
16 |
|
17 |
-
## Language
|
18 |
-
**en**
|
19 |
-
|
20 |
-
## Tags
|
21 |
-
- deepseek
|
22 |
-
- indian-culture
|
23 |
-
- cultural-heritage
|
24 |
-
- lora
|
25 |
-
- fine-tuned
|
26 |
-
|
27 |
-
## Datasets
|
28 |
-
- [deepkaria/indian-culture-dataset](https://huggingface.co/datasets/deepkaria/indian-culture-dataset)
|
29 |
-
|
30 |
-
## License
|
31 |
-
**Apache-2.0**
|
32 |
-
|
33 |
-
## Base Model
|
34 |
-
[deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B)
|
35 |
-
|
36 |
## Model Description
|
37 |
This model has been fine-tuned on the Indian Culture Dataset to provide detailed and accurate information about various aspects of Indian culture, including festivals, performing arts, architecture, rituals, traditional medicine, and more.
|
38 |
|
39 |
## Training Details
|
40 |
-
**Base Model:** deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
|
|
|
41 |
|
42 |
### Training Method
|
43 |
LoRA fine-tuning with the following parameters:
|
@@ -83,5 +67,4 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
|
83 |
The model's knowledge is limited to the information contained in the training dataset. While it covers a wide range of Indian cultural topics, it may not have comprehensive information about very specific or regional cultural practices.
|
84 |
|
85 |
## Intended Use
|
86 |
-
This model is designed for educational purposes, cultural research, and to promote understanding of India's diverse cultural landscape.
|
87 |
-
|
|
|
3 |
tags:
|
4 |
- text-generation-inference
|
5 |
- transformers
|
|
|
6 |
- qwen2
|
|
|
7 |
- sft
|
8 |
+
- deepseek
|
9 |
+
- indian-culture
|
10 |
license: apache-2.0
|
11 |
language:
|
12 |
- en
|
13 |
+
datasets:
|
14 |
+
- deepkaria/indian-culture-dataset
|
15 |
---
|
16 |
|
17 |
# Model: deepkaria/deepseek-r1-1.5b-indian-culture
|
18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
## Model Description
|
20 |
This model has been fine-tuned on the Indian Culture Dataset to provide detailed and accurate information about various aspects of Indian culture, including festivals, performing arts, architecture, rituals, traditional medicine, and more.
|
21 |
|
22 |
## Training Details
|
23 |
+
- **Base Model:** deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
|
24 |
+
- **Dataset Used for Finetuning:** [deepkaria/indian-culture-dataset](https://huggingface.co/datasets/deepkaria/indian-culture-dataset)
|
25 |
|
26 |
### Training Method
|
27 |
LoRA fine-tuning with the following parameters:
|
|
|
67 |
The model's knowledge is limited to the information contained in the training dataset. While it covers a wide range of Indian cultural topics, it may not have comprehensive information about very specific or regional cultural practices.
|
68 |
|
69 |
## Intended Use
|
70 |
+
This model is designed for educational purposes, cultural research, and to promote understanding of India's diverse cultural landscape.
|
|