Update README.md
Browse files
README.md
CHANGED
@@ -14,7 +14,7 @@ pipeline_tag: text-generation
|
|
14 |
This repository contains a custom-trained Hindi Causal Language Model designed for Hindi text generation.
|
15 |
|
16 |
## Model Description
|
17 |
-
- **Model Size:**
|
18 |
|
19 |
- **Architecture:** Custom Transformer (12 layers, hidden=768, 16 heads, ffn=3072, act=swiglu, norm=rmsnorm) based on the `HindiCausalLM` class with Hindi-specific optimizations:
|
20 |
- Multi-resolution attention to capture both character-level and word-level patterns
|
|
|
14 |
This repository contains a custom-trained Hindi Causal Language Model designed for Hindi text generation.
|
15 |
|
16 |
## Model Description
|
17 |
+
- **Model Size:** 113M (YAH !!! Its very small)
|
18 |
|
19 |
- **Architecture:** Custom Transformer (12 layers, hidden=768, 16 heads, ffn=3072, act=swiglu, norm=rmsnorm) based on the `HindiCausalLM` class with Hindi-specific optimizations:
|
20 |
- Multi-resolution attention to capture both character-level and word-level patterns
|