prithivMLmods committed
Commit ba67dfa · verified · 1 Parent(s): af2ce9d

Update README.md

Files changed (1)
  1. README.md +25 -1
README.md CHANGED
@@ -11,4 +11,28 @@ tags:
  language:
  - en
  library_name: transformers
- ---
+ ---
+
+ # **Draco-CoderMini-3B-GGUF**
+
+ > **Draco-CoderMini-3B** is a compact, coding-optimized language model built on the **Qwen2 architecture** and tailored for high-accuracy **code generation**, **debugging**, and **technical reasoning**. With **3 billion parameters**, it balances capability against deployability, which makes it well suited to developers, educators, and engineers who work in constrained environments or need fast inference.
+
+ ## Model Files
+
+ | File Name                    | Size    | Format |
+ |------------------------------|---------|--------|
+ | Draco-CoderMini-3B.BF16.gguf | 6.18 GB | BF16   |
+ | Draco-CoderMini-3B.F16.gguf  | 6.18 GB | F16    |
+ | Draco-CoderMini-3B.F32.gguf  | 12.3 GB | F32    |
+ | .gitattributes               | 1.75 kB | -      |
+ | README.md                    | 210 B   | -      |
+ | config.json                  | 31 B    | JSON   |
+
+ ## Quants Usage
+
+ (Sorted by size, not necessarily by quality. IQ-quants are often preferable to similarly sized non-IQ quants.)
+
+ Here is a handy graph by ikawrakow comparing some lower-quality quant
+ types (lower is better):
+
+ ![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
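
The sizes in the README's file table can be sanity-checked with simple arithmetic: an unquantized GGUF file stores roughly one value per parameter, so its size is approximately the parameter count times the bytes per weight. The following is a minimal sketch, not part of the commit. It assumes the listed sizes are decimal gigabytes (1 GB = 10^9 bytes) and ignores metadata and tokenizer overhead; the helper names `implied_params_billions` and `estimated_size_gb` are hypothetical and introduced only for illustration.

```python
# Bytes used to store one weight in each unquantized format from the table.
BYTES_PER_WEIGHT = {"BF16": 2, "F16": 2, "F32": 4}


def implied_params_billions(size_gb: float, fmt: str) -> float:
    """Parameter count (in billions) implied by a file size in decimal GB."""
    return size_gb / BYTES_PER_WEIGHT[fmt]


def estimated_size_gb(params_billions: float, fmt: str) -> float:
    """Approximate file size in decimal GB, ignoring metadata overhead."""
    return params_billions * BYTES_PER_WEIGHT[fmt]


# Sizes copied from the file table; the GB unit is assumed to be decimal.
for fmt, size_gb in [("BF16", 6.18), ("F16", 6.18), ("F32", 12.3)]:
    params = implied_params_billions(size_gb, fmt)
    print(f"{fmt}: {size_gb} GB implies about {params:.2f}B parameters")
```

Under these assumptions, all three rows imply roughly 3.1 billion parameters, which agrees with the "3 billion parameters" stated in the description and explains why the F32 file is about twice the size of the BF16 and F16 files.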