Update README.md
README.md
CHANGED
@@ -11,4 +11,28 @@ tags:
 language:
 - en
 library_name: transformers
----
+---
+
+# **Draco-CoderMini-3B-GGUF**
+
+> **Draco-CoderMini-3B** is a compact, coding-optimized language model built on the **Qwen2 architecture**, tailored for high-accuracy **code generation**, **debugging**, and **technical reasoning**. With **3 billion parameters**, it strikes a balance between power and deployability, making it an ideal assistant for developers, educators, and engineers working in constrained environments or requiring fast inference.
+
+## Model File
+
+| File Name | Size | Format |
+|----------------------------------------|---------|--------|
+| Draco-CoderMini-3B.BF16.gguf | 6.18 GB | BF16 |
+| Draco-CoderMini-3B.F16.gguf | 6.18 GB | F16 |
+| Draco-CoderMini-3B.F32.gguf | 12.3 GB | F32 |
+| .gitattributes | 1.75 kB | - |
+| README.md | 210 B | - |
+| config.json | 31 B | JSON |
+
+## Quants Usage
+
+(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
+
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+
+
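As a rough sanity check on the Model File table in the hunk above, the listed sizes are consistent with about 3 billion weights stored at 2 bytes each (BF16, F16) or 4 bytes each (F32). The sketch below is illustrative only. It assumes the listed GB are decimal gigabytes (10^9 bytes) and ignores GGUF metadata and any tensors kept at a different precision. `estimate_params` is a hypothetical helper written for this note, not part of any library.

```python
# Bytes per weight for the three unquantized formats listed in the table.
BYTES_PER_WEIGHT = {"BF16": 2, "F16": 2, "F32": 4}

# Sizes copied from the Model File table, in GB (assumed to mean 10**9 bytes).
FILE_SIZES_GB = {
    "Draco-CoderMini-3B.BF16.gguf": ("BF16", 6.18),
    "Draco-CoderMini-3B.F16.gguf": ("F16", 6.18),
    "Draco-CoderMini-3B.F32.gguf": ("F32", 12.3),
}


def estimate_params(size_gb: float, fmt: str) -> float:
    """Approximate weight count, in billions, implied by a file size.

    GB divided by bytes-per-weight gives billions of weights directly,
    because both sides carry the same factor of 10**9.
    """
    return size_gb / BYTES_PER_WEIGHT[fmt]


if __name__ == "__main__":
    for name, (fmt, size_gb) in FILE_SIZES_GB.items():
        print(f"{name}: ~{estimate_params(size_gb, fmt):.2f}B weights")
```

Under these assumptions each file works out to roughly 3.1 billion weights, which matches the "3 billion parameters" stated in the description and explains why the F32 file is about twice the size of the two 16-bit files.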