prithivMLmods
/

Draco-CoderMini-3B-GGUF

Text Generation

text-generation-inference

Model card Files Files and versions

prithivMLmods commited on May 26

Commit

ba67dfa

·

verified ·

1 Parent(s): af2ce9d

Update README.md

Files changed (1) hide show

README.md +25 -1

README.md CHANGED Viewed

@@ -11,4 +11,28 @@ tags:
 language:
 - en
 library_name: transformers
----

 language:
 - en
 library_name: transformers
+---
+# **Draco-CoderMini-3B-GGUF**
+> **Draco-CoderMini-3B** is a compact, coding-optimized language model built on the **Qwen2 architecture**, tailored for high-accuracy **code generation**, **debugging**, and **technical reasoning**. With **3 billion parameters**, it strikes a balance between power and deployability, making it an ideal assistant for developers, educators, and engineers working in constrained environments or requiring fast inference.
+## Model File
+| File Name                              | Size    | Format |
+|----------------------------------------|---------|--------|
+| Draco-CoderMini-3B.BF16.gguf           | 6.18 GB | BF16   |
+| Draco-CoderMini-3B.F16.gguf            | 6.18 GB | F16    |
+| Draco-CoderMini-3B.F32.gguf            | 12.3 GB | F32    |
+| .gitattributes                         | 1.75 kB | -      |
+| README.md                              | 210 B   | -      |
+| config.json                            | 31 B    | JSON   |
+## Quants Usage
+(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)