prithivMLmods committed
Commit ad73f36 · verified · 1 Parent(s): f212ae2

Update README.md

Files changed (1)
  1. README.md +23 -1
README.md CHANGED
@@ -15,4 +15,26 @@ tags:
 
 # **Lacaille-MoT-4B-Supreme2-GGUF**
 
-> **Lacaille-MoT-4B-Supreme2** is a high-efficiency, multi-domain model fine-tuned on **Qwen3-4B** using the **Mixture of Thoughts (MoT)** dataset enhanced with **code, math, science expert clusters** and an extended **open code reasoning dataset**. This model blends symbolic precision, scientific logic, and structured output fluency—making it an ideal tool for developers, educators, and researchers seeking advanced reasoning under constrained compute.
+> **Lacaille-MoT-4B-Supreme2** is a high-efficiency, multi-domain model fine-tuned on **Qwen3-4B** using the **Mixture of Thoughts (MoT)** dataset enhanced with **code, math, science expert clusters** and an extended **open code reasoning dataset**. This model blends symbolic precision, scientific logic, and structured output fluency—making it an ideal tool for developers, educators, and researchers seeking advanced reasoning under constrained compute.
+
+## Model File Table
+
+| File Name                            | Size    | Format        | Description                   |
+|--------------------------------------|---------|---------------|-------------------------------|
+| Lacaille-MoT-4B-Supreme2.BF16.gguf   | 8.05 GB | GGUF (BF16)   | BFloat16 precision model file |
+| Lacaille-MoT-4B-Supreme2.F16.gguf    | 8.05 GB | GGUF (F16)    | Float16 precision model file  |
+| Lacaille-MoT-4B-Supreme2.F32.gguf    | 16.1 GB | GGUF (F32)    | Float32 precision model file  |
+| Lacaille-MoT-4B-Supreme2.Q4_K_M.gguf | 2.5 GB  | GGUF (Q4_K_M) | 4-bit quantized model file    |
+| Lacaille-MoT-4B-Supreme2.Q5_K_M.gguf | 2.89 GB | GGUF (Q5_K_M) | 5-bit quantized model file    |
+| Lacaille-MoT-4B-Supreme2.Q8_0.gguf   | 4.28 GB | GGUF (Q8_0)   | 8-bit quantized model file    |
+| config.json                          | 31 B    | JSON          | Configuration file            |
+| .gitattributes                       | 1.95 kB | Text          | Git attributes configuration  |
+
+## Quants Usage
+
+(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
+
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
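For readers who want to try one of the quantized files listed in the table above, the sketch below shows one common way to load a GGUF file with llama-cpp-python. The library choice, the repo id (`prithivMLmods/Lacaille-MoT-4B-Supreme2-GGUF`), the context size, and the prompt are assumptions for illustration and are not part of this commit.

```python
# Minimal sketch (assumptions noted above): download the 4-bit Q4_K_M quant
# from the Hugging Face Hub with llama-cpp-python and run a single chat turn.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="prithivMLmods/Lacaille-MoT-4B-Supreme2-GGUF",  # assumed repo id
    filename="Lacaille-MoT-4B-Supreme2.Q4_K_M.gguf",        # ~2.5 GB 4-bit quant from the table
    n_ctx=4096,  # context window; lower this on memory-constrained machines
)

response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Walk through solving x^2 - 5x + 6 = 0."}],
    max_tokens=256,
)
print(response["choices"][0]["message"]["content"])
```

As the file table suggests, the Q4_K_M and Q5_K_M files trade some precision for a much smaller memory footprint, while Q8_0 and the F16/BF16/F32 files keep more precision at the cost of size.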