Chinda-Qwen3-4B-F32-GGUF

Chinda Opensource Thai LLM 4B is iApp Technology's cutting-edge Thai language model that brings advanced thinking capabilities to the Thai AI ecosystem. Built on the latest Qwen3-4B architecture, Chinda represents our commitment to developing sovereign AI solutions for Thailand.

Model Files

File Size Format
Chinda-Qwen3-4B-F32.F32.gguf 16.1 GB 32-bit float
Chinda-Qwen3-4B-F32.BF16.gguf 8.05 GB BFloat16
Chinda-Qwen3-4B-F32.F16.gguf 8.05 GB 16-bit float
Chinda-Qwen3-4B-F32.Q8_0.gguf 4.28 GB 8-bit quantized
Chinda-Qwen3-4B-F32.Q6_K.gguf 3.31 GB 6-bit quantized
Chinda-Qwen3-4B-F32.Q5_K_M.gguf 2.89 GB 5-bit quantized (medium)
Chinda-Qwen3-4B-F32.Q5_K_S.gguf 2.82 GB 5-bit quantized (small)
Chinda-Qwen3-4B-F32.Q4_K_M.gguf 2.5 GB 4-bit quantized (medium)
Chinda-Qwen3-4B-F32.Q4_K_S.gguf 2.38 GB 4-bit quantized (small)
Chinda-Qwen3-4B-F32.Q3_K_L.gguf 2.24 GB 3-bit quantized (large)
Chinda-Qwen3-4B-F32.Q3_K_M.gguf 2.08 GB 3-bit quantized (medium)
Chinda-Qwen3-4B-F32.Q3_K_S.gguf 1.89 GB 3-bit quantized (small)
Chinda-Qwen3-4B-F32.Q2_K.gguf 1.67 GB 2-bit quantized

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

image.png

Downloads last month
319
GGUF
Model size
4.02B params
Architecture
qwen3
Hardware compatibility
Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for prithivMLmods/Chinda-Qwen3-4B-F32-GGUF

Base model

Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Quantized
(6)
this model