rootonchair
/

Vintern-3B-beta-GGUF

Image-Text-to-Text

Model card Files Files and versions Community

GGUF and static quants of https://huggingface.co/5CD-AI/Vintern-3B-beta

Usage

If you are unsure how to use GGUF files, refer to one of TheBloke's READMEs for more details, including on how to concatenate multi-part files.

Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Link	Type	Size/GB	Notes
GGUF	mmproj-fp16	1.5	vision supplement
GGUF	Q2_K	3.1
GGUF	Q3_K_S	3.6
GGUF	Q3_K_M	3.9	lower quality
GGUF	Q3_K_L	4.2
GGUF	IQ4_XS	4.4
GGUF	Q4_K_S	4.6	fast, recommended
GGUF	Q4_K_M	4.8	fast, recommended
GGUF	Q5_K_S	5.4
GGUF	Q5_K_M	5.5
GGUF	Q6_K	6.4	very good quality
GGUF	Q8_0	8.2	fast, best quality
GGUF	f16	15.3	16 bpw, overkill

Downloads last month: 591

GGUF

Model size

3.4B params

Architecture

qwen2

Hardware compatibility

Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW

Image-Text-to-Text

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for rootonchair/Vintern-3B-beta-GGUF

Base model

Qwen/Qwen2.5-3B

Finetuned

Qwen/Qwen2.5-3B-Instruct

Finetuned

5CD-AI/Vintern-3B-beta

Quantized

(1)

this model

Collection including rootonchair/Vintern-3B-beta-GGUF

Vietnamese VLM GGUF

Easy Vietnamese VLM deployment • 4 items • Updated 26 days ago