matrixportal
/

X-Ray_Alpha-GGUF

Model card Files Files and versions

matrixportal/X-Ray_Alpha-GGUF

This model was converted to GGUF format from SicariusSicariiStuff/X-Ray_Alpha using llama.cpp via the ggml.ai's all-gguf-same-where space. Refer to the original model card for more details on the model.

✅ Quantized Models Download List

🔍 Recommended Quantizations

✨ General CPU Use: Q4_K_M (Best balance of speed/quality)
📱 ARM Devices: Q4_0 (Optimized for ARM CPUs)
🏆 Maximum Quality: Q8_0 (Near-original quality)

📦 Full Quantization Options

🚀 Download	🔢 Type	📝 Notes
Download		Basic quantization
Download		Small size
Download		Balanced quality
Download		Better quality
Download		Fast on ARM
Download		Fast, recommended
Download	⭐	Best balance
Download		Good quality
Download		Balanced
Download		High quality
Download	🏆	Very good quality
Download	⚡	Fast, best quality
Download		Maximum accuracy
Download		Multimodal projection file for image processing

💡 Pro Tip: Start with Q4_K_M for most use cases, only use F16 if you need maximum precision.

Downloads last month: 217

GGUF

Model size

3.88B params

Architecture

gemma3

Hardware compatibility

Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for matrixportal/X-Ray_Alpha-GGUF

Base model

google/gemma-3-4b-pt

Finetuned

google/gemma-3-4b-it

Finetuned

SicariusSicariiStuff/X-Ray_Alpha

Quantized

(14)

this model