Support Us Through

(support/donation image)

GGUF Version

GGUF build with quantizations, letting you run the model in KoboldCpp and other AI environments!
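If you just want to try the model quickly, here is a minimal sketch using the llama-cpp-python bindings (the same llama.cpp runtime KoboldCpp is built on). The repo id comes from this card, but the .gguf filename below is a placeholder assumption; check the repository's file list for the actual name:

```python
# Minimal sketch, not official instructions for this model.
from huggingface_hub import hf_hub_download  # pip install huggingface_hub
from llama_cpp import Llama                  # pip install llama-cpp-python

# Placeholder filename -- look up the real .gguf name in the repo's file list.
model_path = hf_hub_download(
    repo_id="N-Bot-Int/ZoraBetaA2-Q16",
    filename="ZoraBetaA2-Q16.gguf",
)

llm = Llama(model_path=model_path, n_ctx=2048)   # load the quantized weights
out = llm("Hello! Who are you?", max_tokens=64)  # simple completion call
print(out["choices"][0]["text"])
```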

Quantizations:

| Quant Type | Benefits | Cons |
| --- | --- | --- |
| Q16_0 | ✅ Highest accuracy (closest to the full model) | ❌ Requires significantly more VRAM/RAM |
| | ✅ Best for complex reasoning & detailed outputs | ❌ Slower inference compared to Q4 & Q5 |
| | ✅ Suitable for high-end GPUs & serious workloads | ❌ Larger file size (takes more storage) |
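As a rough sanity check on the VRAM/RAM warning above (assuming the 7.24B parameter count listed below): 16-bit weights take 2 bytes per parameter, so 7.24B × 2 B ≈ 14.5 GB (~13.5 GiB) for the weights alone, before the context cache and runtime overhead.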

Model Details:

Read the full model details on Hugging Face: Model Details Here

Model size: 7.24B params
Architecture: llama
Format: GGUF, 16-bit
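To double-check those numbers against a file you have downloaded, here is a small sanity check (the filename below is a placeholder, not confirmed by this card): 16-bit weights should come out to roughly 2 bytes per parameter on disk.

```python
import os

path = "ZoraBetaA2-Q16.gguf"  # placeholder: use your downloaded filename
params = 7.24e9               # parameter count listed on this card

size = os.path.getsize(path)  # file size in bytes
print(f"{size / 1024**3:.1f} GiB on disk, "
      f"{size / params:.2f} bytes per parameter")  # expect ~2.0 for 16-bit
```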
