quantized_by: tooktang

Nanbeige2 8B Chat - GGUF

Description

This repo contains GGUF format model files for Nanbeige2 8B Chat.
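As a quick sanity check after downloading, the sketch below confirms that a file starts with the GGUF magic bytes and reads the format version that follows. It assumes the usual little-endian layout (4-byte `GGUF` magic, then an unsigned 32-bit version). The helper name `read_gguf_version` and the file name in the usage comment are hypothetical, not part of this repo.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of a GGUF file


def read_gguf_version(path: str) -> int:
    """Return the GGUF format version, or raise ValueError if the header is invalid.

    Assumes a little-endian file: 4-byte magic followed by a uint32 version.
    """
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != GGUF_MAGIC:
            raise ValueError(f"not a GGUF file (magic bytes: {magic!r})")
        raw_version = f.read(4)
        if len(raw_version) != 4:
            raise ValueError("file is truncated before the version field")
        (version,) = struct.unpack("<I", raw_version)
    return version


# Usage (hypothetical file name, substitute the file you downloaded):
# print(read_gguf_version("nanbeige2-8b-chat.Q4_K_M.gguf"))
```

A truncated or interrupted download often fails this check, so it can be worth running before loading the file into an inference runtime.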

Format: GGUF
Model size: 7.77B params
Architecture: llama

Quantization levels: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit, 16-bit
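To get a rough feel for how the bit widths above translate into download size, the sketch below multiplies the 7.77B parameter count by the bits per weight. This is a back-of-the-envelope lower bound only: it assumes every weight is stored at exactly the nominal width and ignores metadata and any mixed-precision tensors, so real files will generally be somewhat larger. The function name `approx_size_gb` is an illustrative choice.

```python
# Parameter count and bit widths taken from the model card.
PARAMS = 7.77e9
BIT_WIDTHS = [2, 3, 4, 5, 6, 8, 16]


def approx_size_gb(params: float, bits: int) -> float:
    """Approximate weight size in GB (10^9 bytes), assuming a uniform bit width."""
    return params * bits / 8 / 1e9


for bits in BIT_WIDTHS:
    print(f"{bits:>2}-bit: ~{approx_size_gb(PARAMS, bits):.1f} GB")
```

Comparing these estimates against available RAM or VRAM gives a first cut at which quantization is worth downloading. Leave headroom for the context cache and runtime overhead.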
