Model Card: `vital-ai/watt-tool-70B-awq`

Model Description

This model, `vital-ai/watt-tool-70B-awq`, is a quantized version of the base model `watt-ai/watt-tool-70B`. It was quantized to reduce model size and improve inference speed while preserving as much of the base model's performance as possible.

Base Model: watt-ai/watt-tool-70B

Quantization Method: 4-bit AWQ
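
As a rough illustration of what 4-bit quantization saves, weight memory can be estimated from the parameter count and the bits stored per weight. The sketch below assumes a nominal 70 billion parameters (read off the model name, not stated in this card) and a hypothetical helper `weight_memory_gb`. It ignores AWQ's per-group scales and zero points, any layers left unquantized, and runtime overhead such as the KV cache, so the real footprint will be somewhat larger.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: "70B" in the model name means roughly 70e9 weights.
NOMINAL_PARAMS = 70e9

fp16_gb = weight_memory_gb(NOMINAL_PARAMS, 16)  # unquantized half precision
awq4_gb = weight_memory_gb(NOMINAL_PARAMS, 4)   # 4-bit quantized weights

print(f"FP16 weights:  ~{fp16_gb:.0f} GB")
print(f"4-bit weights: ~{awq4_gb:.0f} GB")
```

Under these assumptions the 4-bit weights take about a quarter of the FP16 storage, which is the main reason a quantized 70B model can fit on fewer or smaller GPUs.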

Safetensors

Model size: 11.3B params

Tensor type: I32, BF16, F16
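
The I32 tensor type is consistent with how 4-bit quantized weights are commonly stored: several 4-bit values are packed into each 32-bit integer. That is probably why the reported parameter count is well below 70B. The sketch below shows the general idea with two hypothetical helpers, `pack_int4` and `unpack_int4`. It uses a simple lowest-nibble-first order, which is an assumption; the actual AWQ checkpoint layout (nibble ordering, grouping, scales and zero points) may differ.

```python
def pack_int4(values):
    """Pack eight 4-bit values (0..15) into one 32-bit word, lowest nibble first."""
    if len(values) != 8:
        raise ValueError("expected exactly 8 values")
    word = 0
    for i, v in enumerate(values):
        if not 0 <= v <= 15:
            raise ValueError(f"value {v} does not fit in 4 bits")
        word |= v << (4 * i)  # place value i in bits [4i, 4i+3]
    return word


def unpack_int4(word):
    """Recover the eight 4-bit values from a packed 32-bit word."""
    return [(word >> (4 * i)) & 0xF for i in range(8)]


packed = pack_int4([1, 2, 3, 4, 5, 6, 7, 8])
print(hex(packed))
print(unpack_int4(packed))
```

With this scheme, each stored 32-bit element holds eight quantized weights, so the element count of a packed tensor is one eighth of the number of weights it represents.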

Model tree for vital-ai/watt-tool-70B-awq

Base model: meta-llama/Llama-3.1-70B

Finetuned: meta-llama/Llama-3.3-70B-Instruct

Finetuned: watt-ai/watt-tool-70B

Quantized (6): this model
