
Sarvam-M

Chat on Sarvam Playground

Model Information

This repository contains the GGUF version of Sarvam-M in Q8_0 precision.

Learn more about Sarvam-M in our detailed blog post.

Running the model on a CPU

You can run the model on your local machine (without a GPU) using llama.cpp, as explained here.

Example Command:

./build/bin/llama-cli -i -m /your/folder/path/sarvam-m-q8_0.gguf -c 8192 -t 16
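In the command above, `-i` enables interactive chat, `-m` points at the downloaded GGUF file, `-c` sets the context window in tokens, and `-t` sets the number of CPU threads. A minimal sketch that assembles the same invocation from named variables so each flag is explicit (the model path is a placeholder, as in the example above; adjust it and the thread count for your machine):

```shell
#!/bin/sh
# Assemble the llama-cli invocation piece by piece.
BIN=./build/bin/llama-cli                        # llama.cpp CLI binary
MODEL_PATH=/your/folder/path/sarvam-m-q8_0.gguf  # placeholder path to the GGUF file
CTX=8192                                         # context window in tokens
THREADS=16                                       # CPU threads; set to your core count

CMD="$BIN -i -m $MODEL_PATH -c $CTX -t $THREADS"
echo "$CMD"
```

Running the script prints the full command, which you can then execute from your llama.cpp checkout.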
Model details

Format: GGUF
Precision: 8-bit (Q8_0)
Model size: 23.6B params
Architecture: llama


Model tree for sarvamai/sarvam-m-q8-gguf

Base model: sarvamai/sarvam-m (finetuned)
This model is one of 17 quantized versions.
