This is a reupload of the Q4_K_M quantized version of Google's Gemma 3 1B, quantized by Unsloth. It is used to benchmark llama.cpp with CUDA support on the Jetson Nano:
- https://github.com/kreier/llama.cpp-jetson - instructions to compile llama.cpp with CUDA support
- https://github.com/kreier/llama.cpp-jetson.nano - precompiled versions that can be installed on the Jetson in minutes
- https://github.com/kreier/jetson - a few other insights and results
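A benchmark run with this model might look like the sketch below. The GGUF file name and the path to `llama-bench` are assumptions (check this repo's Files tab and your llama.cpp build directory); the actual commands are shown as comments since they require the model and a CUDA build of llama.cpp to be present.

```shell
# Hypothetical file name for the Q4_K_M GGUF in this repo; adjust as needed.
MODEL="gemma-3-1b-it-Q4_K_M.gguf"

# Download the file with huggingface-cli (part of the huggingface_hub package),
# then measure prompt-processing and token-generation throughput with llama-bench,
# offloading all layers to the GPU (-ngl 99):
#
#   huggingface-cli download <this-repo-id> "$MODEL" --local-dir .
#   ./build/bin/llama-bench -m "$MODEL" -ngl 99

echo "benchmark target: $MODEL"
```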