redponike
/

Llama-3.1-Nemotron-Nano-4B-v1.1-GGUF

Model card Files Files and versions Community

redponike commited on 4 days ago

Commit

ce7cd07

·

verified ·

1 Parent(s): 03dfa97

Create README.md

Files changed (1) hide show

README.md +13 -0

README.md ADDED Viewed

	@@ -0,0 +1,13 @@

+---
+base_model:
+- nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
+---
+GGUF quants of [nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1)
+Using llama.cpp b5436 (commit be0239693c1530a18496086331fc18d8a9adbad1)
+The importance matrix was generated with calibration_datav3.txt.
+All quants were generated/calibrated with the imatrix, including the K quants.
+Quantized from BF16.