redponike committed
Commit ce7cd07 · verified · 1 Parent(s): 03dfa97

Create README.md

  1. README.md +13 -0
README.md ADDED
@@ -0,0 +1,13 @@
+ ---
+ base_model:
+ - nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
+ ---
+ GGUF quants of [nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1)
+
+ Using llama.cpp b5436 (commit be0239693c1530a18496086331fc18d8a9adbad1)
+
+ The importance matrix was generated with calibration_datav3.txt.
+
+ All quants were generated/calibrated with the imatrix, including the K quants.
+
+ Quantized from BF16.
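
The README describes a two-step flow: compute an importance matrix from calibration text, then quantize the BF16 GGUF with that matrix. The sketch below only assembles the corresponding llama.cpp command lines. It is not the author's actual script. The GGUF and imatrix file names and the list of quant types are assumptions made for illustration; only `calibration_datav3.txt` and the BF16 source come from the README. It also assumes the `llama-imatrix` and `llama-quantize` binaries from a llama.cpp build are on `PATH`.

```python
import shlex
from pathlib import Path


def imatrix_cmd(bf16_model, calibration, imatrix_out):
    """Build the argv for computing an importance matrix from calibration text."""
    return [
        "llama-imatrix",
        "-m", str(bf16_model),    # BF16 GGUF used as the reference model
        "-f", str(calibration),   # plain-text calibration data
        "-o", str(imatrix_out),   # where the importance matrix is written
    ]


def quantize_cmd(bf16_model, imatrix, quant_type, out_dir="."):
    """Build the argv for quantizing the BF16 GGUF with the importance matrix."""
    # Hypothetical naming scheme: drop a trailing "-BF16" and append the quant type.
    base = Path(bf16_model).stem.removesuffix("-BF16")
    out_path = Path(out_dir) / f"{base}-{quant_type}.gguf"
    return [
        "llama-quantize",
        "--imatrix", str(imatrix),
        str(bf16_model),
        str(out_path),
        quant_type,
    ]


if __name__ == "__main__":
    # All file names below are illustrative assumptions, not taken from the repo.
    model = "Llama-3.1-Nemotron-Nano-4B-v1.1-BF16.gguf"
    imatrix = "imatrix.dat"
    print(shlex.join(imatrix_cmd(model, "calibration_datav3.txt", imatrix)))
    # The README says every quant, K quants included, used the imatrix.
    for quant_type in ["Q4_K_M", "Q5_K_M", "Q6_K"]:  # assumed subset of types
        print(shlex.join(quantize_cmd(model, imatrix, quant_type)))
```

Printing the commands rather than running them keeps the sketch free of side effects. Each list can be passed to `subprocess.run(cmd, check=True)` once the paths match a real setup.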