Update README.md
README.md CHANGED
@@ -195,7 +195,7 @@ license_name: llama4
 pipeline_tag: image-text-to-text
 ---
 
-Currently text only is supported. Created with llama.cpp b5074: `python llama.cpp/convert_hf_to_gguf.py --outfile Llama-4-Maverick-17B-128E-Instruct-bf16.gguf --outtype bf16 models--unsloth--Llama-4-Maverick-17B-128E-Instruct/snapshots/4d0b9b85d7b4c203d8354c4b645021d1985032c1 --use-temp-file`. I did this to be able to create proper quantisation by running this command
+Currently text only is supported. Created with llama.cpp b5074: `python llama.cpp/convert_hf_to_gguf.py --outfile Llama-4-Maverick-17B-128E-Instruct-bf16.gguf --outtype bf16 models--unsloth--Llama-4-Maverick-17B-128E-Instruct/snapshots/4d0b9b85d7b4c203d8354c4b645021d1985032c1 --use-temp-file`. I did this to be able to create proper quantisation by running this command: `llama-quantize --leave-output-tensor --token-embedding-type BF16 Llama-4-Maverick-17B-128E-Instruct-bf16.gguf Llama-4-Maverick-17B-128E-Instruct-q8-with-bf16-embedding-and-bf16-output.gguf Q8_0`. You can check my quant here: https://huggingface.co/GeorgyGUF/Llama-4-Maverick-17B-128E-Instruct-q8-with-bf16-embedding-and-bf16-output.gguf
 
 **Chat template/prompt format:**
 ```
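For readability, the conversion and quantisation steps recorded in the added line above can be laid out as a short shell sequence. The commands, flags, file names, and snapshot hash are taken verbatim from the commit; the sketch assumes llama.cpp b5074 is checked out and the `llama-quantize` binary is built and on the PATH.

```sh
# Step 1: convert the downloaded HF snapshot to a BF16 GGUF (llama.cpp b5074).
python llama.cpp/convert_hf_to_gguf.py \
  --outfile Llama-4-Maverick-17B-128E-Instruct-bf16.gguf \
  --outtype bf16 \
  models--unsloth--Llama-4-Maverick-17B-128E-Instruct/snapshots/4d0b9b85d7b4c203d8354c4b645021d1985032c1 \
  --use-temp-file

# Step 2: quantise to Q8_0 while leaving the output tensor unquantised
# and keeping the token embeddings in BF16.
llama-quantize \
  --leave-output-tensor \
  --token-embedding-type BF16 \
  Llama-4-Maverick-17B-128E-Instruct-bf16.gguf \
  Llama-4-Maverick-17B-128E-Instruct-q8-with-bf16-embedding-and-bf16-output.gguf \
  Q8_0
```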