gguf when? c'mon, it's been 11 min already!


lol well darn, i had plans today... oof... as a quantizer, i wonder if i should wait for the -Instruct? is that out yet? lol...

Better call @bartowski

@MarinaraSpaghetti

I'll put up the Bat-towski signal!

@ubergarm I was hoping to see you in one of these threads :D

+1 gguf please

wait for the instruct model, not sure how a gguf of the base model could be useful for personal usage

Base models are good for creative writing.


> lol well darn, i had plans today... oof... as a quantizer, i wonder if i should wait for the -Instruct? is that out yet? lol...

How dare you have plans when ds puts out a new model!!! 😂

"Why is the GGUF so late it's been 20 seconds already!"

I think let's wait for the instruct version. I am very patient. Very, very, very patient.

I think llama.cpp needs to be updated first.

I figured out how to create the bf16 safetensors; now I'm creating the bf16 gguf. We'll see.
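For anyone following along at home, that step looks roughly like this with llama.cpp's conversion script (a sketch; the model directory and output filename are placeholders):

```bash
# Convert the HF safetensors checkpoint to a bf16 GGUF.
# Run from a llama.cpp checkout; paths are placeholders.
python convert_hf_to_gguf.py /path/to/hf-model-dir \
    --outtype bf16 \
    --outfile model-bf16.gguf
```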

Yeah, seems like it needs some changes to llama.cpp. I got it inferring but the chat template seems messed up.

I'm throwing a Q4_K_M up soon while I work on imatrix and further quants.
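For reference, the usual pipeline is roughly this (a sketch; filenames and the calibration text are placeholders):

```bash
# Straight Q4_K_M quant from the bf16 GGUF.
./llama-quantize model-bf16.gguf model-Q4_K_M.gguf Q4_K_M

# Compute an importance matrix from a calibration text,
# then feed it into the lower-bit quants.
./llama-imatrix -m model-bf16.gguf -f calibration.txt -o imatrix.dat
./llama-quantize --imatrix imatrix.dat model-bf16.gguf model-IQ4_XS.gguf IQ4_XS
```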

@createthis it's also a base model, so chatting is not going to be as reliable without giving it a multi-turn prompt.
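Something along these lines, i.e. raw completion with a couple of hand-written turns baked into the prompt (an untested sketch; model path, prompt, and token count are placeholders):

```bash
# Base model: skip conversation mode and prime it with a
# multi-turn transcript, then let it continue the pattern.
./llama-cli -m model-Q4_K_M.gguf -no-cnv -n 256 \
    -p $'User: What is a GGUF file?\nAssistant: A binary model format used by llama.cpp.\nUser: Why quantize it?\nAssistant:'
```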

@bartowski Thanks for the llama-cli example. TIL.
