Questions: hoping to develop a node for ComfyUI
Was the convert.py script from city96/ComfyUI-GGUF used? If so, were any modifications made to it to support the HiDreamImageTransformer2DModel architecture (e.g., adding a ModelHiDream class to convert.py)? Thank you!
Not that one; you could do it with the convertor if you have the GGUF node, but you should merge the safetensors first.
Looks like ComfyUI has made changes to support HiDream and GGUF loading. I am getting the following error on load of the q8 GGUF, and a similar error exists for all of them. Do you know if this is an error with the GGUF file or with the loader?
Error(s) in loading state_dict for HiDreamImageTransformer2DModel:
While copying the parameter named "double_stream_blocks.0.block.ff_i.gate.weight", whose dimensions in the model are torch.Size([4, 2560]) and whose dimensions in the checkpoint are torch.Size([4, 2720]), an exception occurred : ('The size of tensor a (2560) must match the size of tensor b (2720) at non-singleton dimension 1',).
While copying the parameter named "double_stream_blocks.1.block.ff_i.gate.weight", whose dimensions in the model are torch.Size([4, 2560]) and whose dimensions in the checkpoint are torch.Size([4, 2720]), an exception occurred : ('The size of tensor a (2560) must match the size of tensor b (2720) at non-singleton dimension 1',). ...
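Since every mismatch here follows the same pattern (the checkpoint has 2720 where the model expects 2560), it can help to diff all the shapes up front instead of letting `load_state_dict` fail one parameter at a time. A minimal, framework-free sketch (the helper name is my own, not part of ComfyUI):

```python
# Hypothetical helper: diff checkpoint shapes against the model's expected
# shapes before loading, so every mismatch is reported at once.
def diff_shapes(model_shapes, ckpt_shapes):
    """Return {name: (model_shape, ckpt_shape)} for every shape mismatch."""
    mismatches = {}
    for name, model_shape in model_shapes.items():
        ckpt_shape = ckpt_shapes.get(name)
        if ckpt_shape is not None and tuple(ckpt_shape) != tuple(model_shape):
            mismatches[name] = (tuple(model_shape), tuple(ckpt_shape))
    return mismatches

# Shapes taken from the error above:
model = {"double_stream_blocks.0.block.ff_i.gate.weight": (4, 2560)}
ckpt = {"double_stream_blocks.0.block.ff_i.gate.weight": (4, 2720)}
print(diff_shapes(model, ckpt))
# → {'double_stream_blocks.0.block.ff_i.gate.weight': ((4, 2560), (4, 2720))}
```

With real models you would build the two dicts from `model.state_dict()` and the loaded checkpoint, e.g. `{k: v.shape for k, v in sd.items()}`.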
How about the fp8 safetensors? The first three (q4_0, q5_0 and q8_0) might need a bit of adjustment; the others should work. Try the GGUF QuadrupleCLIP Loader with the code update later; still working on it.
I tried using the Load Diffusion Model node with the fp8 safetensors; it gives me the same error when it tries to do KSampling. I also tried the q4_0, but it gave a very similar error: the dimension numbers were different, but still a mismatch. I saw city96 is uploading GGUFs as well; I am downloading those to see if they work. https://huggingface.co/city96/HiDream-I1-Dev-gguf
ok, thanks; stay tuned
I tried city96's GGUFs; they get past the GGUF loading stage, but it gives this error on KSampling:
mat1 and mat2 shapes cannot be multiplied (2x768 and 2048x2560)
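That `mat1 and mat2` error is PyTorch's matrix-multiply shape rule: `mat1 @ mat2` needs `mat1`'s last dimension to equal `mat2`'s first. Here a `(2, 768)` activation is hitting a layer whose weight expects a 2048-wide input, which would be consistent with the wrong text-encoder output being fed in (768 is a typical CLIP embedding width) — that reading is my inference, not confirmed. A tiny check of the rule:

```python
def matmul_compatible(a_shape, b_shape):
    """mat1 @ mat2 requires mat1's last dim to equal mat2's first dim."""
    return a_shape[-1] == b_shape[0]

# The failing shapes from the KSampler error:
print(matmul_compatible((2, 768), (2048, 2560)))   # inner dims 768 vs 2048 -> False
print(matmul_compatible((2, 2048), (2048, 2560)))  # what the layer expects -> True
```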
Hoping for a fix soon.
For the quants in this repo, they likely need to be re-uploaded, as the FFN gate weight gets loaded into a torch.nn.Parameter, which means you have to keep it in either FP32 or FP16 for it to be loadable.
For my quants, I did a quick test and they did work on the instance I was testing on. You do need all 4 text encoders from the Comfy repo for it to work, though. (L8 is just Llama 8B; I made that myself before Comfy finished uploading. The CLIP models in that repo are, I think, also different, as they include the projection weight while the ones Comfy uploaded with Flux didn't, so you may need to download those as well.)
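The fix described above amounts to a per-tensor type rule in the conversion script: anything that lands in a plain `torch.nn.Parameter` (like the `ff_i.gate` weights) stays FP16/FP32, while large 2-D matrices get quantized. A hedged sketch of that kind of rule — the key pattern and size threshold are illustrative assumptions, not the actual convert.py logic:

```python
# Assumed pattern for tensors that must stay high precision; the real
# conversion script may match different keys.
KEEP_HIGH_PRECISION_PATTERNS = (".gate.weight",)

def choose_dtype(name, shape, quant_type="Q8_0"):
    """Pick a per-tensor type: quantize big matrices, keep the rest in F16."""
    if any(name.endswith(p) for p in KEEP_HIGH_PRECISION_PATTERNS):
        return "F16"  # nn.Parameter targets must stay FP16/FP32 to load
    if len(shape) < 2 or min(shape) < 256:
        return "F16"  # 1-D / tiny tensors aren't worth quantizing
    return quant_type

print(choose_dtype("double_stream_blocks.0.block.ff_i.gate.weight", (4, 2560)))  # F16
print(choose_dtype("double_stream_blocks.0.block.ff.w1.weight", (2560, 2560)))   # Q8_0
```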
thanks
Which one is recommended after testing, Full or Dev? I have 24 GB of VRAM.
@zhaoqi The difference between Full and Dev is speed vs. quality: Full recommends 50 steps while Dev recommends 28 steps, and they consume the same amount of VRAM. I think Full is better than Dev, but I have only been working with the NF4 models. I would try the q8 model, and if it's too slow, step down to q6 or q4.
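To pick a quant for a given card, you can estimate the transformer's weight footprint from the effective bits-per-weight of each GGUF type. The ~17B parameter count is an assumption (check the HiDream-I1 model card), and the bpw figures are the usual llama.cpp values; actual VRAM use will be higher once text encoders, VAE, and activations are added:

```python
# Typical effective bits per weight for common GGUF quant types (llama.cpp).
BITS_PER_WEIGHT = {"Q8_0": 8.5, "Q6_K": 6.56, "Q5_0": 5.5, "Q4_0": 4.5}

def est_gib(n_params, quant):
    """Rough size of the quantized weights in GiB."""
    return n_params * BITS_PER_WEIGHT[quant] / 8 / 1024**3

for q in BITS_PER_WEIGHT:
    print(f"{q}: ~{est_gib(17e9, q):.1f} GiB")
```

On this estimate, q8 weights alone come in under 24 GiB, which matches the advice above to start at q8 and only step down if it is too slow or spills out of VRAM.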