UD-Q5_K_XL ?
since Q5KM is 21.7 GB, a UD-Q5_K_XL gguf should be around 20GB~ and can be fully loaded into a 24gb card
Thank you for these UD ggufs!
Thanks for the suggestion we'll see what we can do :)
We've uploaded them all now
Also with a new improved calibration dataset :)
We've uploaded them all now
Also with a new improved calibration dataset :)
Thank you so much! Have a nice day!
We've uploaded them all now
Also with a new improved calibration dataset :)
@shimmyshimmer Did you upload the calibration dataset? I am having trouble finding it.
Also, I wonder if you all might collaborate with @awni since he’s experimenting with calibration datasets for Qwen (using MLX’s DWQ).
We've uploaded them all now
Also with a new improved calibration dataset :)
@shimmyshimmer Did you upload the calibration dataset? I am having trouble finding it.
Also, I wonder if you all might collaborate with @awni since he’s experimenting with calibration datasets for Qwen (using MLX’s DWQ).
We meant we uploaded the new Q5 XL etc quants. Could be interesting!