lmstudio-community/DeepSeek-R1-Distill-Qwen-14B-GGUF Text Generation β’ Updated 7 days ago β’ 19.7k β’ 11
view post Post 4426 It is now possible to generate 16 Megapixel (4096x4096) raw images with SANA 4K model using under 8GB VRAM, 4 Megapixel (2048x2048) images using under 6GB VRAM, and 1 Megapixel (1024x1024) images using under 4GB VRAM thanks to new optimizations13 January 2024 UpdateInstallers : https://www.patreon.com/posts/from-nvidia-labs-116474081New 4K Tutorial Video : https://youtu.be/GjENQfHF4W8Now the APP will use Diffusers Pipeline and it has huge VRAM optimizationsYou need to reinstallThe models will be downloaded into your Hugging Face cache folder when you first time generate somethingHow to Get Installation Logs and How to Change Hugging Face Cache Folder : https://www.patreon.com/posts/108419878Please make a fresh installWhen you enable all 4 optimizations the VRAM usages are like belowMake sure shared VRAM is enabled because initial loading of the model need more VRAMEnable VAE Tiling + Enable VAE Slicing + Enable Model CPU Offload +Enable Sequential CPU Offload1K (1024x1024) : 4 GB GPUs2K (2048x2048) : 6 GB GPUs4K (4096x4096) : 8 GB GPUsStill in any case may work on your GPU test itJust Enable VAE Tiling + Enable Model CPU Offload works great in many casesAll below attached images are generated via SANA 4K model, they are RAW and their resolution is 5376x3072Official repo page : https://github.com/NVlabs/Sana See translation 2 replies Β· π₯ 9 9 π 6 6 π 6 6 β€οΈ 4 4 π 4 4 π€ 3 3 π€ 3 3 π 2 2 β 2 2 π§ 2 2 π€― 2 2 π 1 1 + Reply
Efficient-Large-Model/Sana_1600M_1024px_MultiLing Text-to-Image β’ Updated 17 days ago β’ 1.66k β’ 22
Sana Collection β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer β’ 19 items β’ Updated 20 days ago β’ 87