DATAGRID-research/DATAGRID-Local-Attention-DiT-v1.0.0-0.52B

PixArtSigmaPipeline


DATAGRID-research committed 20 days ago

Commit f00d2c7 · verified · 1 Parent(s): 5c1ff9d

Update README.md

Files changed (1)

README.md +1 -1

@@ -2,7 +2,7 @@
 LocalDiT is a lightweight Diffusion Transformer model for high-quality text-to-image generation that incorporates local attention mechanisms to improve computational efficiency while maintaining generation quality.
 # Model Description
-LocalDiT builds upon the architecture of PixArt-α, introducing local attention mechanisms to reduce computational complexity and memory requirements. By processing image patches in local windows rather than with global attention, the model achieves faster inference and training while preserving image generation quality.
+LocalDiT builds upon the architecture of [PixArt-α](https://huggingface.co/PixArt-alpha/PixArt-XL-2-1024-MS), introducing local attention mechanisms to reduce computational complexity and memory requirements. By processing image patches in local windows rather than with global attention, the model achieves faster inference and training while preserving image generation quality.
 - **Type**: Diffusion Transformer (DiT) with Local Attention
 - **Parameters**: 0.52B