A sample from the Laion 6.5+ image + text dataset. You can see some samples here.
The samples are resized + center-cropped to 64x64x3 and the .npz file also contains CLIP embeddings.
TODO: add img2dataset script.
The data can be used to train a basic text-to-image model.