This is the example of this PR
Note that I trained only 2 epochs. I think more training epochs improve the quality of results.
Dataset
I used dataset from pokemon-blip-captions
Example result
prompt = 'cute dragon creature'
Note that there is large influnce of random seed.