panopstor
/

ff7r-stable-diffusion

panopstor commited on Oct 21, 2022

Commit

03ed2af

1 Parent(s): 9e0c5a1

update readme with v5.1 model and laion training update

Files changed (1) hide show

README.md CHANGED Viewed

@@ -4,6 +4,20 @@ license: creativeml-openrail-m
 https://huggingface.co/spaces/CompVis/stable-diffusion-license
 This is a finetuning of the compvis stable diffusion 1.4 ckpt.  https://huggingface.co/CompVis/stable-diffusion
 As an extension to the concept of "dreambooth" training, this fine tuning includes over a dozen concepts trained in over 1400 images with individual captions on each image.

 https://huggingface.co/spaces/CompVis/stable-diffusion-license
+# New v5.1 model
+The new version is trained from a basis of the RunwayML 1.5 ckpt.  This fine tuning sheds the last remnant of the concepts in original DreamBooth paper as regularization via generated images is dropped in favor of a mix  a scrape of laion to protect the model's original qualities instead.  1636 training images, 1636 ground truth images from laion were trained for 19009 steps at LR 4e-7.
+Results here (warning, huge image files)
+[general model test](mega_test01.webp)
+[new characters test](mega_test01_characters.webp)
+There is some remaining impact to cartoon character, but there is little "bleed" of the video game context into non-video game subjects.  There are also a number of images that show improved cropping behavior even from the base Runway 1.5 file, which I attribute to careful cropping of both training and the ground truth images scraped from laion.
+# Prior info on 4.1 model
 This is a finetuning of the compvis stable diffusion 1.4 ckpt.  https://huggingface.co/CompVis/stable-diffusion
 As an extension to the concept of "dreambooth" training, this fine tuning includes over a dozen concepts trained in over 1400 images with individual captions on each image.