Paperspace Gradient instructions for Dreambooth training LoRA for SD3
https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/README_sd3.md
I use Paperspace Gradient and was able to get this working with a couple of small adjustments to the code recommended in that README. My script is below, in case it's helpful to anyone (I acknowledge there may be more proper/efficient ways to script this; I'm not a pro, just muddling through).
A few important notes about the script:
My Gradient notebook was instantiated using the "cyberes/gradient-base-py3.10" image ... I have no idea how any of the following will behave on a vanilla Paperspace notebook.
You need to open Terminal and run huggingface-cli login first; you also need gated access to the SD3 model here (duh). A notebook-based alternative to the terminal login is sketched after these notes.
You have to manually delete a file called /notebooks/storage/diffusers/examples/dreambooth/dog/.huggingface ... you may have to run the script once before this file is even created, I'm not sure. However it happens, once it exists it must be nuked for the script to run (a one-line cell for this is included in the sketch below).
I couldn't get wandb to work, so I added a !wandb disabled line before accelerate launch runs. I assume you can just remove the --report_to flag from the training command at the end instead, if you prefer.
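For reference, the terminal login and the cache cleanup from the notes above look roughly like this as notebook cells (a sketch; the "hf_..." placeholder is your own access token, and the path is the one from the note above):

from huggingface_hub import login
login(token="hf_...")  # your HF access token; the account must have accepted the SD3 gated-model terms

!rm -rf /notebooks/storage/diffusers/examples/dreambooth/dog/.huggingface  # clear the stale .huggingface entry so the script can re-run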
!pip install --upgrade diffusers[torch]
!pip install --upgrade diffusers[flax]
!pip install peft bitsandbytes
!git clone https://github.com/huggingface/diffusers
%cd diffusers
!pip install -e .
%cd examples/dreambooth
!pip install -r requirements_sd3.txt
!accelerate config default
#from accelerate.utils import write_basic_config
#write_basic_config()
from huggingface_hub import snapshot_download
local_dir = "./dog-ex"
snapshot_download(
    "diffusers/dog-example",
    local_dir=local_dir,
    repo_type="dataset",
    ignore_patterns=".gitattributes",
)
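A quick sanity check that the example images actually landed (assuming the snapshot_download call above completed without errors):

import os
print(os.listdir(local_dir))  # should list the example dog images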
MODEL_NAME="stabilityai/stable-diffusion-3-medium-diffusers"
INSTANCE_DIR="dog"
OUTPUT_DIR="trained-sd3-lora"
!wandb disabled
!accelerate launch train_dreambooth_lora_sd3.py \
  --pretrained_model_name_or_path=$MODEL_NAME \
  --instance_data_dir=$INSTANCE_DIR \
  --output_dir=/raid/.cache/$OUTPUT_DIR \
  --mixed_precision="fp16" \
  --instance_prompt="a photo of sks dog" \
  --resolution=1024 \
  --train_batch_size=1 \
  --gradient_accumulation_steps=4 \
  --learning_rate=1e-5 \
  --report_to="wandb" \
  --lr_scheduler="constant" \
  --lr_warmup_steps=0 \
  --max_train_steps=500 \
  --weighting_scheme="logit_normal" \
  --validation_prompt="A photo of sks dog in a bucket" \
  --validation_epochs=25 \
  --seed="0" \
  --push_to_hub
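Once training finishes, a quick test generation looks roughly like this (a sketch, assuming the LoRA weights ended up under /raid/.cache/trained-sd3-lora as in the command above; if you used --push_to_hub you can pass your Hub repo id instead):

import torch
from diffusers import StableDiffusion3Pipeline

pipe = StableDiffusion3Pipeline.from_pretrained(
    "stabilityai/stable-diffusion-3-medium-diffusers", torch_dtype=torch.float16
).to("cuda")
pipe.load_lora_weights("/raid/.cache/trained-sd3-lora")  # path or Hub repo id of the trained LoRA
image = pipe("A photo of sks dog in a bucket", num_inference_steps=28, guidance_scale=7.0).images[0]
image.save("sks_dog.png")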
I posted this in response to someone else's thread, but thought I'd post it here to make it easier to find (in case it's actually helpful to anyone).
Could you please confirm: if I want to train on a dataset of different images with a JSON file that has an image column along with descriptions, what should the parameters be? Do I pass dataset_name, and do we still use instance_prompt?
export MODEL_NAME="stabilityai/stable-diffusion-3-medium-diffusers"
export OUTPUT_DIR=""
accelerate launch train_dreambooth_lora_sd3.py \
  --pretrained_model_name_or_path=$MODEL_NAME \
  --output_dir=$OUTPUT_DIR \
  --mixed_precision="fp16" \
  --dataset_name="" \
  --image_column="image" \
  --caption_column="text" \
  --instance_prompt="" \
  --validation_prompt="" \
  --resolution=1024 \
  --center_crop \
  --random_flip \
  --train_batch_size=1 \
  --train_text_encoder \
  --gradient_accumulation_steps=4 --gradient_checkpointing \
  --optimizer="prodigy" \
  --learning_rate=1.0 \
  --text_encoder_lr=1.0 \
  --lr_scheduler="constant" \
  --lr_warmup_steps=0 \
  --max_train_steps=150 \
--seed="0" \
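For what it's worth, when --dataset_name points at a local folder I believe it needs the datasets "imagefolder" layout: the images next to a metadata.jsonl whose columns line up with --image_column / --caption_column. An illustrative sketch (file names made up, not tested with this script):

my_dataset/metadata.jsonl
my_dataset/0001.jpg
my_dataset/0002.jpg

with metadata.jsonl lines like:

{"file_name": "0001.jpg", "text": "a photo of sks dog on the beach"}
{"file_name": "0002.jpg", "text": "a photo of sks dog sleeping on a couch"}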