Dreambooth and Fine Tuning for Stable Diffusion 1.5 and 2 with this Versatile Script

Channel:
Subscribers:
2,810
Published on ● Video Link: https://www.youtube.com/watch?v=MX7GTcHr9oI



Duration: 15:38
4,058 views
50


Been playing with this for a few days now. Took a while to work out kinks, but things seem smooth now. Take a look at this project if you are trying to fine tune or dreambooth train a Stable Diffusion 1.5 or 2 model.

Github page for Kohya_SS script
https://github.com/bmaltais/kohya_ss

use the launch flag --max_train_steps = 50000 or another high value to allow the script to continue after 1 epoch

Intro https://www.youtube.com/watch?v=MX7GTcHr9oI&t=0m1s
Rambling about memory usage https://www.youtube.com/watch?v=MX7GTcHr9oI&t=1m30s
Setup https://www.youtube.com/watch?v=MX7GTcHr9oI&t=3m0s
Processing images and generating captions https://www.youtube.com/watch?v=MX7GTcHr9oI&t=6m0s
Altering captions with Notepad++ https://www.youtube.com/watch?v=MX7GTcHr9oI&t=7m30s
Dataset structure https://www.youtube.com/watch?v=MX7GTcHr9oI&t=8m15s
Launch script example and script launch options https://www.youtube.com/watch?v=MX7GTcHr9oI&t=8m50s
Comparing the fine tuned model to SD 1.5 base at various epochs
https://www.youtube.com/watch?v=MX7GTcHr9oI&t=10m30s
Bye for now, back soon with another https://www.youtube.com/watch?v=MX7GTcHr9oI&t=15m15s

Startup shell script
https://pastebin.com/wSQXxnN6

Command line task list:
https://pastebin.com/A0M7QuE3

Use the script from the Shivam Shiraro diffusers library to convert models to ckpt for the WebUI:
python convert_diffusers_to_original_stable_diffusion.py --model_path /IN/MODEL/DIR --checkpoint_path /OUT/FILE.CKPT




Other Videos By NanoNomad


2023-03-15Training or Fine Tuning a Hindi Language VITS TTS Voice Model with Coqui TTS on Google Colab
2023-03-05Install and Configure Retroarch for PS Vita with Thumbnails, Overlays and Shaders
2023-03-03Fallout 1 on the PS Vita is the Best Way to Play
2023-02-24Train or Fine Tune VITS on (theoretically) Any Language | Train Multi-Speaker Model | Train YourTTS
2023-02-12Even more Voice Cloning | Train a Multi-Speaker VITS model using Google Colab and a Custom Dataset
2023-02-04Updated | Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab
2023-01-30YourTTS Training Discussion | Experiences, Multistage Training, Demos, Prior Training Preservation
2023-01-27Updated | Fine-Tuning YourTTS with Automated STT Datasets on Google Colab for AI Voice Cloning
2023-01-13Fine-Tune YourTTS with Near-Automated Datasets on Google Colab for AI Voice Cloning
2022-12-22Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab or Linux
2022-12-09Dreambooth and Fine Tuning for Stable Diffusion 1.5 and 2 with this Versatile Script
2022-11-30If Bill Gates could rap? AI Synthesized Voice, AI Upsampled Video | Deltron 3030's Virus
2022-11-14Training Stable Diffusion Dreambooth on Multiple Subjects for Combined Image Generation
2022-10-31Locally Train Stable Diffusion with Dreambooth using WSL Ubuntu
2022-10-25Animated Stable Diffusion and Synthesized Voice Demo with Facial Movements
2022-10-24Stable Diffusion Image to Video, Synthesized Lauretta Young 1930s voice, Wav2Lip Demo
2022-10-16Animate Images using AI with Frame Interpolation for Large Motion
2022-10-14Animated Stable Diffusion Images using Google's FILM Frame Interpolation for Large Motion demo
2022-10-07Training Textual Inversion for Stable Diffusion | Customizable AI Image Generation
2022-09-26How to Download All Styles and Objects from the Stable Diffusion Concepts Library | AI Images
2022-09-05AI Images | Installing Stable Diffusion and the Automatic1111 WebUI using Conda on Windows 10



Tags:
Stable Diffusion
Dreambooth
Fine-Tuning
AI
Artificial Intelligence
AI Images