Stable Diffusion
graphic storytelling
introduction
text2image generation
How does it work?
What do you need to know to get started?
Why doesn't the AI understand me?
example
santa claus snorkeling in the caribbean, western comic book
example
santa claus snorkeling in the caribbean, blacksad
How
does AI work?
it's complicated to explain, but easy to use
Compare with MP3 compression
now for AI
encoding step
latent space vector
Latent Space?
let's take a latent space walk
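A latent space walk can be sketched as moving in small steps between two latent vectors and decoding each intermediate point into an image. The snippet below is a minimal illustration under stated assumptions: it uses plain linear interpolation on toy 4-dimensional vectors, the helper names `lerp` and `latent_walk` are hypothetical, and the decoding step is left out. Real tools often use spherical interpolation and far larger vectors.

```python
def lerp(a, b, t):
    """Linearly interpolate between two equal-length vectors (t in [0, 1])."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return [(1 - t) * x + t * y for x, y in zip(a, b)]


def latent_walk(start, end, steps):
    """Return `steps` evenly spaced points from start to end, inclusive."""
    if steps < 2:
        raise ValueError("need at least two steps")
    return [lerp(start, end, i / (steps - 1)) for i in range(steps)]


# Toy "latents" (assumption: real latent vectors are much larger).
start = [0.0, 1.0, -1.0, 2.0]
end = [1.0, 0.0, 1.0, 0.0]

walk = latent_walk(start, end, steps=5)
for point in walk:
    # In a real pipeline each point would be handed to the decoder.
    print(point)
```

Each printed vector would correspond to one frame of the walk; nearby points should decode to similar-looking images.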
What is latent space?
training on data
src link
train your own latent space
- lots of data, e.g. the LAION-5B dataset
- 5.85 billion image-text pairs
- a hard drive of 240 TB
- 32 x 8 x A100 GPUs
- cost: approx. $600,000
- carbon cost: 11,250 kg CO2
- or roughly 2.5 cars each driving 15,000 km/year
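The figures above can be sanity-checked with a little arithmetic. The sketch below only uses the numbers from the list; the derived values rest on assumptions that are not in the slides: that "32 x 8" means 32 machines with 8 GPUs each, that 1 TB is 10^12 bytes, and that the whole 240 TB is taken up by the image-text pairs.

```python
# Numbers taken from the list above.
machines, gpus_per_machine = 32, 8
image_text_pairs = 5.85e9
storage_tb = 240
total_co2_kg = 11_250
cars = 2.5
km_per_car_per_year = 15_000

# Total number of GPUs (assumption: 32 machines x 8 GPUs each).
total_gpus = machines * gpus_per_machine

# Implied emissions per car: kg CO2 per year, then grams CO2 per km.
co2_per_car_year_kg = total_co2_kg / cars
co2_per_km_g = co2_per_car_year_kg * 1000 / km_per_car_per_year

# Average storage per pair (assumption: decimal terabytes, drive fully used).
kilobytes_per_pair = storage_tb * 1e12 / image_text_pairs / 1e3

print(f"GPUs: {total_gpus}")
print(f"CO2 per car-year: {co2_per_car_year_kg:.0f} kg")
print(f"CO2 per km: {co2_per_km_g:.0f} g")
print(f"Storage per image-text pair: about {kilobytes_per_pair:.0f} kB")
```

The car comparison works out to about 300 g CO2 per km, so the two carbon figures on the list are consistent with each other.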
dataset explorer
let's encode some text too
latent space vector
Remember this one?
right, a decoder is missing!
latent diffusion model
type prompt
get image
What
do you need to know to get started?
Prompt Engineering
A prompt consists of:
- 1. a (main) topic
- 2. an environment
- 3. details
- 4. atmosphere and context of the scene
- 5. style (artist, medium)
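The five ingredients above can be treated as slots that are filled in and joined into one prompt string. This is a minimal sketch, not a required format: the class name `PromptParts`, its field names and the comma-separated layout are all assumptions made for illustration, and Stable Diffusion accepts free-form text.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    """Hypothetical container for the five prompt ingredients."""
    topic: str
    environment: str = ""
    details: str = ""
    atmosphere: str = ""
    style: str = ""

    def to_prompt(self) -> str:
        # Keep the order from the list and skip slots that were left empty.
        parts = [self.topic, self.environment, self.details,
                 self.atmosphere, self.style]
        return ", ".join(p.strip() for p in parts if p.strip())


example = PromptParts(
    topic="santa claus snorkeling",
    environment="in the caribbean",
    style="western comic book",
)
print(example.to_prompt())
```

Filling in only some of the slots is fine; changing one slot at a time (for example the style) makes it easier to see what each ingredient does to the image.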
positive prompt
prompt: cyberpunk forest by Salvador Dali
img credit Stable Diffusion 2.0
negative prompt
prompt: cyberpunk forest by Salvador Dali
negative prompt: trees, green
img credit Stable Diffusion 2.0
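In code, the positive and negative prompt are usually passed as two separate inputs. The sketch below assumes the Hugging Face `diffusers` library and its `StableDiffusionPipeline`, which takes `prompt` and `negative_prompt` arguments; the helper names `build_request` and `generate`, the model id in the commented-out call and the `cuda` device are assumptions, and other front ends expose the same idea through their own fields.

```python
def build_request(prompt, negative_prompt=""):
    """Collect the arguments for one generation (hypothetical helper)."""
    request = {"prompt": prompt}
    if negative_prompt.strip():
        # Things the image should steer away from.
        request["negative_prompt"] = negative_prompt
    return request


def generate(request, model_id, device="cuda"):
    """Run the request through a diffusers pipeline and return one image."""
    # Imported lazily so the rest of the sketch runs without diffusers.
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(model_id).to(device)
    return pipe(**request).images[0]


positive_only = build_request("cyberpunk forest by Salvador Dali")
with_negative = build_request(
    "cyberpunk forest by Salvador Dali",
    negative_prompt="trees, green",
)
print(positive_only)
print(with_negative)

# Needs a GPU and a model download; the model id is an assumption.
# image = generate(with_negative, model_id="stabilityai/stable-diffusion-2")
# image.save("cyberpunk_forest.png")
```

Generating both requests and comparing the results side by side shows what the negative prompt removes.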
Need help composing a prompt?
WHY
the AI doesn't understand me
some tips
Compare
look for trouble
Learn
tweak
Complete this image in a way that proves you won’t be replaced by AI
using AI is allowed