Stable Diffusion

Learning Stable Diffusion by following the tutorial from Computerphile.

The original code is buggy, and it took me a while to get the program working.

Stable Diffusion can outperform GANs, in part because it is easier to train, and pairing it with language models lets us achieve even more. Diffusion models have opened an era of AIGC (AI-Generated Content) and brought about the rapid growth of Novel AI, Midjourney, Meitu Ai, etc.

The diffusion process can be thought of as gradually removing the noise from a noisy image.
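To make the idea concrete, here is a minimal sketch of one noising step and its inverse, using the standard formulation where a noisy sample is a weighted blend of the clean image and Gaussian noise. This is not the tutorial's code: the names `add_noise`, `remove_noise`, and `alpha_bar` are my own, the "image" is just a flat list of numbers, and the true noise stands in for what a trained network would have to predict.

```python
import math
import random


def add_noise(x0, noise, alpha_bar):
    """Forward step: blend the clean signal with Gaussian noise."""
    a = math.sqrt(alpha_bar)
    b = math.sqrt(1.0 - alpha_bar)
    return [a * x + b * n for x, n in zip(x0, noise)]


def remove_noise(xt, predicted_noise, alpha_bar):
    """Reverse estimate: recover the clean signal from a noise prediction."""
    a = math.sqrt(alpha_bar)
    b = math.sqrt(1.0 - alpha_bar)
    return [(x - b * n) / a for x, n in zip(xt, predicted_noise)]


random.seed(0)
x0 = [random.uniform(-1.0, 1.0) for _ in range(16)]   # toy "image"
noise = [random.gauss(0.0, 1.0) for _ in range(16)]   # Gaussian noise
alpha_bar = 0.3  # hypothetical noise level: smaller means noisier

xt = add_noise(x0, noise, alpha_bar)

# Assumption: a perfect noise prediction. A real model only approximates it,
# which is why denoising is done over many small steps.
x0_hat = remove_noise(xt, noise, alpha_bar)

max_error = max(abs(a - b) for a, b in zip(x0, x0_hat))
print(max_error)
```

With a perfect prediction the clean signal comes back up to floating-point error. In practice the prediction is imperfect, so the model removes a little noise at a time.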

Process

This video shows the denoising process:

“A sad Japanese lady in kimono weeping over a dying Black cat in a sunlit room, Ukiyo-e”

Denoising

With Stable Diffusion’s mix guidance feature, we can generate impossible scenes.

Combining a frog and a rabbit in equal ratio

Mix guidance
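As a rough sketch of what mixing two prompts could look like, the snippet below takes a weighted average of two embedding vectors. This rests on an assumption: I am treating mix guidance as a linear blend of the two prompts' text embeddings, and `mix_embeddings`, `mix_factor`, and the tiny four-number vectors are hypothetical stand-ins, not the real text encoder output.

```python
def mix_embeddings(emb_a, emb_b, mix_factor=0.5):
    """Blend two embeddings; mix_factor is the weight given to emb_a."""
    if not 0.0 <= mix_factor <= 1.0:
        raise ValueError("mix_factor must be between 0 and 1")
    if len(emb_a) != len(emb_b):
        raise ValueError("embeddings must have the same length")
    return [mix_factor * a + (1.0 - mix_factor) * b
            for a, b in zip(emb_a, emb_b)]


# Hypothetical toy embeddings; real ones are much larger tensors.
frog = [1.0, 0.0, 2.0, -1.0]
rabbit = [0.0, 1.0, 4.0, 1.0]

# Equal ratio, as in the frog and rabbit example.
mixed = mix_embeddings(frog, rabbit, 0.5)
print(mixed)
```

Moving `mix_factor` toward 1.0 would pull the result toward the first prompt, and toward 0.0 would favor the second.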

For more information, watch Computerphile’s video.