Text to Image Diffusion AI Model from scratch - Explained one line of code at a time!

Описание к видео Text to Image Diffusion AI Model from scratch - Explained one line of code at a time!

In just 15 points, we talk about everything you need to know about Generative AI Diffusion models - from the basics to Latent Diffusion Models (LDMs) and Text-to-Image conditional Latent diffusion models. I also train a diffusion model with Pytorch on my laptop to demonstrate how it all works.

To access the full code repo && 15 minute code walkthrough video && 4000+ word script && 15+ animations && powerpoint slides used in this video (as well as others on my channel), please consider supporting us on Patreon! It helps the channel massively, so thanks for considering.

Patreon link:   / neuralbreakdownwithavb  

#diffusion #ai #machinelearning #generativeai

Related videos:

So you think you know Text to Video Diffusion models?
   • So you think you know Text to Video D...  

Attention Series:    • Neural Attention - This simple exampl...  

Latent Space:    • Visualizing the Latent Space: This vi...  

CNNs:    • But what does a trained Convolution N...  

U-Net:    • Coding Image Segmentation with UNet f...  

NLP History:    • 10 years of NLP history explained in ...  

Multimodal Models:    • Multimodal AI from First Principles -...  

Papers:
DDPM: https://arxiv.org/pdf/2006.11239
CLIP: https://arxiv.org/pdf/2103.00020
LDMs: https://arxiv.org/pdf/2112.10752

Dataset:
You can search for CelebA dataset on Kaggle.

https://www.kaggle.com/datasets/jessi...


Timestamps:
0:00 - Intro
1:40 - 1
2:43 - 2
3:24 - 3
5:59 - 4
8:09 - 5
9:49 - 6
11:07 - 7
11:55 - 8
14:11 - 9
16:15 - 10
18:49- 11
19:48 - 12
21:03 - 13
22:07 - 14
23:27 - 15

Комментарии

Информация по комментариям в разработке