You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Goal of this educational repository is to provide a self-contained, minimalistic implementation of diffusion models using Pytorch.
Many implementations of diffusion models can be a bit overwhelming. Here, superminddpm : under 200 lines of code, fully self contained implementation of DDPM with Pytorch is a good starting point for anyone who wants to get started with Denoising Diffusion Models, without having to spend time on the details.
Simply:
$ python superminddpm.py
Above script is self-contained. (Of course, you need to have pytorch and torchvision installed. Latest version should suffice. We do not use any cutting edge features.)
If you want to use the bit more refactored code, that runs CIFAR10 dataset:
$ python train_cifar10.py
Above result took about 2 hours of training on single 3090 GPU. Top 8 images are generated, bottom 8 are ground truth.
Here is another example, trained on 100 epochs (about 1.5 hours)
Currently has:
Tiny implementation of DDPM
MNIST, CIFAR dataset.
Simple unet structure. + Simple Time embeddings.
CelebA dataset.
TODOS
DDIM
Classifier Guidance
Multimodality
Updates!
Using more parameter yields better result for MNIST.
More comments in superminddpm.py
About
Self-contained, minimalistic implementation of diffusion models with Pytorch.