k-diffusion

This repo is a work in progress (models may break on later versions, script options may change).

Multi-GPU and multi-node training is supported with Hugging Face Accelerate. You can configure Accelerate by running:

$ accelerate config

on all nodes, then running:

$ accelerate launch train.py --train-set LOCATION_OF_TRAINING_SET --config CONFIG_FILE

on all nodes.

Enhancements:

This repo contains a deterministic 4th order linear multistep sampler (comparable to PLMS) that produces higher quality outputs for a given number of forward passes through the model than the deterministic 2nd order Heun sampler (Algorithm 1) in Karras et al.
This repo supports CLIP guided sampling from unconditional diffusion models (see sample_clip_guided.py). The 2nd order stochastic sampler (Algorithm 2) is particularly good for CLIP guided diffusion and is used by default for it.

The FID and KID calculation currently uses the torchvision InceptionV3 which makes it not comparable to FID/KID values that are reported in the literature.

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
configs		configs
k_diffusion		k_diffusion
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
make_grid.py		make_grid.py
requirements.txt		requirements.txt
sample.py		sample.py
sample_clip_guided.py		sample_clip_guided.py
train.py		train.py