BYOL, but trainable on a single GPU

This is a Tensorflow implementation of the BYOL paper. It uses weight standardization and group norm to make it trainable on a single consumer GPU. While the performance is not comparable to that of the same architecture pretrained on ImageNet, it can be trained on CIFAR10 on a normal GPU (tested on RTX 2070S). Performances are reported below:

Performance on CIFAR10

After the self-supervised pretraining, the authors fine-tune the network on x% of the training data. The table below shows the classification accuracy. 50k images have been used for training/validation and 10k for testing.

1% Fine-tuned	47.8%
10% Fine-tuned	64.8%
100% Fine-tuned	76.3%

Papers:

BYOL: https://arxiv.org/abs/2006.07733
BYOL without batch statistics: https://arxiv.org/abs/2010.10241
GroupNorm: https://arxiv.org/abs/1803.08494
Weight standardization: https://arxiv.org/abs/1903.10520

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
DataGenerator_CIFAR10.py		DataGenerator_CIFAR10.py
LARS.py		LARS.py
Network.py		Network.py
README.md		README.md
ResNet_GN_WS.py		ResNet_GN_WS.py
byol_test.py		byol_test.py
byol_train.py		byol_train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BYOL, but trainable on a single GPU

Performance on CIFAR10

Papers:

About

Releases

Packages

Languages

NilsKeunecke/byol_single_gpu

Folders and files

Latest commit

History

Repository files navigation

BYOL, but trainable on a single GPU

Performance on CIFAR10

Papers:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages