Code for thesis: "Mixup and Metric Learning in Out-of-Distribution Detection".
The thesis focuses on two methods for training models that are more robust to out-of-distribution (OoD) examples. The model and dataset setup is based on the MixOE implementation, and the code for this setup is also taken from MixOE. The OoD testing settings are divided between fine- and coarse-grained examples: the former are drawn from the same domain as the inliers, while the latter are completely different from the inliers. There are two kinds of tasks we are trying to solve:
- Identification of important MixOE ingredients:
- we use seven different Mixup variants that target different components of Mixup, including:
- layer, where Mixup is applied (Manifold MixOE, Align MixOE)
- the role of the label derived from uniform probability distribution (Mixup with labels)
- loss term (MixOE with and without OE loss)
- the confidence of outliers used (MixOE with five lowest confidence outliers)
- the distance between the outliers and inliers that are being mixed (MixOE with KNN)
- the role of outliers in the training (Mixup with labels, original input Mixup without outliers, Mixup with a noise image)
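For reference, the mixing step that all of these variants build on can be sketched as follows. This is a minimal NumPy sketch of pixel-space Mixup between inliers and auxiliary outliers, with the one-hot label softened toward the uniform distribution by the same mixing ratio, as in MixOE; the function name and signature are illustrative, not this repo's API:

```python
import numpy as np

def mixoe_batch(inliers, labels, outliers, num_classes, alpha=1.0, rng=None):
    """Illustrative sketch of the MixOE mixing step: blend each inlier
    with an auxiliary outlier and soften its one-hot label toward the
    uniform distribution by the same coefficient."""
    rng = rng or np.random.default_rng()
    lam = rng.beta(alpha, alpha)                      # Mixup coefficient
    mixed_x = lam * inliers + (1.0 - lam) * outliers  # pixel-space mix
    one_hot = np.eye(num_classes)[labels]
    uniform = np.full((len(labels), num_classes), 1.0 / num_classes)
    mixed_y = lam * one_hot + (1.0 - lam) * uniform   # soft target
    return mixed_x, mixed_y
```

The variants above change where this mix happens (input vs. hidden layer), what is mixed (outliers, other inliers, noise), and how the resulting soft label enters the loss.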
- investigation of the impact of the size of the auxiliary outlier dataset
- Metric learning in the form of triplet loss:
- different types of triplets are used:
- I1; I1; I2
- I1; I1; O
- I1,O; I1; I2
- I1,O; I1; O,
where:
- I1 is an inlier from class 1.
- I2 is an inlier from class 2.
- O is an outlier.
- I1,O is a mixed pair using Mixup
We also use a combination of the metric learning approach and MixOE.
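The triplet loss underlying all of these combinations can be sketched generically: pull the anchor embedding toward the positive and push it away from the negative by at least a margin. This is the standard formulation on embedding vectors, not the thesis's exact training code:

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard margin-based triplet loss on batches of embedding
    vectors (shape: [batch, dim]); a generic sketch."""
    d_pos = np.linalg.norm(anchor - positive, axis=-1)  # anchor-positive distance
    d_neg = np.linalg.norm(anchor - negative, axis=-1)  # anchor-negative distance
    return np.maximum(d_pos - d_neg + margin, 0.0).mean()
```

For example, in the (I1; I1; O) triplet the anchor and positive are embeddings of class-1 inliers and the negative is an outlier embedding, so the loss pushes outliers away from the inlier cluster.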
git clone https://github.com/Oleksandra2020/metric_mix_oe
pip install -r requirements.txt
Follow the setup described in the MixOE implementation. Five datasets are used in total: Car, Bird, Butterfly, Aircraft, and WebVision 1.0.
Scripts
Because results are collected differently for different types of experiments, there are three Python scripts for script generation:
- Code for the seven Mixup variants along with MixOE uses wandb to save results. For these, generate_scripts_mixoe.py is used.
- Code for varying the number of outliers and their classes in the auxiliary outlier dataset uses CSV files to store the results, thus generate_scripts_outl.py is used.
- Code for different types of triplets uses CSV files to store the results, thus generate_scripts_triplet.py is used.
Mixup variants:
- train_align.py - Align MixOE
- train_knn.py - MixOE with KNN
- train_label.py - Mixup with labels
- train_mixoe_inliers.py - input Mixup without outliers
- train_min_conf.py - MixOE with the five lowest-confidence outliers
- train_mixoe_manifold.py - Manifold MixOE
- train_noise.py - Mixup with a noise image
Auxiliary outlier dataset perturbations
train_mixoe_outl.py takes the outlier_num and outlier_classes parameters to sample different numbers of outliers and to compose auxiliary outlier datasets of varying diversity.
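The sampling controlled by these two parameters can be sketched as follows: first pick a subset of outlier classes, then draw a fixed number of images from them. The parameter names mirror the script's flags, but the helper itself is illustrative, not the script's actual code:

```python
import random

def subsample_outliers(paths_by_class, outlier_classes, outlier_num, seed=0):
    """Illustrative sketch: build a reduced auxiliary outlier set by
    sampling `outlier_classes` classes, then `outlier_num` images
    from the pooled images of those classes."""
    rng = random.Random(seed)
    classes = rng.sample(sorted(paths_by_class), k=outlier_classes)
    pool = [p for c in classes for p in paths_by_class[c]]
    return rng.sample(pool, k=min(outlier_num, len(pool)))
```

Varying outlier_classes changes the diversity of the auxiliary set, while outlier_num changes its size independently.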
Triplet combinations
We present four different triplet combinations:
- (I1; I1; I2) - train_i1_i1_i2.py
- (I1; I1; O) - train_i1_i1_o.py
- (I1,O; I1; I2) - train_io_i1_i2_rand.py
- (I1,O; I1; O) - train_io_i1_o_rand.py
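The mixed anchors in the last two combinations are formed with Mixup. A minimal sketch of building the (I1,O; I1; I2) triplet, where the function name and signature are illustrative:

```python
import numpy as np

def make_mixed_anchor_triplet(i1_a, i1_b, i2, outlier, alpha=1.0, rng=None):
    """Illustrative sketch of the (I1,O; I1; I2) triplet: the anchor is
    a Mixup blend of a class-1 inlier with an outlier, the positive is
    another class-1 inlier, and the negative is a class-2 inlier."""
    rng = rng or np.random.default_rng()
    lam = rng.beta(alpha, alpha)                 # Mixup coefficient
    anchor = lam * i1_a + (1.0 - lam) * outlier  # I1,O mixed pair
    return anchor, i1_b, i2                      # (anchor, positive, negative)
```

The (I1,O; I1; O) variant differs only in its negative, which is the outlier itself rather than a class-2 inlier.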
Plotting helpers
- create_bar_plot.py creates bar plots comparing MixOE, the baseline, and Mixup with labels.
- merge_csv_files_outl.py creates plots of the results for varying numbers of outliers and outlier classes.
- merge_csv_files_metric.py creates plots of the results for the triplet combinations described above.
Script generation
python generate_scripts_mixoe.py
Experiments
To start running an experiment:
cd ./scripts/align/aircraft
bash train_0.sh
Do not forget to adjust the data_dir parameter when running the code, as well as the location where your results are saved.
We find that:
- Mixup with labels is as effective in the fine-grained setup as the complete MixOE.
- a large auxiliary outlier dataset is important only in the coarse-grained setup.
- the (I1; I1; I2) triplet combined with MixOE yields the largest improvement among the triplets over the previous best coarse-grained result, MixOE.
- the metric learning approach does not improve the fine-grained setting, neither on its own nor in combination with MixOE.
Green colour indicates triplet loss with standard cross-entropy, and blue is with MixOE.
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.