
Implement complete pipeline #19

Closed
philswatton opened this issue Jan 30, 2023 · 8 comments

@philswatton
Contributor

Implement a pipeline that does the following:

  • Takes some configuration as input (e.g. seed, drop percentages, transforms, etc.)
  • Computes similarity metrics
  • Trains networks on both models
  • Trains transfer attacks (on both?)
  • Computes attack success metrics
  • Stores metrics computed

It may be the case that portions of this pipeline are best kept separate from one another (or run as partially separate pipelines). For example, we will probably want to be training networks and attacks while still implementing the similarity metrics.
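The stages above can be sketched as a sequence of steps driven by a single config. This is a minimal illustrative sketch only: `PipelineConfig`, `run_pipeline`, and the stub stage functions are hypothetical names, not the project's actual API, and each stub stands in for a real implementation.

```python
from dataclasses import dataclass


@dataclass
class PipelineConfig:
    # Hypothetical config fields matching the inputs listed above.
    seed: int = 0
    drop_percentages: tuple = (0.0, 0.1)
    transforms: tuple = ()


def run_pipeline(config, steps):
    """Run each named stage in order, collecting the metrics it returns."""
    results = {}
    for name, step in steps:
        results[name] = step(config)
    return results


# Stub stages standing in for the real implementations.
def compute_similarity(cfg):
    return {"otdd": 0.0}


def train_models(cfg):
    return {"acc_a": 0.0, "acc_b": 0.0}


def train_transfer_attacks(cfg):
    return {"attack_trained": True}


def attack_success_metrics(cfg):
    return {"transfer_rate": 0.0}


if __name__ == "__main__":
    cfg = PipelineConfig(seed=42)
    metrics = run_pipeline(cfg, [
        ("similarity", compute_similarity),
        ("models", train_models),
        ("attacks", train_transfer_attacks),
        ("attack_metrics", attack_success_metrics),
    ])
    # Stored metrics: one entry per stage.
    print(sorted(metrics))
```

Splitting the `steps` list is what makes the "partially separate pipelines" option cheap: a metrics-only run and a models-only run are just different step lists over the same config.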

This was referenced Feb 15, 2023
@philswatton philswatton self-assigned this Feb 16, 2023
@philswatton
Contributor Author

A general note for how we want this to work:

  • Several configs for different components of the experiment
    • Dataset config
      • Controls alterations between A and B, and input seed
      • We'll need to specify many datasets, while creating some (3-5?) with the same configuration but with a different seed
      • Open question: do we take sets of options (dropping, transform1, transform2) and extract combinations of alterations, or should we specify the experiments we want to perform in advance?
      • The datasets will be created separately in the similarity measurement pipeline and in the model + attack pipeline
    • Metric config
      • Which similarity measures we want to use
      • Initially constant across datasets. If we start looking into e.g. different labels or different datasets, we'll need to revisit that constancy, as some measures will no longer be appropriate, or will only be appropriate under certain conditions (e.g. PAD requires the same features; OT and OTDD will require the Gromov-Wasserstein distance instead of the Wasserstein distance if the features are not the same)
    • Model config
      • Unsure if required, but if we start relaxing the extent to which models are the same we'll need to look into this
  • We'll need functions for handling the configs above
  • We'll also need two scripts to handle everything:
    • A metrics script that produces the datasets, computes the distance measures, and stores the results
    • A models script that produces the datasets, farms out model training for each A and B dataset, then transfers an attack from A to B and from B to A, computes the attack success metrics, and stores the results
  • Since we'll have two sets of results stored, we'll also need to make sure we can join up the two results in a third script
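One way to resolve the open question above is to expand a grid of alteration specs into concrete dataset configs, crossing each spec with several seeds so that identical setups differ only by seed (the 3-5 replicates mentioned above). A minimal sketch, assuming hypothetical names throughout:

```python
from itertools import product


def expand_dataset_configs(alterations, n_seeds=3, base_seed=0):
    """Cross each alteration spec with n_seeds seeds.

    Each returned dict is one concrete dataset config: the same
    alteration repeated under different seeds gives the replicates.
    """
    configs = []
    for alteration, rep in product(alterations, range(n_seeds)):
        configs.append({"alteration": alteration, "seed": base_seed + rep})
    return configs


# Two alteration specs crossed with three seeds each.
alterations = [{"drop": 0.1}, {"transform": "rotate"}]
configs = expand_dataset_configs(alterations, n_seeds=3)
print(len(configs))  # 2 alterations x 3 seeds = 6 configs
```

Because the metrics script and the models script both need the same datasets, generating them from this shared expansion (rather than by hand in each script) keeps the two pipelines in sync and makes the third join-up script a simple merge on the config fields.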

This was referenced Feb 23, 2023
@philswatton
Contributor Author

With #29 this is mostly done. The two main tasks remaining are:

@philswatton philswatton mentioned this issue Mar 3, 2023
@lannelin
Contributor

[from meeting]
we are running training on HPC
where are we running metric calculations?
where are we running attacks?

@philswatton
Contributor Author

#17 and #18 are now done. #40 has been opened to deal with adding attack scripts to the pipeline. We also want to work out where in the pipeline we are computing the similarity metrics (as above).

@philswatton
Contributor Author

#38 is now done, meaning we're free to start doing experiment groups with transforms.

Still to go is:

  • Adding transfer attacks to HPC (#40)
  • Working out where to compute the similarity metrics (and possibly whether we should also log them to wandb)

Not pipeline work, but also necessary:

Optimising the training regime (#26). It's not required for a full pipeline (and thus not required for this issue), but it is relevant to making good use of one.

@philswatton
Contributor Author

Metrics calculation location opened as #42.

@philswatton
Contributor Author

With #40 done, #42 is the last piece of pipeline work remaining. Will be looking at #26 before that.

@philswatton
Contributor Author

With #42 done, this issue is now finished. I've opened #57 to cover the need to actually use the pipeline to produce the final results.
