
343 streamline perf #344

Merged: 28 commits merged from 343-streamline-perf-and-tune into master on Nov 19, 2024

Conversation

evaham1 (Collaborator) commented Nov 13, 2024

No description provided.

evaham1 (Collaborator, Author) commented Nov 13, 2024

perf.assess.mixo.plsda() and perf.assess.mixo.splsda()

Have created perf.assess.mixo.plsda() and an identical, exported perf.assess.mixo.splsda(). These functions are essentially stripped-down versions of perf.mixo.plsda()/perf.mixo.splsda(): instead of looping across components 1 to ncomp, they only run performance assessment for ncomp.

Unit tests: created to ensure that the new perf.assess() functions give exactly the same result as perf() for the same data, context and ncomp. To achieve this, set.seed() had to be added inside the component for loop in the perf() function. The unit tests also cover running in series and in parallel.
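Why setting the seed inside the component loop matters can be shown with a minimal sketch. This is not mixOmics code — `assess_component`, `perf_all` and `perf_assess` are hypothetical stand-ins — but it demonstrates that re-seeding per component makes the final component's result identical whether or not the earlier components were looped over:

```python
import random

def assess_component(comp, seed):
    # Stand-in for one round of cross-validated performance assessment.
    # Re-seeding here fixes the fold assignment for this component,
    # independently of any components assessed before it.
    random.seed(seed + comp)
    return [random.random() for _ in range(3)]

def perf_all(ncomp, seed=42):
    # perf()-style behaviour: assess components 1..ncomp in a loop.
    return {comp: assess_component(comp, seed) for comp in range(1, ncomp + 1)}

def perf_assess(ncomp, seed=42):
    # perf.assess()-style behaviour: assess only the final component.
    return {ncomp: assess_component(ncomp, seed)}

# Because the seed is (re)set inside the loop body, the result for the
# final component matches the full loop's result for that component.
assert perf_all(4)[4] == perf_assess(4)[4]
```

Had the seed been set once before the loop instead, the random state reaching component `ncomp` would depend on how many components were assessed first, and the two results would diverge.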

Plotting: no plots are made for perf.assess() if validation = 'loo', or if validation = 'Mfold' with nrep = 1. Otherwise a plot is made with just one point on the x axis (at ncomp), e.g.
Screenshot 2024-11-13 at 2 22 46 pm
We need to decide whether this is an informative plot: if so, make sure it works without repeats; if not, simply note that such plots can only be generated when repeated CV is used.

evaham1 (Collaborator, Author) commented Nov 15, 2024

perf.assess.mixo.pls() and perf.assess.mixo.spls()

Have created perf.assess.mixo.pls() and an identical, exported perf.assess.mixo.spls(). These functions are essentially stripped-down versions of perf.mixo.pls()/perf.mixo.spls(). Ideally, to improve runtime, I would have removed the looping over component values 1:ncomp; however, after testing, this gave different results for the final error metrics. This is particularly the case for Q2, which is calculated using the RSS from component ncomp-1, but testing suggested it also holds for other error metrics, although the source of the component dependency is not clear (it could also be due to the seed setting within the loop, again something I played around with but couldn't fix in a non-loop manner). As the function is quite intricate, I decided to leave the loops as is, despite the inflated runtime, and simply subset the results to keep only the error metrics relating to the exact ncomp.
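The Q2 dependency can be made concrete. Using the standard PLS cross-validation formula Q2_h = 1 - PRESS_h / RSS_(h-1), the metric for component h needs the residual sum of squares of the *previous* component, so the per-component loop cannot simply be skipped. A small sketch with made-up PRESS/RSS values (not taken from mixOmics):

```python
# Hypothetical values: rss[h] is the residual sum of squares after
# fitting h components (rss[0] is the null, zero-component model);
# press[h] is the cross-validated PRESS for component h.
rss   = [100.0, 60.0, 40.0, 35.0]
press = [None,  70.0, 55.0, 42.0]

# Q2 for component h uses rss[h - 1], i.e. the fit of the PREVIOUS
# component -- computing Q2 for the final component alone is impossible
# without first computing the earlier components' RSS.
q2 = [1 - press[h] / rss[h - 1] for h in range(1, len(rss))]
```

With these numbers the final component's Q2 is 1 - 42/40 = -0.05, a quantity that cannot be obtained from the final component's fit in isolation.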

Unit tests: created to ensure that the new perf.assess() functions give exactly the same result as perf() for the same data, context and ncomp. Added additional testing for the different modes of pls and spls, and also checked the feature stability outputs of perf.spls() and perf.assess.spls().

Plotting: plotting from perf.assess() is similar to perf() for PLS objects, even when nrep = 1.
- nrep = 10: Screenshot 2024-11-15 at 3 08 38 pm
- nrep = 1: Screenshot 2024-11-15 at 3 09 44 pm
- validation = loo: Screenshot 2024-11-15 at 3 10 17 pm

evaham1 (Collaborator, Author) commented Nov 18, 2024

perf.assess.sgccda()

Have created perf.assess.sgccda(), built on perf.sgccda(). Ideally, to improve runtime, I would have removed the calculations over multiple components (done in this function via many lapply() calls), but due to the complexity and possible inter-dependency between components, I kept all of the code in the function and just added lines at the end that filter the results to retain only the information for the component used in the input model, whilst preserving the output data structure.
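The filter-at-the-end approach can be sketched as a small recursive subset that keeps only the final component's entry while preserving the nested output shape. This is an illustrative Python sketch with hypothetical metric names, not the mixOmics implementation:

```python
def keep_final_component(result, ncomp):
    # result maps metric name -> per-component values (lists indexed by
    # component, or nested dicts of the same shape). Keep only the entry
    # for the final component, but preserve the output structure.
    out = {}
    for metric, values in result.items():
        if isinstance(values, dict):
            out[metric] = keep_final_component(values, ncomp)
        else:
            out[metric] = values[ncomp - 1 : ncomp]  # 1-based component index
    return out

# Hypothetical full perf()-style output across 3 components:
full = {
    "error.rate": {"BER": [0.30, 0.22, 0.20], "overall": [0.28, 0.21, 0.19]},
    "auc": [0.80, 0.88, 0.91],
}
print(keep_final_component(full, 3))
# → {'error.rate': {'BER': [0.2], 'overall': [0.19]}, 'auc': [0.91]}
```

Keeping each value as a length-one list (rather than a scalar) is what lets downstream code that expects the perf()-style structure keep working unchanged.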

Unit tests: created to ensure that the new perf.assess.sgccda() gives exactly the same result as perf() for the same data, context and ncomp.

Plotting: plotting for block-plsda and block-splsda objects run with perf.assess() only works if nrep > 1 and validation = 'Mfold'.
Screenshot 2024-11-18 at 4 05 12 pm

evaham1 (Collaborator, Author) commented Nov 18, 2024

perf.assess.mint.plsda() and perf.assess.mint.splsda()

Have created perf.assess.mint.(s)plsda(), built on perf.mint.plsda(). Fixed the for loops so that metrics are only calculated for the single component corresponding to ncomp, although for AUC the extra data slots are still generated.
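Restricting the loop while still generating the extra slots might look like the following sketch. All names here are hypothetical illustrations of the pattern, not mixOmics code: the loop iterates over just the final component, but the AUC container keeps one slot per component:

```python
def assess_metrics(model, ncomp):
    # Illustrative sketch: compute metrics only for the final component,
    # but still allocate a slot per component in the AUC output,
    # mirroring how the extra AUC data slots are still generated.
    auc = [None] * ncomp              # slots for components 1..ncomp
    error_rate = {}
    for comp in [ncomp]:              # was: for comp in range(1, ncomp + 1)
        auc[comp - 1] = model["auc"][comp - 1]
        error_rate[comp] = model["error"][comp - 1]
    return {"auc": auc, "error.rate": error_rate}

# Hypothetical per-component metrics from a fitted 3-component model:
model = {"auc": [0.7, 0.8, 0.9], "error": [0.3, 0.2, 0.1]}
out = assess_metrics(model, 3)
# → out["auc"] == [None, None, 0.9]; out["error.rate"] == {3: 0.1}
```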

Unit tests: created to ensure that the new perf.assess.mint.plsda() gives exactly the same result as perf() for the same data, context and ncomp.

Plotting: plotting does not work for perf.assess.mint.plsda(), likely due to the lack of repeats in the LOO CV method; this is consistent with which configurations allow plotting in the other perf.assess() functions.

@evaham1 evaham1 changed the title 343 streamline perf and tune 343 streamline perf Nov 18, 2024
@evaham1 evaham1 self-assigned this Nov 19, 2024
evaham1 (Collaborator, Author) commented Nov 19, 2024

After discussing with KA, it makes sense to remove the plotting functionality for these objects, as the plots are not informative when they have nothing to compare against. Instead, simply keep the performance metrics for the model in question, which can serve as a simple readout of the final model's performance.

  • perf.assess.plsda()
  • perf.assess.pls()
  • perf.assess.sgccda()
  • perf.assess.mint.plsda()

Also did a couple more checks to make sure the performance metrics output by perf.assess() are identical to those output by perf().

@evaham1 evaham1 merged commit 06d7d92 into master Nov 19, 2024
10 of 11 checks passed
@evaham1 evaham1 deleted the 343-streamline-perf-and-tune branch November 19, 2024 06:09
Successfully merging this pull request may close these issues.

Refactoring: streamline perf() and tune() functions to do performance assessment on just input model