[ENH] Minimum Noise Fraction denoising preprocessor #729

borondics · 2024-07-19T15:28:23Z

Added a new denoising method with the help of SOLARIS colleagues.

The paper it is based on is this.

orangecontrib/spectroscopy/preprocess/__init__.py

markotoplak · 2024-07-19T15:50:35Z

Also add the preprocessor to test_preprocess.PREPROCESSORS_INDEPENDENT_SAMPLES for torture tests.

Also, how did you ensure that the code in transform does what you want? If you compared it with existing implementations, add a test that encompasses that comparison.

orangecontrib/spectroscopy/preprocess/__init__.py

borondics · 2024-07-19T15:57:09Z

Thanks for the suggestions!

I did both.

However, should we also add PCADenoising to test_preprocess.PREPROCESSORS_INDEPENDENT_SAMPLES? It wasn't there.

For the test, I ran a small script written by colleagues on iris and used the values it calculated as the expected numbers. I will also verify this on a spectral dataset.

One question: PyLint complains about MNFDenoising.sd, which seems unnecessary, but it is also in PCADenoising... Should we remove both?

borondics · 2024-07-19T16:00:06Z

Ha! The new tests didn't pass. :D

markotoplak · 2024-07-19T16:04:59Z

Sorry, I was wrong: this is not an independent-samples preprocessor, just like PCA is not. I should have said PREPROCESSORS_GROUPS_OF_SAMPLES.

borondics · 2024-07-19T16:19:32Z

Moved it but there is still some problem. Possibly not only with the new addition...

stuart-cls · 2024-07-19T16:43:42Z

@borondics The peakfit failures are due to #720 which should be resolved as soon as lmfit 1.3.2 is released.

markotoplak · 2024-07-22T12:42:29Z

@borondics, please rebase to master so that tests will (mostly) work.

borondics · 2024-07-22T12:53:52Z

The MNF-related tests are passing on my computer, but I can do this in various ways. @markotoplak, right now I am using a try and raising errors while still returning the original data if the MNF fails. What do you say?

borondics · 2024-07-22T15:33:54Z

OK, I think now the MNF tests are also passing here.

markotoplak · 2024-07-23T11:35:21Z

@borondics, some tests fail. Your code cannot handle unknowns. You probably need to extend CommonDomainOrderUnknowns instead of CommonDomain, and that should be it. This was not needed for PCA because Orange's PCA handles them.

But if I think about it, even PCA denoising should use that one, because the interpolation it uses is better for spectra.
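The general idea of interpolating unknowns along the spectrum before running a transform that cannot handle them can be sketched as follows. This is only an illustration in plain NumPy, not the code of CommonDomainOrderUnknowns; the function names `interpolate_unknowns` and `apply_with_unknowns` are hypothetical, and the sketch assumes the wavenumbers are already sorted and that the transform returns an array of the same shape.

```python
import numpy as np


def interpolate_unknowns(x, y):
    """Fill NaNs in one spectrum y by linear interpolation over positions x.

    Assumes x is sorted in increasing order, as np.interp requires.
    Unknowns at the edges take the nearest known value.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    known = ~np.isnan(y)
    if not known.any():
        return y.copy()  # nothing to interpolate from; the row stays all-NaN
    return np.interp(x, x[known], y[known])


def apply_with_unknowns(transform, x, Y):
    """Run transform on interpolated data, then mark the unknowns again."""
    Y = np.asarray(Y, dtype=float)
    unknown = np.isnan(Y)
    filled = np.vstack([interpolate_unknowns(x, row) for row in Y])
    out = np.array(transform(filled), dtype=float)  # copy so it is writable
    out[unknown] = np.nan  # positions that were unknown stay unknown
    return out


# Made-up example: one spectrum with a single unknown value.
x = np.array([0.0, 1.0, 2.0, 3.0])
Y = np.array([[0.0, np.nan, 2.0, 3.0]])
result = apply_with_unknowns(lambda A: 2 * A, x, Y)
```

Interpolating along the wavenumber axis uses the fact that neighbouring points of a spectrum are strongly correlated, which is presumably why it suits spectra better than a generic imputation.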

markotoplak · 2024-07-23T11:36:51Z

Another test fails because results of this preprocessor are so unstable that small changes in data produce very different values. Is that expected? If so, we could ignore it in that test.

BTW, this is a sign of high instability of the method and makes its usability questionable.

markotoplak · 2024-07-23T11:39:50Z

That "testing for nans" commit is probably unnecessary if you extend the correct class and, anyway, it should not be done that way. If you ever need something like this, make the try/except cover the least code possible.
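A minimal sketch of what "cover the least code possible" can look like. The functions `risky_transform` and `denoise` are hypothetical stand-ins, not the code from this PR: only the single call that is expected to fail sits inside the try, so errors from the surrounding code are not silently swallowed.

```python
import math


def risky_transform(values):
    """Hypothetical stand-in for the one step that is expected to fail."""
    if any(math.isnan(v) for v in values):
        raise ValueError("input contains NaN")
    return [v * 2.0 for v in values]


def denoise(values):
    # Preparation stays outside the try: a bug here should surface, not be hidden.
    prepared = [float(v) for v in values]
    try:
        transformed = risky_transform(prepared)  # the only guarded call
    except ValueError:
        return prepared  # fall back to the unmodified data
    # Post-processing also stays outside the try.
    return [v + 1.0 for v in transformed]


ok = denoise([1, 2])
fallback = denoise([1, float("nan")])
```

Note that `denoise(["x"])` still raises ValueError, because the failing `float("x")` conversion is outside the guarded block. A try wrapped around the whole function body would have hidden that error behind the fallback.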

markotoplak · 2024-07-24T08:48:37Z

@borondics, I rebased and made this robust to unknown values as I pointed out above.

Now only one test is failing.

I also find the original (and current) computation code suspicious. When computing the N matrix, all differences were computed and then there was an additional vector of zeros. This seems like a bug, and your tests also pass if I say N = diffs. How did you verify that the code is correct?

I have some additional suspicions due to the non-passing test. :)
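The N matrix question above can be illustrated with a small NumPy sketch. It rests on assumptions that are not confirmed by this thread: that the differences are taken between neighbouring rows, and that N enters the computation only through the product N^T N (possibly divided by the number of rows). Under those assumptions, an appended row of zeros either changes nothing or only rescales the matrix by a constant, which would be one possible reason why the tests pass either way. The data is random and made up.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(6, 4))  # made-up data: 6 samples, 4 features

# Differences between neighbouring rows, shape (5, 4).
diffs = np.diff(X, axis=0)
# The same differences with one extra row of zeros, shape (6, 4).
padded = np.vstack([diffs, np.zeros((1, X.shape[1]))])

# Unnormalized scatter matrix: the zero row contributes nothing to the sum.
same_scatter = np.allclose(diffs.T @ diffs, padded.T @ padded)

# Normalized by the number of rows, the matrices differ by a constant factor,
# so their eigenvectors are the same and eigenvalues are only rescaled.
cov_diffs = diffs.T @ diffs / len(diffs)
cov_padded = padded.T @ padded / len(padded)
factor = len(padded) / len(diffs)  # 6 / 5
same_up_to_scale = np.allclose(cov_diffs, cov_padded * factor)
```

If the zeros were instead appended along the other axis, or if N were used in some other way, this argument would not apply and the extra vector could change the result.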

markotoplak · 2024-08-20T12:54:10Z

@borondics, I am merging this assuming that the failed test is not problematic (I just ignored it).

Please consider again whether the computation is correct. The method is sensitive to small perturbations in the data, and that sensitivity is problematic even on bigger data, like the whole collagen dataset.

Even if the implementation is correct, I would not use the method on data like collagen because of its sensitivity to noise (if the method is meant to be applied to different kinds of data - perhaps already preprocessed in a certain way - it should be, of course, tested with that).

borondics requested review from markotoplak and stuart-cls July 19, 2024 15:28

markotoplak reviewed Jul 19, 2024

markotoplak reviewed Jul 19, 2024

borondics force-pushed the mnf branch from f1e437e to 62dbc65 Compare July 22, 2024 15:27

markotoplak force-pushed the mnf branch from 62dbc65 to 6deb582 Compare July 24, 2024 08:32

markotoplak changed the title ~~Minimum Noise Fraction denoising~~ [ENH] Minimum Noise Fraction denoising preprocessor Jul 24, 2024

borondics and others added 4 commits August 20, 2024 14:06

Minimum Noise Fraction denoising

bd82683

MNF: handle strange data

6c161a2

MNF: use BaseEditorOrange

7c85c6c

MNF: vectorize difference computation

f77e42a

markotoplak force-pushed the mnf branch from a8c632e to 446b1fe Compare August 20, 2024 12:20

markotoplak added 2 commits August 20, 2024 14:22

efficiency

07b0c4e

disable failing test hoping that code is OK

f36b771

markotoplak force-pushed the mnf branch from 446b1fe to f36b771 Compare August 20, 2024 12:22

markotoplak merged commit 19470a4 into Quasars:master Aug 20, 2024
10 of 14 checks passed
