
JP-1887: refactor OLS ramp fitting. #5872

Merged

@jdavies-st merged 13 commits into spacetelescope:master from jp_1887_refac_4 on Apr 6, 2021

Conversation

@kmacdonald-stsci (Contributor) commented Mar 16, 2021

Refactored the OLS ramp fitting functions to reduce the number of function parameters and return values, simplifying the code to make it easier to read.

Addresses #5701 / JP-1887
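
For illustration, a minimal sketch of the kind of consolidation this refactor describes, bundling related inputs into one container so helpers take a single object instead of long parameter lists. The names here (RampData, fit_ramps_before/after) and the field choices are hypothetical, not the PR's actual code:

    from dataclasses import dataclass
    import numpy as np

    @dataclass
    class RampData:
        """Hypothetical container bundling arrays and metadata that were
        previously threaded through the OLS fitting functions one by one."""
        data: np.ndarray      # 4-D science cube (integrations, groups, rows, cols)
        err: np.ndarray       # matching error array
        groupdq: np.ndarray   # per-group data-quality flags
        pixeldq: np.ndarray   # per-pixel data-quality flags
        frame_time: float     # seconds per frame
        group_time: float     # seconds per group

    # Before: every helper repeats the same long parameter list.
    def fit_ramps_before(data, err, groupdq, pixeldq, frame_time, group_time,
                         readnoise_2d, gain_2d, weighting, save_opt):
        ...

    # After: one container plus the few genuinely independent options.
    def fit_ramps_after(ramp_data: RampData, readnoise_2d, gain_2d,
                        weighting, save_opt):
        ...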

@codecov codecov bot commented Mar 16, 2021

Codecov Report

Merging #5872 (ccf3fce) into master (59039ce) will increase coverage by 0.28%.
The diff coverage is 82.71%.


@@            Coverage Diff             @@
##           master    #5872      +/-   ##
==========================================
+ Coverage   77.90%   78.19%   +0.28%     
==========================================
  Files         399      399              
  Lines       35209    36118     +909     
==========================================
+ Hits        27430    28241     +811     
- Misses       7779     7877      +98     
Flag     Coverage Δ            *Carryforward flag
nightly  77.90% <77.53%> (ø)   Carried forward from 59039ce
unit     55.98% <78.78%> (?)

*This pull request uses carry-forward flags.

Impacted Files Coverage Δ
jwst/ramp_fitting/ramp_fit.py 83.57% <82.64%> (+2.97%) ⬆️
jwst/ramp_fitting/utils.py 97.15% <100.00%> (+0.54%) ⬆️
jwst/associations/asn_edit.py 66.66% <0.00%> (-0.75%) ⬇️
jwst/associations/lib/process_list.py 73.49% <0.00%> (-0.59%) ⬇️
jwst/fits_generator/template.py 16.98% <0.00%> (-0.27%) ⬇️
jwst/timeconversion/time_conversion.py 10.96% <0.00%> (-0.08%) ⬇️
jwst/datamodels/make_header.py 91.38% <0.00%> (+0.02%) ⬆️
jwst/associations/lib/product_utils.py 94.82% <0.00%> (+0.09%) ⬆️
jwst/regtest/conftest.py 67.46% <0.00%> (+1.20%) ⬆️
... and 2 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 59039ce...ccf3fce.

@jdavies-st changed the title from "Jp 1887 refactor OLS ramp fitting." to "JP-1887: refactor OLS ramp fitting." Mar 16, 2021
@hbushouse added this to the Build 7.8 milestone Mar 17, 2021
if opt_model is not None

save_opt : boolean
    calculate optional fitting results
Collaborator:

Minor nit in wording, but this should perhaps say "save optional fitting results", because the results have in fact already been calculated. All we're doing here is putting them into an optional output product.

Contributor Author:

I changed this wording.

if dropframes1 is None:    # set to default if missing
    dropframes1 = 0
    log.debug('Missing keyword DRPFRMS1, so setting to default value of 0')
input_model.data = data
Collaborator:

Do we have a good feel for whether stuffing these individual data arrays back into a new data model is going to lead to a big jump in memory usage for the step as a whole? We already had an input data model to the step, from which these data arrays were extracted, and now we're putting them back into another data model. I know it's cleaner that way in terms of programming, but this step is prone to memory issues, due to all the very large arrays being used. So I'm worried about creating what amounts to another copy of the data. Would it be possible to just pass in the original data model or does that not work because of the slicing of data needed for multiprocessing?

Contributor Author:

This is a wrapper I used because I changed the function prototype for ols_ramp_fit to ols_ramp_fit_single. This is an intermediate fix to continue refactoring the OLS code without having to refactor the multi-processing code right now, also. Ultimately, this unpacking and repacking will be refactored out.
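
As a hedged sketch of the wrapper pattern described above (the parameter list and datamodel usage are assumptions for illustration, not the PR's exact code; ols_ramp_fit_single is the refactored function named in this comment), the array-based entry point kept for the multiprocessing path repacks its slices into a model and delegates:

    from jwst import datamodels

    def ols_ramp_fit(data, err, groupdq, pixeldq, buffsize, save_opt,
                     readnoise_2d, gain_2d, weighting):
        # Repack the sliced arrays (as delivered by the multiprocessing
        # path) into a data model, then call the refactored entry point.
        input_model = datamodels.RampModel()
        input_model.data = data
        input_model.err = err
        input_model.groupdq = groupdq
        input_model.pixeldq = pixeldq
        return ols_ramp_fit_single(input_model, buffsize, save_opt,
                                   readnoise_2d, gain_2d, weighting)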

err : ndarray
    Error array

int_times : None
    Not used
Collaborator:

So if it's not used, why not remove it?

Contributor Author:

I didn't remove it because I wasn't sure why it was here to begin with. I will remove it if I determine it's not actually used or can easily be gotten from the classes.


orig_cubeshape : (int, int, int) tuple
    Original shape of input dataset

orig_nreads : int
orig_ngroups : int
    Original number of reads
Collaborator:

Should modify this comment here too, to change "reads" to "groups"

Contributor Author:

Thank you. Since our earlier discussion I have been removing references to "read" that really mean "group" in JWST. I changed this documentation.

@jdavies-st (Collaborator) left a comment:

The changes look good to me, but they produce some significant floating-point differences on a very small number of science pixels in at least one of our regression tests, test_nirspec_detector1[rate]. Here I ran all the regression tests that run ramp fitting and produce a rate file:

https://plwishmaster.stsci.edu:8081/job/RT/job/jdavies-dev/228/

Ignore the keyword header failures; that's just because this branch is not rebased onto the most recent master.

When I run all of these tests locally with pytest jwst/regtest -k rate, I see the same differences, and I actually see slightly different floating-point differences in another test:

_______________________________________________ test_verify_detector1[rate] ________________________________________________

run_detector1 = None
rtdata_module = {'input': '/private/tmp/scratch/test_nirspec_verify_rtdata_module0/nrs_verify_nrs1_uncal.fits',
 'input_remote': 'jwst...h/nrs_verify_nrs1_rate.fits',
 'truth_remote': 'jwst-pipeline/dev/truth/test_nirspec_verify/nrs_verify_nrs1_rate.fits'}
fitsdiff_default_kwargs = {'atol': 1e-07, 'ignore_fields': ['DATE', 'CAL_VER', 'CAL_VCS', 'CRDS_VER', 'CRDS_CTX', 'NAXIS1', ...], 'ignore_hdus': ['ASDF'], 'ignore_keywords': ['DATE', 'CAL_VER', 'CAL_VCS', 'CRDS_VER', 'CRDS_CTX', 'NAXIS1', ...], ...}
suffix = 'rate'

    @pytest.mark.bigdata
    @pytest.mark.parametrize(
        'suffix', [
            'dq_init', 'saturation', 'superbias', 'refpix', 'linearity',
            'dark_current', 'jump', 'rate',
        ])
    def test_verify_detector1(run_detector1, rtdata_module, fitsdiff_default_kwargs, suffix):
        """Test results of the detector1 and image2 processing"""
        rtdata = rtdata_module
        rtdata.input = replace_suffix(ROOT, 'uncal') + '.fits'
        output = replace_suffix(ROOT, suffix) + '.fits'
        rtdata.output = output
        rtdata.get_truth('truth/test_nirspec_verify/' + output)
    
        diff = FITSDiff(rtdata.output, rtdata.truth, **fitsdiff_default_kwargs)
>       assert diff.identical, diff.report()
E       AssertionError: 
E          fitsdiff: 4.2
E          a: /private/tmp/scratch/test_nirspec_verify_rtdata_module0/nrs_verify_nrs1_rate.fits
E          b: /private/tmp/scratch/test_nirspec_verify_rtdata_module0/truth/nrs_verify_nrs1_rate.fits
E          HDU(s) not to be compared:
E           ASDF
E          Keyword(s) not to be compared:
E           CAL_VCS CAL_VER CRDS_CTX CRDS_VER DATE NAXIS1 TFORM*
E          Table column(s) not to be compared:
E           CAL_VCS CAL_VER CRDS_CTX CRDS_VER DATE NAXIS1 TFORM*
E          Maximum number of different data values to be reported: 10
E          Relative tolerance: 1e-05, Absolute tolerance: 1e-07
E         
E         Extension HDU 1 (SCI, 1):
E         
E            Data contains differences:
E              Data differs at [619, 5]:
E                 a> 0.30653778
E                 b> 0.30652106
E              Data differs at [946, 13]:
E                 a> 0.5124498
E                 b> 0.51245815
E              Data differs at [432, 14]:
E                 a> 1.6663404
E                  ?      - ^
E                 b> 1.6664746
E                  ?       ^ +
E              Data differs at [785, 14]:
E                 a> 2.4191005
E                  ?       ^^
E                 b> 2.4191675
E                  ?       ^^
E              Data differs at [163, 24]:
E                 a> -0.904209
E                 b> -0.9047445
E              Data differs at [506, 56]:
E                 a> -1.5337013
E                 b> -1.5331658
E              Data differs at [736, 56]:
E                 a> 1.3189616
E                 b> 1.3194972
E              Data differs at [822, 59]:
E                 a> -1.0683262
E                  ?         ^^
E                 b> -1.0683428
E                  ?        + ^
E              Data differs at [475, 65]:
E                 a> 0.19733553
E                 b> 0.19734389
E              Data differs at [432, 74]:
E                 a> 4.538215
E                  ?       ^^
E                 b> 4.538282
E                  ?       ^^
E              ...
E              227 different pixels found (0.01% different).
E         
E         Extension HDU 2 (ERR, 1):
E         
E            Data contains differences:
E              Data differs at [736, 56]:
E                 a> 0.7799795
E                 b> 0.7800025
E              Data differs at [654, 365]:
E                 a> 0.70179623
E                 b> 0.7017717
E              Data differs at [121, 683]:
E                 a> 0.70261586
E                 b> 0.7025893
E              Data differs at [668, 1554]:
E                 a> 0.7394267
E                  ?        -
E                 b> 0.7394027
E                  ?       +
E              Data differs at [1548, 1625]:
E                 a> 0.74220675
E                 b> 0.7422182
E              Data differs at [2040, 1659]:
E                 a> 0.7310654
E                 b> 0.73107785
E              Data differs at [608, 1945]:
E                 a> 1.0903927
E                  ?       --
E                 b> 1.0903755
E                  ?        ++
E              7 different pixels found (0.00% different).
E         
E         Extension HDU 4 (VAR_POISSON, 1):
E         
E            Data contains differences:
E              Data differs at [619, 5]:
E                 a> 0.020912398
E                 b> 0.020911254
E              Data differs at [946, 13]:
E                 a> 0.03960225
E                  ?         ^^
E                 b> 0.039602898
E                  ?         ^^^
E              Data differs at [432, 14]:
E                 a> 0.11363168
E                 b> 0.11364081
E              Data differs at [785, 14]:
E                 a> 0.16637993
E                 b> 0.16638453
E              Data differs at [736, 56]:
E                 a> 0.088048115
E                 b> 0.088083856
E              Data differs at [475, 65]:
E                 a> 0.014082874
E                  ?        ^^ -
E                 b> 0.01408347
E                  ?        ^^
E              Data differs at [432, 74]:
E                 a> 0.3185729
E                  ?        ^^
E                 b> 0.3185776
E                  ?        ^^
E              Data differs at [1560, 76]:
E                 a> 0.027786894
E                 b> 0.027786272
E              Data differs at [166, 182]:
E                 a> 0.016576806
E                  ?        ^^
E                 b> 0.016579064
E                  ?        ^  +
E              Data differs at [434, 201]:
E                 a> 0.016168522
E                 b> 0.016169054
E              ...
E              64 different pixels found (0.00% different).
E         
E       assert False
E        +  where False = <astropy.io.fits.diff.FITSDiff object at 0x7fb6b09cad60>.identical

/Users/jdavies/dev/jwst/jwst/regtest/test_nirspec_verify.py:76: AssertionError

I suspect this is because we are doing calculations in ramp fitting with np.float32 arrays, and the order of operations matters.
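
A quick standalone demonstration of that suspicion (plain NumPy, not pipeline code): float32 addition is not associative, so reordering otherwise identical arithmetic shifts the low bits, and those shifts can compound over a multi-step fit into differences like the ones reported above.

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(loc=1.0, scale=0.1, size=10_000).astype(np.float32)

    # Same numbers, two summation orders.
    forward = np.float32(0.0)
    for v in x:
        forward += v
    reverse = np.float32(0.0)
    for v in x[::-1]:
        reverse += v

    # Mathematically identical sums, but float32 rounding depends on the
    # order of the additions, so the two results generally differ slightly.
    print(forward, reverse, forward - reverse)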

@jdavies-st (Collaborator) left a comment:

The failure seen in

jwst/regtest/test_nirspec_subarray.py

was due to the input data claiming meta.exposure.ngroups = 6 but having only 2 groups in the data, which was inconsistent and incorrect. This caused an issue in the code where sometimes that value was pulled from the data shape and sometimes from meta. It is now always pulled from the data shape, which I think is the correct approach. This errant dataset is now fixed, and all the regression tests run fine both on master and on this branch.
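
A minimal sketch of the convention adopted here (illustrative only; input_model stands in for whatever model the step receives):

    # The science cube is (integrations, groups, rows, cols); deriving
    # ngroups from the shape keeps it consistent with the actual data
    # even when meta.exposure.ngroups is wrong.
    n_int, ngroups, nrows, ncols = input_model.data.shape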

I think this is ready to merge after removing the last 2 commits relating to the g_pix comments, and adding an entry to CHANGES.rst.

@jdavies-st merged commit 8aa848e into spacetelescope:master Apr 6, 2021
@kmacdonald-stsci deleted the jp_1887_refac_4 branch April 6, 2021 22:12