Precomputing Lumi Mask For Event Based Data RelVals #138

AdrianoDee · 2024-12-05T10:30:50Z

In #131 we introduced the possibility to run data wfs in which the number of events is used to skim the input files. This technically works but creates some problems when submitting these wfs.

Basically, even if the list of files on which we want to run is parsed correctly, we would still end up staging the entire RAW dataset, since the job SplittingAlgo would still be flagged as LumiBased. Currently, the job dict for an event-based wf looks like this.

I've done some investigation, and it is possible to use a FileBased splitting, but this would not solve the problem: it would only change the splitting and would still trigger the staging of the whole dataset. The only way to ask for a partial dataset is either by specifying a block (not useful) or through a lumi mask.

Thus, in this PR I propose to precompute the lumi mask corresponding to the number of events requested in the data wf, so that the job gets it in its dict and we stage only the needed files. This is done by implementing a lightweight version of das-up-to-nevents.py in the RelVal machine, to be called (when needed) by run-the-matrix-pdmv.py when the job is created.
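To make the idea concrete, here is a minimal sketch of the precomputation, not the code added in this PR. It assumes the per-lumisection event counts are already available as `(run, lumi, n_events)` tuples (for example from a DAS query), accumulates lumisections until the requested number of events is covered, and compresses them into the `{run: [[first, last], ...]}` layout used by CMS lumi-mask JSON files. The function names `compress_lumis` and `build_lumi_mask` and the run numbers are hypothetical.

```python
from collections import defaultdict


def compress_lumis(lumis):
    """Turn lumisection numbers into sorted, inclusive [first, last] ranges."""
    ranges = []
    for lumi in sorted(set(lumis)):
        if ranges and lumi == ranges[-1][1] + 1:
            ranges[-1][1] = lumi  # contiguous: extend the current range
        else:
            ranges.append([lumi, lumi])  # gap: start a new range
    return ranges


def build_lumi_mask(lumi_events, max_events):
    """Select lumisections, in the given order, until max_events is covered.

    lumi_events: iterable of (run, lumi, n_events) tuples (assumed input shape).
    Returns {run_as_str: [[first, last], ...]}.
    """
    selected = defaultdict(list)
    total = 0
    for run, lumi, n_events in lumi_events:
        if total >= max_events:
            break  # enough events already covered
        selected[run].append(lumi)
        total += n_events
    return {str(run): compress_lumis(lumis) for run, lumis in selected.items()}


# Made-up run and lumisection numbers, for illustration only.
example = [
    (355100, 1, 400),
    (355100, 2, 400),
    (355100, 3, 400),
    (355100, 7, 400),
    (355101, 1, 400),
]
mask = build_lumi_mask(example, max_events=1500)
print(mask)
```

Because whole lumisections are selected, the mask generally covers slightly more events than requested; the limit is a lower bound rather than an exact count.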

The resulting dict for an example job is what one would expect:

core/utils/das.py

core/utils/dqm.py

ggonzr

Regarding 3rd party packages being available, we need to be aware of the Python version available in the cms-sw release as el7 and el8 containers do not allow the user to install external dependencies.

core/utils/dqm.py

AdrianoDee · 2024-12-05T16:48:15Z

> Regarding 3rd party packages being available, we need to be aware of the Python version available in the cms-sw release as el7 and el8 containers do not allow the user to install external dependencies.

In my view, we can simply allow this to run only if the ticket is for cms-sw > 14_0_X (for which I'm 100% sure all the packages above have been installed with the proper versions).
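A release gate of this kind could look roughly like the sketch below. It is only an illustration: the helper names are hypothetical, the release string is assumed to follow the usual `CMSSW_<major>_<minor>_...` pattern, and whether 14_0_X itself should qualify is not settled by the comment above, so the threshold is a parameter (treated here as inclusive).

```python
import re


def release_major_minor(release):
    """Extract (major, minor) from a name such as 'CMSSW_14_1_0_pre3'."""
    match = re.match(r"CMSSW_(\d+)_(\d+)_", release)
    if match is None:
        raise ValueError("Unrecognised release name: %r" % release)
    return int(match.group(1)), int(match.group(2))


def can_precompute_lumi_mask(release, minimum=(14, 0)):
    """Return True if the release is at or above the assumed threshold."""
    return release_major_minor(release) >= minimum


for name in ("CMSSW_10_6_20", "CMSSW_12_6_0", "CMSSW_14_1_0_pre3"):
    print(name, can_precompute_lumi_mask(name))
```

Comparing integer tuples avoids the pitfalls of comparing release names as strings, where `"CMSSW_9"` would sort after `"CMSSW_14"`.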

Instead provide two new functions `list_certification_files` and `get_certification_file` for listing all the files related to a certification type and getting a specific golden JSON file. Also, sort the imports using `isort`.

Remove type hints, as old Python versions do not support them

ggonzr · 2024-12-12T01:17:24Z

Testing with the ticket CMSSW_10_6_20__TEST_LUMI_MASK_fullsim_PU_2017_UL-00001, which uses an old release, it seems we have a regression: the assumption made by get_cert_type is not correct and an IndexError is raised.

[2024-12-12 01:53:18,382][ERROR] STDERR: WARNING: In non-interactive mode release checks e.g. deprecated releases, production architectures are disabled.
CMSSW_10_6_20__TEST_LUMI_MASK_fullsim_PU_2017_UL-00001/singularity-script-00489c3c6c0f0e58372a2dbb2fb628be.sh: line 19: CMSSW_10_6_20__TEST_LUMI_MASK_fullsim_PU_/cvmfs/cms.cern.ch/slc7_amd64_gcc700/external/py2-requests/2.21.0-pafccj2/lib/python2.7/site-packages/requests/__init__.py:91: RequestsDependencyWarning: urllib3 (1.25.2) or chardet (3.0.4) doesn't match a supported version!
  RequestsDependencyWarning)
Traceback (most recent call last):
  File "run_the_matrix_pdmv.py", line 318, in <module>
    main()
  File "run_the_matrix_pdmv.py", line 305, in main
    wmsplit))
  File "run_the_matrix_pdmv.py", line 189, in make_relval_step
    lumisections = get_lumi_ranges_from_dict(step_input)
  File "run_the_matrix_pdmv.py", line 164, in get_lumi_ranges_from_dict
    golden = get_golden_json(step_input.dataSet)
  File "CMSSW_10_6_20__TEST_LUMI_MASK_fullsim_PU_2017_UL-00001/dqm.py", line 119, in get_golden_json
    cert_type = get_cert_type(dataset)
  File "CMSSW_10_6_20__TEST_LUMI_MASK_fullsim_PU_2017_UL-00001/dqm.py", line 76, in get_cert_type
    year = dataset.split("Run")[1][2:4] # from 20XX to XX
IndexError: list index out of range

@AdrianoDee could you check this?

AdrianoDee · 2024-12-12T10:20:15Z

I think this is due to the fact that the wfs there are MC wfs requiring an input (recycled). This wrongly triggers the whole chain, which should only ever be called on data. I'm looking into how to avoid this.
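For illustration, the failing expression `dataset.split("Run")[1][2:4]` assumes that every dataset name contains a `Run20XX` era. A more defensive extraction could look like the sketch below, which returns `None` for names without a data-taking era so the caller can skip the lumi-mask chain. This is not the fix applied in the PR; the helper name and the example dataset names are hypothetical.

```python
import re

# Matches the era at the start of the second path component, e.g. '/Run2022C'.
_RUN_ERA = re.compile(r"/Run(20\d{2})[A-Z]")


def get_year_suffix(dataset):
    """Return the two-digit year ('22' for Run2022C), or None if there is no era."""
    match = _RUN_ERA.search(dataset)
    if match is None:
        return None  # e.g. an MC RelVal input: nothing to certify
    return match.group(1)[2:]


# Illustrative dataset names, not taken from the ticket.
print(get_year_suffix("/JetHT/Run2022C-v1/RAW"))
print(get_year_suffix("/RelValTTbar_13/CMSSW_10_6_20-PU_v1/GEN-SIM-DIGI-RAW"))
```

Returning `None` rather than raising lets the caller decide whether a missing era is an error or simply means the step is not a data step.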

AdrianoDee · 2024-12-16T15:06:14Z

Hi @ggonzr, I should have fixed the issue with MC workflows with inputs. Note that the specific ticket in 10_6_20 fails anyway with

/afs/cern.ch/user/p/pdmvserv/relval_submission/CMSSW_10_6_20__TEST_LUMI_MASK_fullsim_PU_2017_UL-00001/singularity-script-aaf0e3c85defa1f3a678bc558c002136.sh: line 19: cd: relval_submission/CMSSW_10_6_20__TEST_LUMI_MASK_fullsim_PU_2017_UL-00001: 
No such filTraceback (most recent call last): File "run_the_matrix_pdmv.py", line 300, in main() File "run_the_matrix_pdmv.py", line 295, in main with open(opt.output_file, 'w', encoding='utf-8') as workflows_file: 
TypeError: 'encoding' is an invalid keyword argument for this function

also in the prod instance. I think this is due to a Python 2 incompatibility: the built-in `open()` in Python 2 does not accept an `encoding` argument. The same ticket in 12_6_0 works. I've added the docs (let me know if they're fine for you).
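One common way to keep such a call working on both interpreters is `io.open`, which accepts `encoding` on Python 2.6+ and is the same function as the built-in `open` on Python 3. The sketch below is an assumption about how the write could be made portable, not a change made in this PR; `write_text` is a hypothetical helper.

```python
import io
import os
import tempfile


def write_text(path, text):
    """Write text as UTF-8 on both Python 2 and Python 3."""
    if isinstance(text, bytes):
        # On Python 2 a plain str is bytes; io.open in text mode needs unicode.
        text = text.decode("utf-8")
    with io.open(path, "w", encoding="utf-8") as handle:
        handle.write(text)


# Usage: write to a temporary file and read it back.
demo_path = os.path.join(tempfile.mkdtemp(), "workflows.json")
write_text(demo_path, '{"workflows": []}\n')
with io.open(demo_path, encoding="utf-8") as handle:
    print(handle.read())
```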

ggonzr

There are some typos to fix.

core/utils/das.py

Fix some typos for docstrings

List the packages and versions used for the development of remote modules located in `utils/`

Format some modules, in particular the new ones.

A simplified das-up-to-nevents.py for precomputing lumisections when nevents limit is needed

4b69577