[JME NanoAOD] Changes for event size reduction #32722

Merged


nurfikri89
Contributor

PR description:

This PR reduces the event size of custom JME NanoAODs. The following changes are made:

  • Check that npuProd[ibx].getPU_pT_hats() size is more than 0 in PhysicsTools/NanoAOD/plugins/NPUTablesProducer.cc.
  • Switch on storing Parton Shower (PS) weights.
  • b-tagging algorithm outputs are saved for AK4 jets with pT > 15 GeV.
  • Low-level input variables for PileUp jet ID and Quark-Gluon Likelihood are saved for AK4 jets with pT > 10 GeV.
  • Remove AK4 PF and AK8 PF jet collections.
  • Add separate AK8 GEN and AK8 Puppi jet collections for JEC studies. Only minimal information is saved.
  • For reclustered GEN jet collections, the minimum pT at the jet clustering level uses the default value (3 GeV) in RecoJets/JetProducers/python/ak4GenJets_cfi.py.
  • For reclustered reco-level jet collections, the minimum pT at the jet clustering level is set to 0, and only jets with (JEC-corrected) pT > 8 GeV are saved. For MC samples, jets with pT < 8 GeV are additionally saved if they are matched to GEN jets.
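
The saving rules in the list above can be sketched in plain Python. This is only an illustration of the thresholds quoted in the description, not the CMSSW configuration itself: the `Jet` class, `keep_jet`, and `stored_extras` are hypothetical names, and it is an assumption that the b-tagging and PU jet ID / QGL thresholds are applied to the same corrected pT.

```python
from dataclasses import dataclass

# Thresholds quoted in the PR description (GeV).
MIN_PT_SAVE = 8.0        # corrected-pT cut for saving a reclustered reco jet
MIN_PT_BTAG = 15.0       # b-tagging outputs are stored above this
MIN_PT_PUID_QGL = 10.0   # PU jet ID / QGL input variables are stored above this


@dataclass
class Jet:
    """Hypothetical stand-in for a reclustered reco-level jet."""
    corrected_pt: float        # JEC-corrected pT in GeV
    gen_matched: bool = False  # only meaningful for MC samples


def keep_jet(jet: Jet, is_mc: bool) -> bool:
    """Save jets above the pT cut; in MC, also save GEN-matched jets below it."""
    if jet.corrected_pt > MIN_PT_SAVE:
        return True
    # The description does not say what happens at exactly 8 GeV;
    # this sketch treats that case like the low-pT one.
    return is_mc and jet.gen_matched


def stored_extras(jet: Jet) -> dict:
    """Which optional AK4 variable groups are stored for this jet.

    Assumption: the same corrected pT is compared against both thresholds.
    """
    return {
        "btag_outputs": jet.corrected_pt > MIN_PT_BTAG,
        "puid_qgl_inputs": jet.corrected_pt > MIN_PT_PUID_QGL,
    }
```

Under these assumptions, a 5 GeV jet is dropped in data but kept in MC when it has a GEN match, and a 12 GeV jet carries the PU jet ID / QGL inputs but not the b-tagging outputs.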

With this PR, the event size is reduced from 9.32 kb/event to 6.42 kb/event. The comparison was made using 10K events from a TTJets RunIISummer19UL17MiniAOD sample. The changes in this PR were discussed in the 13/01/2021 XPOG meeting [1].

[1] https://indico.cern.ch/event/978436/
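
For reference, the quoted event sizes correspond to a saving of roughly 31%. A minimal sketch of the arithmetic, using only the two numbers stated above (the variable names are illustrative):

```python
size_before_kb = 9.32  # kb/event before this PR, as quoted in the description
size_after_kb = 6.42   # kb/event after this PR, as quoted in the description

absolute_saving_kb = size_before_kb - size_after_kb
relative_saving = absolute_saving_kb / size_before_kb

print(f"saved {absolute_saving_kb:.2f} kb/event ({relative_saving:.1%})")
```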

If this PR is a backport, please specify the original PR and why you need to backport that PR:

Once merged, this PR needs to be backported to 10_6_X, for the ultra legacy campaigns.

@cmsbuild cmsbuild added this to the CMSSW_11_3_X milestone Jan 23, 2021
@nurfikri89
Contributor Author

nurfikri89 commented Jan 23, 2021

FYI JMAR convenors @camclean @alefisico

@cmsbuild
Contributor

-code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-32722/20859

  • This PR adds an extra 32KB to repository

Code checks found code style and quality issues, which could be resolved by applying the following patch(es)

@cmsbuild
Contributor

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-32722/20860

  • This PR adds an extra 32KB to repository

@cmsbuild
Contributor

A new Pull Request was created by @nurfikri89 (Nurfikri Norjoharuddeen) for master.

It involves the following packages:

PhysicsTools/NanoAOD
PhysicsTools/PatAlgos

@perrotta, @gouskos, @cmsbuild, @fgolf, @slava77, @jpata, @mariadalfonso can you please review it and eventually sign? Thanks.
@jdamgov, @emilbols, @gouskos, @swertz, @jdolen, @ahinzmann, @smoortga, @schoef, @rappoccio, @mariadalfonso, @JyothsnaKomaragiri, @nhanvtran, @gkasieczka, @clelange, @hatakeyamak, @ferencek, @gpetruc, @andrzejnovak, @peruzzim, @seemasharmafnal, @mmarionncern this is something you requested to watch as well.
@silviodonato, @dpiparo, @qliphy you are the release manager for this.

cms-bot commands are listed here

@gouskos
Contributor

gouskos commented Jan 23, 2021

please test

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-43558c/12493/summary.html
COMMIT: 6e83ab4
CMSSW: CMSSW_11_3_X_2021-01-23-1100/slc7_amd64_gcc900
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/32722/12493/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 37
  • DQMHistoTests: Total histograms compared: 2716961
  • DQMHistoTests: Total failures: 1
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2716938
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 36 files compared)
  • Checked 156 log files, 37 edm output root files, 37 DQM output files

@mariadalfonso
Contributor

please test workflow 25202.15,10224.15,11024.15

(to test JMEnano workflow)

@mariadalfonso
Contributor

@andrzejnovak
please confirm that this update is good for BTV as well

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-43558c/12507/summary.html
COMMIT: 6e83ab4
CMSSW: CMSSW_11_3_X_2021-01-24-2300/slc7_amd64_gcc900
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/32722/12507/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-43558c/10224.15_TTbar_13+2017PU_JMENano+TTbar_13TeV_TuneCUETP8M1_GenSim+DigiPU+RecoFakeHLTPU+HARVESTFakeHLTPU+Nano
  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-43558c/11024.15_TTbar_13+2018PU_JMENano+TTbar_13TeV_TuneCUETP8M1_GenSim+DigiPU+RecoFakeHLTPU+HARVESTFakeHLTPU+Nano
  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-43558c/25202.15_TTbar_13+TTbar_13+DIGIUP15_PU25+RECOUP15_PU25+HARVESTUP15_PU25+NANOUP15MC_PU25_JME

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 37
  • DQMHistoTests: Total histograms compared: 2716961
  • DQMHistoTests: Total failures: 7
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2716932
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 36 files compared)
  • Checked 156 log files, 37 edm output root files, 37 DQM output files

"enabled" : False,
},
{
"jet" : "ak6gen",
Contributor

What's the motivation for the change from ak8 -> ak6 in the naming?

Contributor Author

I moved the reclustering of the new AK8 GEN jet collection into a function here. The change in the line above only adds a dummy example using ak6gen; the ak6gen jet clustering is switched off.
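
To illustrate what a switched-off dummy entry means in practice, here is a minimal sketch of a list of per-collection entries filtered on an `"enabled"` flag. Only the `"jet"` and `"enabled"` keys come from the diff excerpt above; the `recluster_config` name, the `enabled_collections` helper, and the second entry are hypothetical and are not the actual PhysicsTools/NanoAOD code.

```python
# Hypothetical recluster configuration: one dict per jet collection.
# Only the "jet" and "enabled" keys are taken from the diff excerpt.
recluster_config = [
    {"jet": "ak6gen", "enabled": False},  # dummy example, switched off
    {"jet": "ak4gen", "enabled": True},   # hypothetical enabled entry
]


def enabled_collections(config):
    """Return the names of the collections that would be reclustered.

    Assumption: an entry without an "enabled" key is treated as disabled.
    """
    return [entry["jet"] for entry in config if entry.get("enabled", False)]


print(enabled_collections(recluster_config))
```

With the flag set to `False`, the ak6gen entry stays in the configuration as a template but produces no collection.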

@andrzejnovak
Contributor

@mariadalfonso afaik btv is not using this JME Nano

@gouskos
Contributor

gouskos commented Jan 28, 2021

+xpog

All tests passed, and changes to the code are consistent with the PR description
Please prepare the corresponding backports for CMSSW_11_2_X and CMSSW_10_6_X

@nurfikri89
Contributor Author

@gouskos The backport PRs are #32759 (10_6_X) and #32760 (11_2_X)

@slava77
Contributor

slava77 commented Jan 28, 2021

+1

for #32722 6e83ab4

  • code changes are in line with the PR description; reco review covers just PhysicsTools/PatAlgos/python/tools/jetCollectionTools.py , which has some extension in functionality without changing the defaults
  • jenkins tests pass and comparisons with the baseline show no relevant differences

@cmsbuild
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @silviodonato, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)

@qliphy
Contributor

qliphy commented Jan 29, 2021

+1

@cmsbuild cmsbuild merged commit 3aa05aa into cms-sw:master Jan 29, 2021
cmsbuild added a commit that referenced this pull request Feb 1, 2021
…reduction

[JME NanoAOD] Backport of #32722 (Changes for event size reduction) to 10_6_X
cmsbuild added a commit that referenced this pull request Feb 3, 2021
…reduction

[JME NanoAOD] Backport of #32722 (Changes for event size reduction) to 11_2_X
@nurfikri89 nurfikri89 deleted the from113X_202101191100_nanojme_sizereduction branch May 25, 2021 08:27