Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

@germanfgv
Copy link
Contributor

@germanfgv germanfgv commented Sep 30, 2020

First configuration test for MWGR#4:

Release: CMSSW_11_1_4
RUNS: 337240 and 337234
GT (**):

  • HLT: 111X_dataRun3_HLT_v2
  • Express: 111X_dataRun3_Express_v2
  • Prompt: 111X_dataRun3_Prompt_v2

@germanfgv
Copy link
Contributor Author

run replay please

@gkfthddk
Copy link
Contributor

Host name : vocms047.cern.ch
Container ID : 3
Pull request url : #4542
Current build at jenkins : https://cmssdt.cern.ch/dmwm-jenkins/job/DMWM-T0-PR-test-job/198/.
Jira Issue : https://its.cern.ch/jira/browse/CMSTZDEV-611

@cmsdmwmbot
Copy link

Container Tests
Unit tests finished

@gkfthddk
Copy link
Contributor

There are 1 paused jobs in the replay.

1 similar comment
@gkfthddk
Copy link
Contributor

gkfthddk commented Oct 1, 2020

There are 1 paused jobs in the replay.

@cmsdmwmbot
Copy link

Container Tests
JIRA URL : https://its.cern.ch/jira/browse/CMSTZDEV-611
Tier0_REPLAY v198 DMWM-T0-PR-test-job on vocms047.cern.ch. Initial replay configuration for MWGR4 2020
There are 1 paused jobs in the replay. List of Paused job below.
Cache directory : Exit code
/data/tier0/docker/container3/srv/wmagent/current/install/tier0/JobCreator/JobCache/Repack_Run337240_StreamNanoDST_Tier0_REPLAY_2020_v2010010010_201001_0011/Repack/JobCollection_5_0/job_2777 : None

Return None : 1 job(s)
#Error message ==========
INFO:root:Subprocess stderr was:
None
INFO:root:Executing CMSSW. args: ['/bin/bash', '/srv/job/WMTaskSpace/cmsRun1/cmsRun1-main.sh', '', 'slc7_amd64_gcc820', 'scramv1', 'CMSSW', 'CMSSW_11_1_4', 'FrameworkJobReport.xml', 'cmsRun', 'PSet.py', '', '', '']
INFO:root:PSS: 1103482; RSS: 1103228; PCPU: 50.3; PMEM: 2.4
ERROR:root:Error in CMSSW step cmsRun1
Number of Cores: 1
Job has exceeded maxPSS: 1000 MB
Job has PSS: 1103 MB

ERROR:root:Attempting to kill step using SIGUSR2
INFO:root:addOutputFile method called with outputModule: write_L1Accept_RAW, aFile: None
INFO:root:addOutputFile method fileRef: , whole tree: {}
ERROR:root:Tried to divide by zero doing storage statistics report parsing.
ERROR:root:Either you aren't reading and writing data, or you aren't reporting it.
ERROR:root:Not adding any storage performance info to report.
INFO:root:Steps.Executors.CMSSW.post called
INFO:root:StepName: cmsRun1, StepType: CMSSW, with result: 0
INFO:root:Steps.Executor logging started

#====================

There are only 1 memory errors, Will check again after 2 hours.
There are 1 paused jobs in the replay. List of Paused job below.
Cache directory : Exit code
/data/tier0/docker/container3/srv/wmagent/current/install/tier0/JobCreator/JobCache/Repack_Run337240_StreamNanoDST_Tier0_REPLAY_2020_v2010010010_201001_0011/Repack/JobCollection_5_0/job_2777 : None

Return None : 1 job(s)
#Error message ==========
INFO:root:Subprocess stderr was:
None
INFO:root:Executing CMSSW. args: ['/bin/bash', '/srv/job/WMTaskSpace/cmsRun1/cmsRun1-main.sh', '', 'slc7_amd64_gcc820', 'scramv1', 'CMSSW', 'CMSSW_11_1_4', 'FrameworkJobReport.xml', 'cmsRun', 'PSet.py', '', '', '']
INFO:root:PSS: 1103482; RSS: 1103228; PCPU: 50.3; PMEM: 2.4
ERROR:root:Error in CMSSW step cmsRun1
Number of Cores: 1
Job has exceeded maxPSS: 1000 MB
Job has PSS: 1103 MB

ERROR:root:Attempting to kill step using SIGUSR2
INFO:root:addOutputFile method called with outputModule: write_L1Accept_RAW, aFile: None
INFO:root:addOutputFile method fileRef: , whole tree: {}
ERROR:root:Tried to divide by zero doing storage statistics report parsing.
ERROR:root:Either you aren't reading and writing data, or you aren't reporting it.
ERROR:root:Not adding any storage performance info to report.
INFO:root:Steps.Executors.CMSSW.post called
INFO:root:StepName: cmsRun1, StepType: CMSSW, with result: 0
INFO:root:Steps.Executor logging started

#====================

Replay was closed by puased job

@germanfgv
Copy link
Contributor Author

run replay please

@gkfthddk
Copy link
Contributor

Host name : vocms047.cern.ch
Container ID : 1
Pull request url : #4542
Current build at jenkins : https://cmssdt.cern.ch/dmwm-jenkins/job/DMWM-T0-PR-test-job/203/.
Jira Issue : https://its.cern.ch/jira/browse/CMSTZDEV-614

@gkfthddk
Copy link
Contributor

There are 1 paused jobs in the replay.

@gkfthddk
Copy link
Contributor

There are 2 paused jobs in the replay.

@cmsdmwmbot
Copy link

Container Tests
JIRA URL : https://its.cern.ch/jira/browse/CMSTZDEV-614
Tier0_REPLAY v203 DMWM-T0-PR-test-job on vocms047.cern.ch. Initial replay configuration for MWGR4 2020
There are 1 paused jobs in the replay. List of Paused job below.
Cache directory : Exit code
/data/tier0/docker/container1/srv/wmagent/current/install/tier0/JobCreator/JobCache/Repack_Run337240_StreamNanoDST_Tier0_REPLAY_2020_v2010131925_201013_1926/Repack/JobCollection_9_0/job_2823 : None

Return None : 1 job(s)
#Error message ==========
INFO:root:Subprocess stderr was:
None
INFO:root:Executing CMSSW. args: ['/bin/bash', '/srv/job/WMTaskSpace/cmsRun1/cmsRun1-main.sh', '', 'slc7_amd64_gcc820', 'scramv1', 'CMSSW', 'CMSSW_11_1_4', 'FrameworkJobReport.xml', 'cmsRun', 'PSet.py', '', '', '']
INFO:root:PSS: 264282; RSS: 263640; PCPU: 42.0; PMEM: 0.5
INFO:root:PSS: 1460448; RSS: 1460132; PCPU: 61.3; PMEM: 3.2
ERROR:root:Error in CMSSW step cmsRun1
Number of Cores: 1
Job has exceeded maxPSS: 1000 MB
Job has PSS: 1460 MB

ERROR:root:Attempting to kill step using SIGUSR2
INFO:root:addOutputFile method called with outputModule: write_L1Accept_RAW, aFile: None
INFO:root:addOutputFile method fileRef: , whole tree: {}
ERROR:root:Tried to divide by zero doing storage statistics report parsing.
ERROR:root:Either you aren't reading and writing data, or you aren't reporting it.
ERROR:root:Not adding any storage performance info to report.
INFO:root:Steps.Executors.CMSSW.post called
INFO:root:StepName: cmsRun1, StepType: CMSSW, with result: 0

#====================

There are only 1 memory errors, Will check again after 3 hours.
There are 2 paused jobs in the replay. List of Paused job below.
Cache directory : Exit code
/data/tier0/docker/container1/srv/wmagent/current/install/tier0/JobCreator/JobCache/Repack_Run337240_StreamNanoDST_Tier0_REPLAY_2020_v2010131925_201013_1926/Repack/
└─JobCollection_9_0/job_2823 : None
└─JobCollection_5_0/job_2777 : None

Return None : 2 job(s)
#Error message ==========
INFO:root:Subprocess stderr was:
None
INFO:root:Executing CMSSW. args: ['/bin/bash', '/srv/job/WMTaskSpace/cmsRun1/cmsRun1-main.sh', '', 'slc7_amd64_gcc820', 'scramv1', 'CMSSW', 'CMSSW_11_1_4', 'FrameworkJobReport.xml', 'cmsRun', 'PSet.py', '', '', '']
INFO:root:PSS: 264282; RSS: 263640; PCPU: 42.0; PMEM: 0.5
INFO:root:PSS: 1460448; RSS: 1460132; PCPU: 61.3; PMEM: 3.2
ERROR:root:Error in CMSSW step cmsRun1
Number of Cores: 1
Job has exceeded maxPSS: 1000 MB
Job has PSS: 1460 MB

ERROR:root:Attempting to kill step using SIGUSR2
INFO:root:addOutputFile method called with outputModule: write_L1Accept_RAW, aFile: None
INFO:root:addOutputFile method fileRef: , whole tree: {}
ERROR:root:Tried to divide by zero doing storage statistics report parsing.
ERROR:root:Either you aren't reading and writing data, or you aren't reporting it.
ERROR:root:Not adding any storage performance info to report.
INFO:root:Steps.Executors.CMSSW.post called
INFO:root:StepName: cmsRun1, StepType: CMSSW, with result: 0

#====================

Replay was closed by puased job

@germanfgv germanfgv merged commit 1ad3eb1 into master Nov 6, 2020
@germanfgv germanfgv deleted the 2020MWGR4_Replay_337240_337234 branch November 16, 2020 14:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants