Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Rocoto and scrontab on Gaea C6 #112

Description

@BijuThomas-NOAA

crontab is not accessible on Gaea. Instead, users will need to use scrontab in order to submit cron jobs. When the rocoto command is executed using scrontab, rocoto fails to submit the job(it keep trying but failing):

+ rocotorun --verbose=5 -w hafs_workflow.xml -d hafs_workflow.db
08/15/24 10:42:14 EDT :: hafs_workflow.xml :: Submission status of Task1 for cycle 202108271200 could not be retrieved because the server process at druby://192.188.179.70:46157 died
08/15/24 10:42:14 EDT :: hafs_workflow.xml :: Submission of Task1 for cycle 202108271200 probably, but not necessarily, failed.  It will be resubmitted
08/15/24 10:42:14 EDT :: hafs_workflow.xml :: Submitting Task1 using sbatch < /tmp/sbatch.in20240815-276566-hq1ki2 with input

2024-08-15 10:38:14 -0400 :: gaea61 :: Submitting Task1
2024-08-15 10:38:14 -0400 :: gaea61 :: Submission status of Task1 is pending at druby://192.188.179.70:38415
2024-08-15 10:40:14 -0400 :: gaea61 :: Submission status of Task1 for cycle 202108271200 could not be retrieved because the server process at druby://192.188.179.70:38415 died
2024-08-15 10:40:14 -0400 :: gaea61 :: Submission of Task1 for cycle 202108271200 probably, but not necessarily, failed.  It will be resubmitted
2024-08-15 10:40:14 -0400 :: gaea61 :: Submitting Task1
2024-08-15 10:40:14 -0400 :: gaea61 :: Submission status of Task1 is pending at druby://192.188.179.70:46157
2024-08-15 10:42:14 -0400 :: gaea61 :: Submission status of Task1 for cycle 202108271200 could not be retrieved because the server process at druby://192.188.179.70:46157 died
2024-08-15 10:42:14 -0400 :: gaea61 :: Submission of Task1 for cycle 202108271200 probably, but not necessarily, failed.  It will be resubmitted

Any suggestions on this?. There is no issue when rocotorun is being executed repeatedly using a bash while loop instead of scron.

Here are the scrontab directives:

#SCRON --partition=cron_c6
#SCRON --time=0:10:00
#SCRON --account=windfall
#SCRON --job-name=scron_test
#SCRON --chdir=/gpfs/f6/hurr1/proj-shared/Biju.Thomas/rocoto_test
*/2 * * * *  /gpfs/f6/hurr1/proj-shared/Biju.Thomas/rocoto_test/launch.sh 1>> //gpfs/f6/hurr1/proj-shared/Biju.Thomas/rocoto_test/cron.log 2>&1

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions