Opened 3 years ago

Closed 3 years ago

Last modified 3 years ago

#1983 closed help (answered)

Security warning during model run on MONSooN when sending message

Reported by: marcus Owned by: um_support
Component: Rose Keywords:
Cc: Platform: MONSooN
UM Version: 10.4

Description

Hi,

Yesterday at 2016-09-20T10:18:57Z I got the following error message at the end of my puma-aa356 model run 795718.xcm00.

The model run seems to have succeeded but there was a security problem sending a message which interrupted the running suite. What message does the system try to send? Is this something to do with the security patches that were applied yesterday at 0900-1100?

File:
/home/makoe/cylc-run/puma-aa356/log/job/19971101T0000Z/atmos_main/NN/job.out

> Using output file
> /home/makoe/cylc-run/puma-aa356/work/19971101T0000Z/atmos_main/pe_outp
> ut/aa356.fort6.pe0 cylc (scheduler - 2016-09-20T10:18:57Z): succeeded 
> at 2016-09-20T10:18:57Z Send message: try 1 of 7 failed: security 
> reasons
>    retry in 5.0 seconds, timeout is 30 Send message: try 2 of 7 
> failed: security reasons
>    retry in 5.0 seconds, timeout is 30 Send message: try 3 of 7 
> failed: security reasons
>    retry in 5.0 seconds, timeout is 30 Send message: try 4 of 7 
> failed: security reasons
>    retry in 5.0 seconds, timeout is 30 Send message: try 5 of 7 
> failed: security reasons
>    retry in 5.0 seconds, timeout is 30 Send message: try 6 of 7 
> failed: security reasons
>    retry in 5.0 seconds, timeout is 30 Send message: try 7 of 7 
> failed: security reasons JOB SCRIPT EXITING (TASK SUCCEEDED)

Many thanks,

Marcus

Change History (3)

comment:1 Changed 3 years ago by ros

Hi Marcus,

When a task (atmos in this case) finishes running on the HPC cylc then sends a message back to the cylc daemon running on exvmsrose to say "Task x has finished".
This is the message that failed to send. The exvmsrose server was probably down being patched (and would take with it the cylc daemon that controls the running of the suite) when your task finished hence the problem.

Regards,
Ros.

Last edited 3 years ago by ros (previous) (diff)

comment:2 Changed 3 years ago by ros

  • Reporter changed from ros to makoe
  • Resolution set to answered
  • Status changed from new to closed

Hi Ros, thanks for letting me know.
Best regards,
Marcus

comment:3 Changed 3 years ago by ros

  • Reporter changed from makoe to marcus
Note: See TracTickets for help on using tickets.