#3043 closed help (answered)

Cannot stop running suite

Reported by: ggxmy
Owned by: um_support
Component: Rose/Cylc
Keywords:
Cc:
Platform: Monsoon2
UM Version: 11.1

Description

This is basically the same problem as #2873, but on Monsoon.
The advice there was to run 'cylc stop' on the right machine. However, as shown below, that doesn't seem to work for me. Please can I have some advice on this?

Thanks,
Masaru

myosh@xcslc0:u-bn684 $ rose suite-run
[FAIL] Suite "u-bn684" appears to be running:
[FAIL] Contact info from: "/home/d03/myosh/cylc-run/u-bn684/.service/contact"
[FAIL]     CYLC_SUITE_HOST=xcslc1
[FAIL]     CYLC_SUITE_OWNER=myosh
[FAIL]     CYLC_SUITE_PORT=43108
[FAIL]     CYLC_SUITE_PROCESS=18474 /usr/bin/python2 /common/fcm/cylc-7.8.3/bin/cylc-run u-bn684 --host=localhost
[FAIL] Try "cylc stop 'u-bn684'" first?

CYLC_SUITE_HOST=xcslc1, so I logged in there and tried to stop the suite:

myosh@xcslc1:u-bn684 $ cylc stop 'u-bn684'
/usr/lib64/python2.6/site-packages/requests/packages/urllib3/connection.py:337: SubjectAltNameWarning: Certificate for xcslc1 has no `subjectAltName`, falling back to check for a `commonName` for now. This feature is being removed by major browsers and deprecated by RFC 2818. (See https://github.com/shazow/urllib3/issues/497 for details.)
  SubjectAltNameWarning
myosh@xcslc1:u-bn684 $ rose suite-run
[FAIL] Suite "u-bn684" appears to be running:
[FAIL] Contact info from: "/home/d03/myosh/cylc-run/u-bn684/.service/contact"
[FAIL]     CYLC_SUITE_HOST=xcslc1
[FAIL]     CYLC_SUITE_OWNER=myosh
[FAIL]     CYLC_SUITE_PORT=43108
[FAIL]     CYLC_SUITE_PROCESS=18474 /usr/bin/python2 /common/fcm/cylc-7.8.3/bin/cylc-run u-bn684 --host=localhost
[FAIL] Try "cylc stop 'u-bn684'" first?
myosh@xcslc1:u-bn684 $ rose suite-clean
Clean u-bn684? y/n (default n) y
[FAIL] Suite "u-bn684" appears to be running:
[FAIL] Contact info from: "/home/d03/myosh/cylc-run/u-bn684/.service/contact"
[FAIL]     CYLC_SUITE_HOST=xcslc1
[FAIL]     CYLC_SUITE_OWNER=myosh
[FAIL]     CYLC_SUITE_PORT=43108
[FAIL]     CYLC_SUITE_PROCESS=18474 /usr/bin/python2 /common/fcm/cylc-7.8.3/bin/cylc-run u-bn684 --host=localhost
[FAIL] Try "cylc stop 'u-bn684'" first?
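
For reference, the contact file quoted in the messages above can be checked by hand to see whether the recorded suite process still exists. The following is only a rough sketch, not the procedure from any official guide. It assumes the contact file holds plain KEY=VALUE lines and that CYLC_SUITE_PROCESS begins with the process ID, as the output above suggests. The helper names (contact_field, contact_pid, suite_process_state) are made up for illustration, and the check only makes sense when run on the host named in CYLC_SUITE_HOST.

```shell
#!/bin/sh
# Hypothetical helpers for inspecting a Cylc 7 contact file.
# Assumption: the file contains KEY=VALUE lines, one per line.

# Print the value of one key ($2) from the contact file ($1).
contact_field() {
    sed -n "s/^$2=//p" "$1" | head -n 1
}

# Print the PID, assumed to be the first word of CYLC_SUITE_PROCESS.
contact_pid() {
    contact_field "$1" CYLC_SUITE_PROCESS | awk '{print $1}'
}

# Print "running", "not-running" or "unknown" for the recorded process.
# kill -0 sends no signal; it only tests whether the PID can be signalled,
# so it is only reliable for processes owned by the current user.
suite_process_state() {
    pid=$(contact_pid "$1")
    if [ -z "$pid" ]; then
        echo "unknown"
    elif kill -0 "$pid" 2>/dev/null; then
        echo "running"
    else
        echo "not-running"
    fi
}
```

For example, on xcslc1 one might run `suite_process_state ~/cylc-run/u-bn684/.service/contact`. If the process is reported as running, Cylc 7 also offers `cylc stop --kill` and `cylc stop --now` as stronger variants of a plain `cylc stop`. If it is not running, the contact file is probably stale. Either way, the site's own instructions should take precedence over this sketch.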

Change History (2)

comment:1 Changed 12 months ago by ros

Instructions on how to kill a stuck suite are available on the collaboration wiki:
https://collab.metoffice.gov.uk/twiki/bin/view/Support/TroubleshootingStuckJobsInRose

Regards,
Ros.

comment:2 Changed 12 months ago by ggxmy

  • Resolution set to answered
  • Status changed from new to closed

Hi Ros,
Great! That worked.
Cheers.
Masaru
