Opened 3 years ago
Closed 3 years ago
#2531 closed help (fixed)
nesting suite problems
Reported by: | willkeat | Owned by: | willie |
---|---|---|---|
Component: | UM Model | Keywords: | nesting suite |
Cc: | Platform: | ARCHER | |
UM Version: |
Description (last modified by ros)
On 19/06/18 14:36, Will Keat wrote:
Hello,
I am trying to run a double nested 300m model following this example:
https://code.metoffice.gov.uk/trac/rmed/wiki/suites/nesting/worked_eg_2018
But I am having a strange problem. When I try and run the suite (u-ay913) on archer I get the attached error message (no host selected).
Do you have any ideas why this might be? I've checked that I am able to ssh from puma to archer OK without the need to enter my password, and tripled checked my username is OK!
Thanks,
Will
Change History (9)
comment:1 Changed 3 years ago by ros
comment:2 Changed 3 years ago by ros
I copied the suite from a worked example (https://code.metoffice.gov.uk/trac/rmed/wiki/suites/nesting/worked_eg_2018); suite u-av356.
The suite is attempting to run a 300m model nested within a 1.5 km model nested within the global model over Johannesburg. Turning off the 100m orography does allow me to at least run it (even though it fails on something else), any idea why I can’t seem to use it?
comment:3 Changed 3 years ago by ros
- Description modified (diff)
comment:4 Changed 3 years ago by ros
- Reporter changed from will to willkeat
comment:5 Changed 3 years ago by willie
- Summary changed from nestin gsuite problems to nesting suite problems
Hi Will,
This is the new Nesting Suite tutorial designed for Monsoon. It probably hasn't been ported to ARCHER yet, although it does have an old ARCHER site file. It could take some time to port properly.
You could try the previous version of the Nesting Suite turoial (June 2017) which has been ported to ARCHER - see my u-ao230.
Regards
Willie
comment:6 Changed 3 years ago by willie
- Owner changed from um_support to willie
- Status changed from new to accepted
Comment by WillKeat:
I have copied your suite and adapted it to the region and configuration I require but it has fallen over with the following error:
ModuleCmd_Load.c(244):ERROR:105: Unable to locate a modulefile for 'cray-snplauncher' gc_abort (Processor 0): ANCIL EXIT Rank 0 [Fri Jun 29 11:52:39 2018] [c6-2c1s5n2] application called MPI_Abort(MPI_COMM_WORLD, 9) - process 0 _pmiu_daemon(SIGCHLD): [NID 04310] [c6-2c1s5n2] [Fri Jun 29 11:52:39 2018] PE RANK 0 exit signal Aborted [FAIL] ancil_igbp_wrapper # return-code=134 Received signal ERR cylc (scheduler - 2018-06-29T11:52:41Z): CRITICAL Task job script received signal ERR at 2018-06-29T11:52:41Z cylc (scheduler - 2018-06-29T11:52:41Z): CRITICAL failed at 2018-06-29T11:52:41Z
Do you know why this might be please?
comment:7 Changed 3 years ago by willie
Hi Will
What's the suite-id?
Regards
Willie
comment:8 Changed 3 years ago by willie
Hi Will,
I can't see your files on ARCHER. Please follow the instructions at http://cms.ncas.ac.uk/wiki/CmsHelpdesk
Regards
Willie
comment:9 Changed 3 years ago by willie
- Resolution set to fixed
- Status changed from accepted to closed
Will Keating via email:
My supervisor managed to have some success running the model after chatting with Stu Webster – the issues seem to have been with trying to run the suite on archer…
Cheers for your help though. Feel free to close the ticket.
The problem comes from JHB_test_JHB_1p5_ancil_orog_srtm_pp, which inherits ANCIL_LOCAL and HOST_IDL, hence:
but "not-yet" is unknown.
DId you create the suite? I am not familiar with what it is supposed to do.