Opened 2 years ago

Closed 2 years ago

#2279 closed help (fixed)

No Hosts Selected - Monsoon Submission Error

Reported by: pliojop Owned by: ros
Component: UM Model Keywords: Monsoon, Rose, Host
Cc: Platform: Monsoon2
UM Version: 10.6

Description

Hi,

I have a Rose suite (u-aq187) that is running in the nested suite. When I start to submit the job, I get an error message:

[FAIL] bash -ec H=$(rose\ host-select\ exvmsrose);\ echo\ $H # return-code=1, stderr=
[FAIL] [WARN] exvmsrose: (timed out)
[FAIL] [FAIL] No hosts selected.

This is the same as in ticket #2192, however I was unable to utilise the solution in that ticket.

In my GUI for my suite, there is no option for setting the host and a grep around the suite files in the directory didn't seem to indicate a host.

Any help than can be offered would be much appreciated.

Many thanks

James Pope

Change History (5)

comment:1 Changed 2 years ago by ros

  • Owner changed from um_support to ros
  • Status changed from new to accepted

Hi James,

I'm a little confused as I have just taken a copy of your ~/roses/u-aq187 directory but when I try running with rose suite-run I get the error saying you're trying to submit to xcm and build is turned off??? Have I got the right suite?

Cheers,
Ros.

comment:2 Changed 2 years ago by pliojop

Hi Ros,

u-aq187 is the correct job. Within the Rose GUI under

suite conf → jinja2:suite.rc → General run options

I have selected the site as Monsoon and then set the appropriate charging code.

If you are seeing the job as pointing at another machine, is there another level where I need to be setting the target machine?

Thanks

James

comment:3 Changed 2 years ago by ros

Hi James,

The HPC_HOST is set in /home/d02/jpope/roses/u-aq187/site/monsoon-cray-xc40/suite-adds.rc. Change it to be xcslc0 or maybe just xcsc will be good enough. Also the number of cores per node (NCPU_PER_NODE) needs changing from 32 to 36.

Give that a try and see if it makes any difference.

Cheers,
Ros.

comment:4 Changed 2 years ago by pliojop

Thanks Ros,

That has resolved the issue.

Many thanks

James

comment:5 Changed 2 years ago by pliojop

  • Resolution set to fixed
  • Status changed from accepted to closed
Note: See TracTickets for help on using tickets.