Opened 5 years ago

Closed 5 years ago

#1553 closed help (fixed)

ensemble run stopped

Reported by: ggxmy
Owned by: um_support
Component: UM Model
Keywords:
Cc:
Platform: ARCHER
UM Version: 8.4

Description

Dear CMS Helpdesk,

I submitted an ensemble of 31 members, each of which is a UM vn8.4 UKCA job (tdwua-z and tdwva-e). It started this afternoon and was running well, but stopped only 1¾ hours later. I submitted it to the regular queue with a wallclock limit of 24 hours.

My submit scripts are
ensemble_sub_tdwua-118170729_20150428-183337 and
submit_ensemble_sub_tdwua-118170729_20150428-183337
in /home/n02/n02/masara/umui_runs/scripts/ensemble_submissions/ .

I have no idea why it was stopped. The log file /work/n02/n02/masara/um/output/tdwua000.tdwua.d15118.t170743.leave.20150428-183337 shows nothing but this:

--------------------------------------------------------------------------------
*** masara   Job: 2865736.sdb   started: 30/04/15 13:44:59   host: mom5 ***
*** masara   Job: 2865736.sdb   started: 30/04/15 13:44:59   host: mom5 ***
*** masara   Job: 2865736.sdb   started: 30/04/15 13:44:59   host: mom5 ***
*** masara   Job: 2865736.sdb   started: 30/04/15 13:44:59   host: mom5 ***

--------------------------------------------------------------------------------
ModuleCmd_Switch.c(172):ERROR:152: Module 'PrgEnv_cray' is currently not loaded
ModuleCmd_Load.c(226):ERROR:105: Unable to locate a modulefile for 'netcdf'
Terminated
--------------------------------------------------------------------------------

Resources requested: ncpus=8928,place=free,walltime=24:00:00
Resources allocated: cpupercent=415,cput=02:13:17,mem=768920kb,ncpus=8928,vmem=11516384kb,walltime=01:45:07

*** masara   Job: 2865736.sdb   ended: 30/04/15 15:30:23   queue: standard ***
*** masara   Job: 2865736.sdb   ended: 30/04/15 15:30:23   queue: standard ***
*** masara   Job: 2865736.sdb   ended: 30/04/15 15:30:23   queue: standard ***
*** masara   Job: 2865736.sdb   ended: 30/04/15 15:30:23   queue: standard ***
--------------------------------------------------------------------------------

Can you see what went wrong? How can I fix the problem?

Thank you.
Masaru Yoshioka

Change History (1)

comment:1 Changed 5 years ago by annette

  • Resolution set to fixed
  • Status changed from new to closed

This was answered offline, and was to do with the user's environment.

Annette
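
For anyone who hits a similar log, here is a minimal sketch of pulling the module errors and the walltime figures out of a .leave file. The function name summarise_leave_log and both regular expressions are illustrative assumptions inferred only from the lines quoted in this ticket; they are not part of the UM, PBS or the modules tooling, and the format may differ on other systems.

```python
import re

# Patterns inferred from the log excerpt quoted in this ticket (an assumption,
# not a documented format).
MODULE_ERROR = re.compile(r"^ModuleCmd_\w+\.c\(\d+\):ERROR:(\d+): (.*)$")
WALLTIME = re.compile(r"walltime=(\d+):(\d{2}):(\d{2})")


def summarise_leave_log(text):
    """Return (module_errors, walltimes_in_seconds) found in the log text.

    module_errors is a list of (error_code, message) tuples.
    walltimes maps "Resources requested" / "Resources allocated" to seconds.
    """
    errors = []
    walltimes = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        match = MODULE_ERROR.match(line)
        if match:
            errors.append((int(match.group(1)), match.group(2)))
            continue
        for label in ("Resources requested", "Resources allocated"):
            if line.startswith(label):
                found = WALLTIME.search(line)
                if found:
                    hours, minutes, seconds = (int(p) for p in found.groups())
                    walltimes[label] = hours * 3600 + minutes * 60 + seconds
    return errors, walltimes


if __name__ == "__main__":
    # Lines copied from the log in this ticket.
    sample = """\
ModuleCmd_Switch.c(172):ERROR:152: Module 'PrgEnv_cray' is currently not loaded
ModuleCmd_Load.c(226):ERROR:105: Unable to locate a modulefile for 'netcdf'
Terminated
Resources requested: ncpus=8928,place=free,walltime=24:00:00
Resources allocated: cpupercent=415,cput=02:13:17,mem=768920kb,ncpus=8928,vmem=11516384kb,walltime=01:45:07
"""
    module_errors, times = summarise_leave_log(sample)
    for code, message in module_errors:
        print(f"module error {code}: {message}")
    for label, secs in times.items():
        print(f"{label}: {secs} s")
```

Run against the excerpt in this ticket, the sketch reports the two module errors (codes 152 and 105) and 6307 seconds of walltime used out of 86400 requested, which is consistent with the run stopping after about 1¾ hours. It only summarises what the log says; it does not identify why the job was terminated.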
