Opened 3 years ago

Closed 3 years ago

#1852 closed error (answered)

No leave file - run failed

Reported by: im13009 Owned by: um_support
Component: UM Model Keywords:
Cc: Platform: ARCHER
UM Version: 4.5

Description

Hello,

Every time that I try to run the job tecib, the model fails. Furthermore, I am not getting the .leave file in /home/n02/n02/im13009/um/umui_out.

I did the compilation and the executable seems ok, however, every time that I try to run the job the model fails. The executable is called HadCM3.exec and it is in /work/n02/n02/im13009/execLouise

I have also tried to run an old job (tdzub) with the executable of the new job tecib. I have the same problem. As soon as the job starts running, the model fails.

My user id is im13009.
Many thanks,
Irene.

Change History (2)

comment:1 Changed 3 years ago by willie

Hi Irene,

I had a look in the file

 ~im13009/um/umui_out/tecib000.tecib.d16096.t163427.leave

This says that

aprun: -N cannot exceed -n

You're trying to run on 4x4 processors which you can't do on ARCHER with this version. Try changing it to 4x6 which is a multiple of 24 processors, matching ARCHER's architecture.

Regards
Willie

}}}

comment:2 Changed 3 years ago by ros

  • Resolution set to answered
  • Status changed from new to closed
Note: See TracTickets for help on using tickets.