#1715 closed error (fixed)

Model run crashed

Reported by: im13009
Owned by: um_support
Priority: normal
Component: UM Model
Keywords:
Cc:
Platform: ARCHER
UM Version: 4.5

Description

Hi,

I'm a new user of ARCHER. I have tried to run several jobs and I always get the same error: every job crashes before producing any results. The job IDs are tdzza, tdzub and tdzuc. See the output files tdzza000.tdzza.d15306.t132200.leave, tdzuc000.tdzuc.d15303.t152220.leave and tdzub000.tdzub.d15299.t124358.leave, which are in /home/n02/n02/im13009/um/umui_out. I have attached a screenshot of the error I get for all the jobs.

Attachments (6)

Screenshot - Error.docx (566.9 KB) - added by im13009 18 months ago.
Screenshot errors
Errors.docx (559.3 KB) - added by im13009 18 months ago.
Errors - new version.docx (761.8 KB) - added by im13009 18 months ago.
Errors - Failed in model executable.docx (168.9 KB) - added by im13009 18 months ago.
Hand_edit error.docx (196.8 KB) - added by im13009 18 months ago.
Errors-failed executable & Disk quota.docx (241.7 KB) - added by im13009 18 months ago.

Change History (22)

Changed 18 months ago by im13009

Screenshot errors

comment:1 Changed 18 months ago by grenville

Irene

Where did you get the job - did it run for its previous owner?

Grenville

comment:2 Changed 18 months ago by im13009

Yes, all these jobs ran for their previous owners. The tdzza job is a copy of job xluba (user Maxllo), and the jobs tdzuc and tdzub are copies of job xlxza (user lsim).

Many thanks,
Irene.

comment:3 Changed 18 months ago by grenville

Irene

Please try running tdzub with Louise's executable (/work/n02/n02/lsim/execs/HadCM3.exec). This is set in model selection→sub-model independent→compile options for the model.

Grenville

Changed 18 months ago by im13009

comment:4 Changed 18 months ago by im13009

I have tried it, and this time I don't get exactly the same error. However, I still get an error related to the executable. The error message is: “HadCM3.exec not found”. I also get other errors. I have attached a screenshot of all these errors. I still don't get any results.

Changed 18 months ago by im13009

comment:5 Changed 18 months ago by im13009

Sorry! One of the errors was actually my fault. I have run the model again, but I still get some errors and no results. I have attached a file with screenshots of all the errors. The output file is tdzub000.tdzub.d15306.t182926 (/home/n02/n02/im13009/um/umui_out).

comment:6 Changed 18 months ago by simon

The executable doesn't have execute permissions. Try

chmod 755 /work/n02/n02/im13009/execLouise/HadCM3.exec

and resubmit
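
As a minimal sketch of the check behind this advice: the helper below tests whether a file has its execute bit set and applies chmod 755 only if it does not. The function name ensure_executable is hypothetical and not part of the UM or UMUI tooling; the path in the example comment is the one from this ticket.

```shell
# Hypothetical helper: make a file executable only if it is not already.
ensure_executable() {
    exec_path="$1"
    # Refuse to continue if the file is missing (e.g. a wrong path in the UMUI).
    if [ ! -f "$exec_path" ]; then
        echo "missing: $exec_path" >&2
        return 1
    fi
    # Add rwxr-xr-x permissions when the execute bit is not set.
    if [ ! -x "$exec_path" ]; then
        chmod 755 "$exec_path"
    fi
    # Show the resulting permissions for a quick visual check.
    ls -l "$exec_path"
}

# Example (path taken from this ticket):
# ensure_executable /work/n02/n02/im13009/execLouise/HadCM3.exec
```

A "missing" message here would point to a wrong path rather than a permissions problem.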

Changed 18 months ago by im13009

comment:7 Changed 18 months ago by im13009

Hi,

I have changed the permissions and I still get errors related to the executable. The error message is "failed in model executable". The output file is tdzub000.tdzub.d15313.t190615.leave (/home/n02/n02/im13009/um/umui_out). I have attached a file with screenshots of the errors.

Many thanks!
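
For reference, a rough sketch of how the newest .leave file for a job could be located and searched for the messages quoted in this ticket. The helper names latest_leave and scan_leave are hypothetical, and the search patterns are only the messages reported here, not a complete list of UM error strings.

```shell
# Hypothetical helper: print the newest .leave file for a job ID in a directory.
latest_leave() {
    job_id="$1"
    out_dir="$2"
    # ls -t sorts by modification time, newest first.
    ls -t "$out_dir"/"$job_id"*.leave 2>/dev/null | head -n 1
}

# Hypothetical helper: list numbered lines that mention the failures seen in this ticket.
scan_leave() {
    grep -n -i -E 'failed in model executable|disk quota exceeded|not found' "$1"
}

# Example (job ID and directory taken from this ticket):
# scan_leave "$(latest_leave tdzub /home/n02/n02/im13009/um/umui_out)"
```

The search is case-insensitive because the exact capitalisation of the messages in the .leave file is not shown in the ticket.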

comment:8 Changed 18 months ago by simon

You appear to be a victim of an obscure ksh bug. Try adding
/home/simon/umui_jobs/hand_edit/fix_sizes
to your hand edits and resubmitting.

Changed 18 months ago by im13009

comment:9 Changed 18 months ago by im13009

Hi!

I have added /home/simon/umui_jobs/hand_edit/fix_sizes to my hand edits and I get a GHUI error message. Attached you can find a screenshot.

Many thanks!

comment:10 Changed 18 months ago by simon

You are applying it in the incorrect place.
It goes in Submodel Independent→Post Processing→Local post-processing scripts

Simon.

Changed 18 months ago by im13009

comment:11 Changed 18 months ago by im13009

Hi,

I have now applied it in the correct place and run the model. This time I get some results, but the model crashed at some point. I get errors related to the executable (failed executable) and to disk quota (disk quota exceeded). I have attached some screenshots.

Many thanks,
Irene.

comment:12 Changed 18 months ago by im13009

I forgot to mention that the output file is tdzub000.tdzub.d15316.t133536.leave (/home/n02/n02/im13009/um/umui_out).

comment:13 Changed 18 months ago by simon

The model ran OK, but you ran out of disk space. Your quota has now been increased from 10GB to 300GB.
Note that this will take a couple of hours to become active.

Simon.
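
A rough way to keep an eye on space before a long run is sketched below. It assumes that du on the output directory tree is a good enough proxy for usage; the actual quota accounting on ARCHER may differ. The helper name under_limit and the directory in the example comment are hypothetical.

```shell
# Hypothetical helper: succeed if DIR uses less than LIMIT_GB gigabytes.
under_limit() {
    dir="$1"
    limit_gb="$2"
    # du -sk reports the total size of the directory tree in kilobytes.
    used_kb=$(du -sk "$dir" | awk '{print $1}')
    limit_kb=$((limit_gb * 1024 * 1024))
    echo "used: ${used_kb} KB of ${limit_kb} KB"
    [ "$used_kb" -lt "$limit_kb" ]
}

# Example (the directory is an assumption; 300 is the new quota in GB from this ticket):
# under_limit /work/n02/n02/im13009 300 || echo "close to or over the limit"
```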

comment:14 Changed 18 months ago by im13009

Hi,

This is just to let you know that the model ran OK. I have got all the results.

By the way, do I need to add /home/simon/umui_jobs/hand_edit/fix_sizes to my hand edits every time I run the model?

Thank you for all your help!
Irene.

comment:15 Changed 18 months ago by simon

Excellent.

Yes, the hand edit should be added to all of your UM vn4.5 experiments.

Simon.

comment:16 Changed 18 months ago by simon

  • Resolution set to fixed
  • Status changed from new to closed