Opened 3 years ago

Closed 3 years ago

#1809 closed help (fixed)

vn8.4 release job for ARCHER

Reported by: akpandeyjnu
Owned by: ros
Priority: normal
Component: UKCA
Keywords:
Cc: Luke, dstevens
Platform: ARCHER
UM Version: 8.4

Description

Hello,

I'm having an issue with the vn8.4 release job (xlavb). I copied it to create two new jobs, with Job IDs xmkgc and xmkgd, and changed the user ID, email, job run length, and model run time. I also added some STASH diagnostics in xmkgc.
I am trying to run the job for 1 month. The job starts running but fails after 10 days with the following error:
For Job xmkgc:
/work/n02/n02/alok/xmkgc/bin/qsatmos: Executing model run
*
UM Executable : /work/n02/n02/alok/xmkgc/bin/xmkgc.exe
*
mkdir:: File exists
At line 307 of file ff2pp.F
Fortran runtime error: Disk quota exceeded
At line 307 of file ff2pp.F
Fortran runtime error: Disk quota exceeded
sys-122 : UNRECOVERABLE error on system request

Disk quota exceeded

Encountered during an I/O operation on unit 6
Fortran unit 6 is connected to a sequential formatted text file:

"/work/n02/n02/alok/xmkgc/pe_output/xmkgc.fort6.pe43"

sys-122 : UNRECOVERABLE error on system request

Disk quota exceeded

Encountered during an I/O operation on unit 6
Fortran unit 6 is connected to a sequential formatted text file:

"/work/n02/n02/alok/xmkgc/pe_output/xmkgc.fort6.pe109"

sys-122 : UNRECOVERABLE error on system request

Disk quota exceeded

Encountered during an I/O operation on unit 6
Fortran unit 6 is connected to a sequential formatted text file:

"/work/n02/n02/alok/xmkgc/pe_output/xmkgc.fort6.pe97"

sys-122 : UNRECOVERABLE error on system request

Disk quota exceeded

For job xmkgd:
/work/n02/n02/alok/xmkgd/bin/qsatmos: Executing model run

*
UM Executable : /work/n02/n02/alok/xmkgd/bin/xmkgd.exe
*
BUFFOUT: Write Failed: Disk quota exceeded

????????????????????????????????????????????????????????????????????????????????
???!!!???!!!???!!!???!!!???!!!???!!! ERROR ???!!!???!!!???!!!???!!!???!!!???!!!?
? Error in routine: UM_WRITDUMP
? Error Code: 400
? Error Message: Failure writing out field
? Error generated from processor: 0
? This run generated 818 warnings
????????????????????????????????????????????????????????????????????????????????

The leave files for the two jobs are /output/xmkgc000.xmkgc.d16039.t134433.leave and
/output/xmkgd000.xmkgd.d16033.t121153.leave

The output files for the first 9 days are in /rdf/archive. The day 10 file is in work (/home/n02/n02/alok/work/xmkgc and /home/n02/n02/alok/work/xmkgd). A dump file is also available for each job: xmkgda.da19991211_00 in /work/xmkgd and xmkgca.da19991211_00 in /work/xmkgc.

I have tried both jobs 2-3 times, but every time the model crashes after a 10-day run. Is this a problem with the disk space available in /n02/work, or with the quota allotted to me?

Regards

Alok
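
The failures in both logs above share one signature, the message "Disk quota exceeded". As a minimal sketch (not part of the UM or its scripts), the Python below lists every line of a log that carries that message, which can help confirm that a crash is quota-related before looking for other causes. The function name find_quota_errors is hypothetical, and the leave file path in the usage comment is the one quoted in this ticket.

```python
QUOTA_MESSAGE = "Disk quota exceeded"  # message text as it appears in the logs above


def find_quota_errors(lines):
    """Return (line_number, text) for each line containing the quota message.

    `lines` is any iterable of strings, for example an open leave file.
    Line numbers start at 1.
    """
    hits = []
    for number, line in enumerate(lines, start=1):
        if QUOTA_MESSAGE in line:
            hits.append((number, line.rstrip("\n")))
    return hits


# Usage sketch (path taken from the ticket; adjust for your own job):
#
#   with open("/output/xmkgc000.xmkgc.d16039.t134433.leave", errors="replace") as fh:
#       for number, text in find_quota_errors(fh):
#           print(number, text)
```

If the list comes back empty, the run most likely failed for a different reason and the quota is not the first thing to check.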

Change History (4)

comment:1 Changed 3 years ago by ros

  • Owner changed from um_support to ros
  • Status changed from new to accepted

Hi Alok,

You've exceeded your personal quota on /work. I've just increased it for you. The change will take a little while to come into effect.

Regards,
Ros.
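
For anyone hitting the same error, a rough way to see how close a job directory is to a quota is to add up the file sizes under it. The Python below is only a sketch under stated assumptions: the helper names directory_usage_bytes and quota_report are hypothetical, the 100 GB figure is the personal /work quota mentioned later in this ticket, and summed file sizes only approximate what the file system charges against a quota. The file system's own quota report remains the authoritative figure.

```python
import os


def directory_usage_bytes(root):
    """Sum the sizes of all files found under `root`.

    Symbolic links are not followed; each link is counted by its own size.
    Files that disappear or cannot be read during the walk are skipped.
    """
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                total += os.lstat(path).st_size
            except OSError:
                continue
    return total


def quota_report(used_bytes, quota_gb):
    """Return (fraction_used, bytes_remaining) for a quota given in GB (2**30 bytes)."""
    quota_bytes = quota_gb * 1024 ** 3
    fraction_used = used_bytes / quota_bytes
    bytes_remaining = max(quota_bytes - used_bytes, 0)
    return fraction_used, bytes_remaining


if __name__ == "__main__":
    # Path taken from the log in this ticket; 100 GB is an assumed quota value.
    used = directory_usage_bytes("/work/n02/n02/alok/xmkgc")
    fraction, remaining = quota_report(used, quota_gb=100)
    print(f"used {used / 1024 ** 3:.1f} GB, {fraction:.0%} of quota, "
          f"{remaining / 1024 ** 3:.1f} GB remaining")
```

Running this before submitting a long job gives an early warning when the dumps and output files from earlier runs already fill most of the allocation.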

comment:2 Changed 3 years ago by akpandeyjnu

Hi Ros,

I am planning to run two or more jobs at the same time. I found that my personal quota on /work is only 100Gb. Could you please increase my /work quota if possible? It would help me run more jobs.

Regards,

Alok

comment:3 Changed 3 years ago by ros

Hi Alok,

I have increased your quota to 500Gb. If you think you will need more, please let me know how much space you need.

Cheers,
Ros.

comment:4 Changed 3 years ago by ros

  • Resolution set to fixed
  • Status changed from accepted to closed