Opened 5 years ago
Closed 5 years ago
#1809 closed help (fixed)
vn8.4 release job for ARCHER
Reported by: | akpandeyjnu | Owned by: | ros |
---|---|---|---|
Component: | UKCA | Keywords: | |
Cc: | Luke, dstevens | Platform: | ARCHER |
UM Version: | 8.4 |
Description
Hello,
I'm having an issue with the vn8.4 release job (xlavb). I've copied it and created a new job with Job Id xmkgc and xmkgd. I've changed the user ID, email, job run length, and the model run time. I have also added some diagnostics in the STASH in xmkgc.
I am trying to run the job for 1 month. The job starts running but fails after 10 days and contains the following error:-
For Job xmkgc:
/work/n02/n02/alok/xmkgc/bin/qsatmos: Executing model run
*
UM Executable : /work/n02/n02/alok/xmkgc/bin/xmkgc.exe
*
mkdir:: File exists
At line 307 of file ff2pp.F
Fortran runtime error: Disk quota exceeded
At line 307 of file ff2pp.F
Fortran runtime error: Disk quota exceeded
sys-122 : UNRECOVERABLE error on system request
Disk quota exceeded
Encountered during an I/O operation on unit 6
Fortran unit 6 is connected to a sequential formatted text file:
"/work/n02/n02/alok/xmkgc/pe_output/xmkgc.fort6.pe43"
sys-122 : UNRECOVERABLE error on system request
Disk quota exceeded
Encountered during an I/O operation on unit 6
Fortran unit 6 is connected to a sequential formatted text file:
"/work/n02/n02/alok/xmkgc/pe_output/xmkgc.fort6.pe109"
sys-122 : UNRECOVERABLE error on system request
Disk quota exceeded
Encountered during an I/O operation on unit 6
Fortran unit 6 is connected to a sequential formatted text file:
"/work/n02/n02/alok/xmkgc/pe_output/xmkgc.fort6.pe97"
sys-122 : UNRECOVERABLE error on system request
Disk quota exceeded
For job xmkgd:
/work/n02/n02/alok/xmkgd/bin/qsatmos: Executing model run
*
UM Executable : /work/n02/n02/alok/xmkgd/bin/xmkgd.exe
*
BUFFOUT: Write Failed: Disk quota exceeded
????????????????????????????????????????????????????????????????????????????????
???!!!???!!!???!!!???!!!???!!!???!!! ERROR ???!!!???!!!???!!!???!!!???!!!???!!!?
? Error in routine: UM_WRITDUMP
? Error Code: 400
? Error Message: Failure writing out field
? Error generated from processor: 0
? This run generated 818 warnings
????????????????????????????????????????????????????????????????????????????????
The leave files for both jobs are /output/xmkgc000.xmkgc.d16039.t134433.leave and
/output/ xmkgd000.xmkgd.d16033.t121153.leave
The first 9 days outputs files are in /rdf/archive. The 10-day file is in work (/home/n02/n02/alok/work/xmkgc and /home/n02/n02/alok/work/xmkgd). A a dump file is also available i.e xmkgda.da19991211_00 in the /work/xmkgd and xmkgca.da19991211_00 in the /work/xmkgc
I have tried the both jobs 2-3 times but every time model crashes after 10 days run. Is this issue with disk space availability in the /n02/work or any issue with quota allotted for me?
Regards
Alok
Change History (4)
comment:1 Changed 5 years ago by ros
- Owner changed from um_support to ros
- Status changed from new to accepted
comment:2 Changed 5 years ago by akpandeyjnu
Hi Ros.,
I am planning to run two or more jobs same time. I found that my personal quota of /work is 100Gb only. Can you please increase my work quota if possible. It will help me in running more jobs.
Regards,
Alok
comment:3 Changed 5 years ago by ros
Hi Alok,
I have increased your quota to 500Gb. If you think you will need more, please let me know how much space you need.
Cheers,
Ros.
comment:4 Changed 5 years ago by ros
- Resolution set to fixed
- Status changed from accepted to closed
Hi Alok,
You've exceeded your personal quota on /work. I've just increased it for you. The change will take a little while to come into effect.
Regards,
Ros.