Opened 5 years ago

Closed 5 years ago

#1510 closed help (fixed)

crun failing

Reported by: sara123fenech
Owned by: um_support
Component: UM Model
Keywords: quota
Cc:
Platform: ARCHER
UM Version: 7.3

Description

To whom it may concern,

I am trying to run job xktof, which consists of an nrun and a crun (it is a copy of job xjnjn). The nrun ran successfully, but the crun is failing. The .leave file (xktof000.xktof.d15076.t172050.leave) suggests that this is a quota issue, as shown below:

Job started at : Wed Mar 18 02:50:24 GMT 2015
     Run started from UMUI
     Running from control files in /home/n02/n02/sfen/umui_runs/xktof-076172039
nrun cp xiupn Cray XC30 - test
This job is running on machine mom5,
using UM directory /work/n02/n02/hum,
***************************************************************
   Starting script :   qsexecute
   Starting time   :   Wed Mar 18 02:50:26 GMT 2015
***************************************************************


/work/n02/n02/sfen/um/xktof/bin/qsexecute: Executing combine

 STOP
/work/n02/n02/sfen/um/xktof/bin/qscombine: Job terminated normally

/work/n02/n02/sfen/um/xktof/bin/qsexecute: Executing model run

*********************************************************
UM Executable : /work/n02/n02/sfen/um/xktof/bin/xktof.exe
*********************************************************


aprun: Apid 13232891: Write failure to stdout of 2048 bytes, ret -1: Disk quota exceeded
aprun: Apid 13232891: Exiting due to errors. Application aborted
xktof: Run failed
*****************************************************************
   Ending script   :   qsexecute
   Completion code :   1
   Completion time :   Wed Mar 18 03:38:44 GMT 2015
*****************************************************************

My PUMA quota shows that I am using only 241M of 977M:


Filesystem          blocks   quota   limit   grace   files   quota   limit   grace
/dev/sdb3             241M    977M   1075M           17831       0       0

and my ARCHER quota shows that I am using 1242M of 5120M:


Filesystem          blocks   quota   limit   grace   files   quota   limit   grace
netapp1:/vol/vol2    1242M   5120M   5120M           41760   4295m   4295m


Do you think this is because there is not enough space for the output files or is it due to some account settings?

Thanks for your help

Sara

Change History (3)

comment:1 Changed 5 years ago by willie

Hi Sara,

The "Disk quota exceeded" message refers to ARCHER /work. You can see your quota if you log in to ARCHER SAFE. To find out how much you are actually using, run:

cd /work/n02/n02/sfen/um
du -mshc *

This may take a few minutes.
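
To see which entries under a directory take the most space, the du output can be sorted by size. This is a minimal sketch only: the function name largest_entries is hypothetical and is not part of the UM scripts, and it assumes a du that accepts -s and -m.

```shell
# largest_entries DIR [N]: print the N largest entries directly under DIR,
# one per line, as "<size in MB> <name>", largest first.
# Hypothetical helper for illustration only.
largest_entries() {
    dir=${1:-.}
    count=${2:-10}
    # -s gives one total per entry, -m reports sizes in megabytes.
    # The subshell keeps the cd from affecting the caller.
    ( cd "$dir" && du -sm -- * 2>/dev/null | sort -rn | head -n "$count" )
}
```

For example, `largest_entries /work/n02/n02/sfen/um 5` would list the five largest run directories under the path used above.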

So you need to make some space. To finish the job, you will need roughly the size of your NRUN output (DATADIR/um/xktof) multiplied by the number of CRUNs.

Regards

Willie

comment:2 Changed 5 years ago by sara123fenech

Hi Willie

Thanks a lot for your reply. I have checked the space I'm occupying in my work directory and it adds up to 29G. I have removed everything from this directory and now have only the nrun outputs for my xktof job, which was one month long. My crun is 4 months long, so I would need at least 120G for it.

I have checked my fs2 quota on SAFE and it's only 40G (not enough for the crun). Is there any way that this could be increased please?

Thanks for your help
Sara
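
As a rough check of the estimate in comment 2, the "NRUN size times number of CRUNs" rule can be written out as arithmetic. This is a sketch under assumptions: it treats the 29G figure as the size of the one-month nrun output, counts each crun month as one NRUN-sized chunk, and uses a hypothetical helper name, required_gb. It ignores the space the existing nrun output continues to occupy.

```shell
# required_gb NRUN_GB N_CRUNS: estimated space for the CRUN output, in GB.
# Hypothetical helper for illustration only.
required_gb() {
    echo $(( $1 * $2 ))
}

nrun_gb=29      # size of the one-month nrun output (assumed from comment 2)
crun_months=4   # length of the crun in months (comment 2)
quota_gb=40     # /work (fs2) quota reported in SAFE (comment 2)

need_gb=$(required_gb "$nrun_gb" "$crun_months")
if [ "$need_gb" -gt "$quota_gb" ]; then
    echo "need ~${need_gb}G but quota is ${quota_gb}G: short by $(( need_gb - quota_gb ))G"
else
    echo "need ~${need_gb}G, quota ${quota_gb}G: fits"
fi
```

With these figures the estimate is about 116G, consistent with the "at least 120G" quoted above and well beyond the 40G quota.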

comment:3 Changed 5 years ago by annette

  • Keywords quota added
  • Platform set to ARCHER
  • Resolution set to fixed
  • Status changed from new to closed

Hi Sara,

I see your disk quota has been increased. I assume this has fixed your problems so I'm closing the ticket.

Best regards,
Annette
