Opened 3 years ago

Closed 3 years ago

#2218 closed help (answered)

Problem in running UM

Reported by: jfgu Owned by: um_support
Component: UM Model Keywords:
Cc: Platform:
UM Version: <select version>


Hello, I have a problem in running UM. My submitted job suite u-an747 stopped running in the last cycle, saying there is a disk quota problem. However, I checked my work space, it's still far from exceeding the quota limit. This problem also occured last week. Could someone help me with this? Thank you very much!

The error information is as follows:

BUFFOUT: Write Failed: Disk quota exceeded

FLUSH_UNIT_BUFFER: Error Flushing Buffered Data on PE 0
FLUSH_UNIT_BUFFER: Status is 1.0
FLUSH_UNIT_BUFFER: Length Requested was 524288
FLUSH_UNIT_BUFFER: Length written was 0

???!!!???!!!???!!!???!!!???!!! ERROR ???!!!???!!!???!!!???!!!???!!!
? Error code: 1
? Error from routine: portio2a:flush_unit_buffer
? Error message: Failed in output_buffer()
? Error from processor: 0
? Error number: 58

[0] exceptions: An non-exception application exit occured.
[0] exceptions: whilst in a serial region
[0] exceptions: Task had pid=36813 on host nid03037
[0] exceptions: Program is "/work/n02/n02/jfgu/cylc-run/u-an747/share/fcm_make/build-atmos/bin/um-atmos.exe"
Warning in umPrintMgr: umPrintExceptionHandler : Handler Invoked
Rank 0 [Thu Jul 6 10:41:18 2017] [c7-1c2s7n1] application called MPI_Abort(MPI_COMM_WORLD, 9) - process 0
_pmiu_daemon(SIGCHLD): [NID 03037] [c7-1c2s7n1] [Thu Jul 6 10:41:27 2017] PE RANK 0 exit signal Aborted
[NID 03037] 2017-07-06 11:41:27 Apid 27540196: initiated application termination
[FAIL] um-atmos # return-code=137
Received signal ERR
cylc (scheduler - 2017-07-06T10:41:35Z): CRITICAL Task job script received signal ERR at 2017-07-06T10:41:35Z
cylc (scheduler - 2017-07-06T10:41:35Z): CRITICAL failed at 2017-07-06T10:41:35Z

Change History (3)

comment:1 Changed 3 years ago by jfgu

This problem also occured in my standard job u-an829. The same error information.
Anything wrong with my setup?
Both of the two suite run OK for hours. Please could someone help me? Thank you very much!


comment:2 Changed 3 years ago by jfgu

Sorry, I found my work space does exceed the limit. The ARCHER SAFE doesn't update the information. Sorry for the bothering. You can close the ticket now.


comment:3 Changed 3 years ago by grenville

  • Resolution set to answered
  • Status changed from new to closed
Note: See TracTickets for help on using tickets.