Opened 7 months ago

Closed 7 months ago

#2680 closed help (fixed)

process central_ancilla used more than 10485760kB of memory on node shared100

Reported by: amenon Owned by: um_support
Component: UM Model Keywords: ancils
Cc: Platform: Monsoon2
UM Version: 10.9

Description

Hi,

I am trying to generate some land ancilliaries in Monsoon. Three of the jobs, i.e, ozone, dust and smcsnow ancil creation are failing with the following error:

error: process central_ancilla used more than 10485760kB of memory on node shared100

Some times they also fail with walltime exceeded error. So I tried increasing the wallclock time from 600s to 14400s, still unable to get these jobs succeed. Could you please have a look at this? I found this issue in ticket #2163. But it seems this issue got resolved by itself. I re-tried submitting these jobs several times in the past few days without any success.

The suite id is u-bd222 in Monsoon

Arathy


Change History (3)

comment:1 Changed 7 months ago by willie

Hi Arathy,

In your site setup for Monsoon, suite-adds.rc file has lines like

    [[ANCIL_OZONE]]
        [[[directives]]]
            -l select=1:ncpus=1:mem=5GB
            -l walltime = 00:30:00

so change to mem=15GB and similarly for the others.

Willie

comment:2 Changed 7 months ago by amenon

Hi Willie,

Thanks, the job succeeded now. I had to change the mem to 50 GB in the end, but it succeeded. We could close this ticket now.

Arathy

comment:3 Changed 7 months ago by willie

  • Resolution set to fixed
  • Status changed from new to closed
Note: See TracTickets for help on using tickets.