Opened 2 years ago

Closed 2 years ago

#2207 closed help (fixed)

Submitting problem from PUMA

Reported by: jfgu Owned by: um_support
Component: Rose Keywords:
Cc: Platform: ARCHER
UM Version: 10.6

Description

Hi, I got a problem with submitting a standard job. Please see the attached file for the problem. Coule someone help me with this? Thank you very much!

Jian-Feng

Attachments (2)

submit problem1.png (123.5 KB) - added by jfgu 2 years ago.
submitting problem
submit problem2.png (604.1 KB) - added by jfgu 2 years ago.

Download all attachments as: .zip

Change History (9)

Changed 2 years ago by jfgu

submitting problem

comment:1 Changed 2 years ago by ros

Hi Jian-Feng,

Have you tried to submit the suite more that once? If not could you try again please. Your quotas look ok everywhere so not sure why it couldn't finish sorting out the log files. Also can you cut and past the full log output (ie. what's shown when you click "show log").

Cheers,
Ros.

comment:2 Changed 2 years ago by jfgu

Hi Ros, here is the log output:

[INFO] 2017-06-21T12:17:04+0100 Configuration: /home/jfgu/roses/u-an039/
[INFO] 2017-06-21T12:17:04+0100 file: rose-suite.conf
[INFO] 2017-06-21T12:17:04+0100 create: log.20170621T111704Z
[INFO] 2017-06-21T12:17:04+0100 delete: log
[INFO] 2017-06-21T12:17:04+0100 symlink: log.20170621T111704Z ⇐ log
[INFO] 2017-06-21T12:17:04+0100 tar -czf log.20170621T103640Z.tar.gz log.20170621T103640Z
[INFO] 2017-06-21T12:17:04+0100 log.20170621T103640Z.tar.gz ⇐ log.20170621T103640Z
[INFO] 2017-06-21T12:17:04+0100 delete: log.20170621T103640Z/
[INFO] 2017-06-21T12:17:04+0100 create: log/suite
[INFO] 2017-06-21T12:17:04+0100 create: log/rose-conf
[INFO] 2017-06-21T12:17:04+0100 svn info —non-interactive
[INFO] 2017-06-21T12:17:04+0100 svn status —non-interactive
[INFO] 2017-06-21T12:17:04+0100 svn diff —internal-diff —non-interactive
[INFO] 2017-06-21T12:17:04+0100 git describe
[INFO] 2017-06-21T12:17:04+0100 symlink: rose-conf/20170621T121704-run.conf ⇐ log/rose-suite-run.conf
[INFO] 2017-06-21T12:17:04+0100 symlink: rose-conf/20170621T121704-run.version ⇐ log/rose-suite-run.version
[INFO] 2017-06-21T12:17:04+0100 export CYLC_VERSION=6.11.4
[INFO] 2017-06-21T12:17:04+0100 export ROSE_ORIG_HOST=puma
[INFO] 2017-06-21T12:17:04+0100 export ROSE_VERSION=2016.11.1
[INFO] 2017-06-21T12:17:04+0100 unchanged: Jinja2Filters
[INFO] 2017-06-21T12:17:04+0100 source: /home/jfgu/roses/u-an039/Jinja2Filters
[INFO] 2017-06-21T12:17:04+0100 unchanged: app
[INFO] 2017-06-21T12:17:04+0100 source: /home/jfgu/roses/u-an039/app
[INFO] 2017-06-21T12:17:04+0100 unchanged: bin
[INFO] 2017-06-21T12:17:04+0100 source: /home/jfgu/roses/u-an039/bin
[INFO] 2017-06-21T12:17:04+0100 unchanged: meta
[INFO] 2017-06-21T12:17:04+0100 source: /home/jfgu/roses/u-an039/meta
[INFO] 2017-06-21T12:17:04+0100 unchanged: rose-suite.info
[INFO] 2017-06-21T12:17:04+0100 source: /home/jfgu/roses/u-an039/rose-suite.info
[INFO] 2017-06-21T12:17:04+0100 unchanged: suite-setup.rc
[INFO] 2017-06-21T12:17:04+0100 source: /home/jfgu/roses/u-an039/suite-setup.rc
[INFO] 2017-06-21T12:17:04+0100 delete: suite.rc
[INFO] 2017-06-21T12:17:04+0100 install: suite.rc
[INFO] 2017-06-21T12:17:08+0100 ssh -oBatchMode=yes login.archer.ac.uk bash —login -c \'ROSE_VERSION=2016.11.1\ rose\ suite-run\ -v\ -v\ —name=u-an039\ —run=run\ —remote=uuid=084b85da-19d1-4999-8961-5774b0194b56,root-dir=$DATADIR\'

Jian-Feng

comment:3 Changed 2 years ago by ros

Hi Jian-Feng,

Ok, unfortunately that has given no further hints. I can see that you haven't yet built the model executable for this suite so the easiest thing will be to do a rose suite-run --new which will delete the cylc-run directories for this suite and start from a clean slate and hopefully fix the problem.

Cheers,
Ros.

comment:4 Changed 2 years ago by jfgu

Hi Ros, I tried rose suite-run —new, but it still doesn't work. It is very strange I submitted the same job yesterday and it runs OK. What I change is the cycling period, wallclock time. Attached is the error. And the following is the log output:

[INFO] 2017-06-21T13:41:03+0100 cylc get-global-config -i [hosts][localhost]run\ directory
[INFO] 2017-06-21T13:41:03+0100 cylc get-global-config -i [hosts][localhost]work\ directory
[INFO] 2017-06-21T13:41:03+0100 Configuration: /home/jfgu/roses/u-an039/
[INFO] 2017-06-21T13:41:03+0100 file: rose-suite.conf
[INFO] 2017-06-21T13:41:03+0100 cylc —version
[INFO] 2017-06-21T13:41:04+0100 pgrep -f -u jfgu python.*/bin/cylc-(run|restart)(\ |\ .+\ )u-an039(\ |$)
[INFO] 2017-06-21T13:41:04+0100 pgrep -f -u jfgu python.*/bin/cylc-(run|restart)(\ |\ .+\ )u-an039(\ |$)
[INFO] 2017-06-21T13:41:04+0100 delete: /home/jfgu/cylc-run/u-an039/
[INFO] 2017-06-21T13:41:04+0100 cylc refresh —unregister
[INFO] 2017-06-21T13:41:05+0100 delete: /home/jfgu/.cylc/u-an039
[INFO] 2017-06-21T13:41:05+0100 create: /home/jfgu/cylc-run/u-an039
[INFO] 2017-06-21T13:41:05+0100 create: log.20170621T124105Z
[INFO] 2017-06-21T13:41:05+0100 symlink: log.20170621T124105Z ⇐ log
[INFO] 2017-06-21T13:41:05+0100 create: log/suite
[INFO] 2017-06-21T13:41:05+0100 create: log/rose-conf
[INFO] 2017-06-21T13:41:05+0100 svn info —non-interactive
[INFO] 2017-06-21T13:41:05+0100 svn status —non-interactive
[INFO] 2017-06-21T13:41:05+0100 svn diff —internal-diff —non-interactive
[INFO] 2017-06-21T13:41:05+0100 git describe
[INFO] 2017-06-21T13:41:05+0100 symlink: rose-conf/20170621T134105-run.conf ⇐ log/rose-suite-run.conf
[INFO] 2017-06-21T13:41:05+0100 symlink: rose-conf/20170621T134105-run.version ⇐ log/rose-suite-run.version
[INFO] 2017-06-21T13:41:05+0100 create: share
[INFO] 2017-06-21T13:41:05+0100 create: share/cycle
[INFO] 2017-06-21T13:41:05+0100 create: work
[INFO] 2017-06-21T13:41:05+0100 export CYLC_VERSION=6.11.4
[INFO] 2017-06-21T13:41:05+0100 export ROSE_ORIG_HOST=puma
[INFO] 2017-06-21T13:41:05+0100 export ROSE_VERSION=2016.11.1
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/bin
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/suite-setup.rc
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/rose-suite.info
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/Jinja2Filters
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/meta
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/app
[INFO] 2017-06-21T13:41:05+0100 install: suite-setup.rc
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/suite-setup.rc
[INFO] 2017-06-21T13:41:05+0100 create: meta
[INFO] 2017-06-21T13:41:05+0100 rsync -a —exclude=.* —timeout=1800 —rsh=ssh\ -oBatchMode=yes —checksum /home/jfgu/roses/u-an039/meta/ meta
[INFO] 2017-06-21T13:41:05+0100 install: meta
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/meta
[INFO] 2017-06-21T13:41:05+0100 install: rose-suite.info
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/rose-suite.info
[INFO] 2017-06-21T13:41:05+0100 create: bin
[INFO] 2017-06-21T13:41:05+0100 rsync -a —exclude=.* —timeout=1800 —rsh=ssh\ -oBatchMode=yes —checksum /home/jfgu/roses/u-an039/bin/ bin
[INFO] 2017-06-21T13:41:05+0100 install: bin
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/bin
[INFO] 2017-06-21T13:41:05+0100 create: Jinja2Filters
[INFO] 2017-06-21T13:41:05+0100 rsync -a —exclude=.* —timeout=1800 —rsh=ssh\ -oBatchMode=yes —checksum /home/jfgu/roses/u-an039/Jinja2Filters/ Jinja2Filters
[INFO] 2017-06-21T13:41:05+0100 install: Jinja2Filters
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/Jinja2Filters
[INFO] 2017-06-21T13:41:05+0100 create: app
[INFO] 2017-06-21T13:41:05+0100 rsync -a —exclude=.* —timeout=1800 —rsh=ssh\ -oBatchMode=yes —checksum /home/jfgu/roses/u-an039/app/ app
[INFO] 2017-06-21T13:41:05+0100 install: app
[INFO] 2017-06-21T13:41:05+0100 source: /home/jfgu/roses/u-an039/app
[INFO] 2017-06-21T13:41:06+0100 install: suite.rc
[INFO] 2017-06-21T13:41:06+0100 cylc get-directory u-an039
[INFO] 2017-06-21T13:41:06+0100 cylc refresh —unregister
[INFO] 2017-06-21T13:41:07+0100 cylc unregister u-an039
[INFO] 2017-06-21T13:41:07+0100 0 suite(s) unregistered.
[INFO] 2017-06-21T13:41:07+0100 cylc register u-an039 /home/jfgu/cylc-run/u-an039
[INFO] 2017-06-21T13:41:08+0100 REGISTER u-an039: /home/jfgu/cylc-run/u-an039
[INFO] 2017-06-21T13:41:08+0100 symlink: /home/jfgu/cylc-run/u-an039 ⇐ /home/jfgu/.cylc/u-an039
[INFO] 2017-06-21T13:41:08+0100 cylc validate -v —strict u-an039
[INFO] 2017-06-21T13:41:09+0100 Loading site/user config files
[INFO] 2017-06-21T13:41:09+0100 Reading file /home/fcm/cylc-6.11.4/conf/global.rc
[INFO] 2017-06-21T13:41:09+0100 Reading file /home/jfgu/cylc-run/u-an039/suite.rc
[INFO] 2017-06-21T13:41:09+0100 Processing with Jinja2
[INFO] 2017-06-21T13:41:09+0100 Writing file /home/jfgu/cylc-run/u-an039/suite.rc.processed
[INFO] 2017-06-21T13:41:09+0100 2017-06-21T13:41:09+01 INFO - Expanding [runtime] namespace lists and parameters
[INFO] 2017-06-21T13:41:09+0100 2017-06-21T13:41:09+01 INFO - Parsing the runtime namespace hierarchy
[INFO] 2017-06-21T13:41:09+0100 2017-06-21T12:41:09Z INFO - Parsing [special tasks]
[INFO] 2017-06-21T13:41:09+0100 2017-06-21T12:41:09Z INFO - Parsing the dependency graph
[INFO] 2017-06-21T13:41:09+0100 2017-06-21T12:41:09Z INFO - Configuring internal queues
[INFO] 2017-06-21T13:41:09+0100 2017-06-21T12:41:09Z INFO - Checking for defined tasks not used in the graph
[INFO] 2017-06-21T13:41:09+0100 2017-06-21T12:41:09Z INFO - Checking [visualization] node attributes
[INFO] 2017-06-21T13:41:09+0100 Instantiating tasks to check trigger expressions
[INFO] 2017-06-21T13:41:09+0100 + atmos_atmos.20000101T0000Z ok
[INFO] 2017-06-21T13:41:09+0100 + atmos_recon.20000101T0000Z ok
[INFO] 2017-06-21T13:41:09+0100 + fcm_make2.20000101T0000Z ok
[INFO] 2017-06-21T13:41:09+0100 + fcm_make.20000101T0000Z ok
[INFO] 2017-06-21T13:41:09+0100 Valid for cylc-6.11.4
[INFO] 2017-06-21T13:41:09+0100 WARNING: deprecated items were automatically upgraded in 'suite definition':
[INFO] 2017-06-21T13:41:09+0100 * (6.11.0) [cylc][event hooks] → [cylc][events] - value unchanged
[INFO] 2017-06-21T13:41:09+0100 * (6.11.0) [runtime][root][event hooks] → [runtime][root][events] - value unchanged
[INFO] 2017-06-21T13:41:09+0100 * (6.11.0) [runtime][XC30][job submission] → [runtime][XC30][job] - value unchanged
[INFO] 2017-06-21T13:41:09+0100 * (6.11.0) [runtime][LINUX][job submission] → [runtime][LINUX][job] - value unchanged
[INFO] 2017-06-21T13:41:09+0100 * (6.11.0) [runtime][XC30][job][method] → [runtime][XC30][job][batch system] - value unchanged
[INFO] 2017-06-21T13:41:09+0100 * (6.11.0) [runtime][LINUX][job][method] → [runtime][LINUX][job][batch system] - value unchanged
[INFO] 2017-06-21T13:41:09+0100 cylc get-config -ao -i [remote]owner -i [remote]host u-an039
[INFO] 2017-06-21T13:41:10+0100 ssh -oBatchMode=yes login.archer.ac.uk bash —login -c \'ROSE_VERSION=2016.11.1\ rose\ suite-run\ -v\ -v\ —name=u-an039\ —new\ —run=run\ —remote=uuid=b0b800be-b583-4874-937c-2c666e9a6d14,root-dir=$DATADIR\'

Jian-Feng

Changed 2 years ago by jfgu

comment:5 Changed 2 years ago by ros

Hi Jian-Feng,

Have you tried resubmitting this suite since PUMA came back up today? If you haven't already done so you will need to restart your ssh-agent as per the email Andy sent to the PUMA mailing list earlier today.

Regards,
Ros.

comment:6 Changed 2 years ago by jfgu

Hi Ros, it works now. Thank you very much!

Jian-Feng

comment:7 Changed 2 years ago by ros

  • Component changed from UM Model to Rose
  • Platform set to ARCHER
  • Resolution set to fixed
  • Status changed from new to closed
  • UM Version changed from <select version> to 10.6

Great. Thanks for letting us know. I shall close this query now.

Cheers,
Ros.

Note: See TracTickets for help on using tickets.