Opened 9 years ago
Closed 9 years ago
#883 closed error (fixed)
Job submission failure to Hector
| Reported by: | watson | Owned by: | ros |
|---|---|---|---|
| Component: | UMUI | Keywords: | HECToR |
| Cc: | | Platform: | |
| UM Version: | 6.6.3 | | |
Description (last modified by ros)
Hi, I tried submitting a number of jobs to Hector just now using the UMUI and they all gave the following error output:
FCM_MAIN: Submitting umuisubmit_run ...
qsub: script file:: No such file or directory
FCM_MAIN: Submit failed
What is the problem here?
Cheers,
Peter
Change History (9)
comment:1 Changed 9 years ago by ros
comment:2 Changed 9 years ago by watson
Hi Ros,
xhjbp is one such job.
Cheers,
Peter
comment:3 Changed 9 years ago by ros
- Keywords HECToR added
- UM Version changed from <select version> to 6.6.3
Hi Peter,
Have you tried processing and submitting the job again? I've just taken a copy of your job and it submitted to the queue on HECToR just fine.
Could you give it another go and see if you still encounter problems?
Cheers,
Ros.
comment:4 Changed 9 years ago by watson
Hi Ros,
I'm afraid I still get the same error. I also tried logging out of Puma and back in again, to no avail.
Cheers,
Peter
comment:5 Changed 9 years ago by ros
- Owner changed from um_support to ros
- Status changed from new to accepted
Hi Peter,
OK. Can you please confirm that you can ssh from PUMA to HECToR without needing to enter a password or passphrase (i.e. that you have ssh set up correctly)?
Is there any other information output to the UMUI submission output window or to the terminal window from which you started up the UMUI?
Cheers,
Ros.
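One way to run the ssh check requested above without being fooled by an interactive prompt is to ask ssh to fail instead of asking for a password. The following is a minimal sketch, not part of the UMUI or FCM: the function names are hypothetical, and the host string you pass in is whatever you normally use to reach HECToR from PUMA. It relies only on the standard OpenSSH options `BatchMode` and `ConnectTimeout`.

```python
import subprocess


def build_ssh_check_cmd(host, timeout_s=10):
    """Build an ssh command that fails rather than prompting for a password.

    BatchMode=yes disables password/passphrase prompts, so the command only
    succeeds if key-based (or agent-based) login is already working.
    """
    return [
        "ssh",
        "-o", "BatchMode=yes",
        "-o", f"ConnectTimeout={timeout_s}",
        host,
        "true",  # trivial remote command; only the exit status matters
    ]


def passwordless_ssh_works(host, timeout_s=10):
    """Return True if `ssh host true` succeeds without any prompt."""
    try:
        result = subprocess.run(
            build_ssh_check_cmd(host, timeout_s),
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s + 5,
        )
    except (OSError, subprocess.TimeoutExpired):
        # ssh not installed, or the connection hung past the timeout
        return False
    return result.returncode == 0
```

Calling `passwordless_ssh_works("user@host")` with your own login details returns `False` if ssh would have needed to prompt, which is the situation in which a non-interactive submission cannot work.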
comment:6 Changed 9 years ago by watson
Hi Ros,
I can ssh to Hector fine.
The full output in the submission window is:
Calling FCM_MAIN_SCR - local…
(This may take several minutes.)
Checking remote run directory …
FCM_MAIN: Submitting umuisubmit_run …
qsub: script file:: No such file or directory
FCM_MAIN: Submit failed
There is no output in the terminal window.
Cheers,
Peter
comment:7 Changed 9 years ago by ros
Hi Peter,
That's a pain. I would expect there to be more output after the line "FCM_MAIN: Submit failed".
I'll take a look at your .profile and suchlike. Since the job submits fine for me, that points to an environment problem.
Regards,
Ros.
comment:8 Changed 9 years ago by watson
Hi Ros,
I eventually realised that I had hit the quota in my home space on Hector, which I guess meant the submit scripts on Hector couldn't be created. Things had been building up there that I had been oblivious to. Now I've cleared some space, everything seems to be working.
Cheers,
Peter
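For anyone hitting the same symptom, a rough check of how much space a home directory is using can help spot this early. The sketch below is an illustration only, assuming you know your quota limit from the site's own tools; the function names and the 95% threshold are hypothetical. It sums apparent file sizes, whereas a real quota is usually counted in disk blocks, so treat the result as an estimate.

```python
import os


def directory_usage_bytes(root):
    """Sum the apparent sizes of all files under `root` (symlinks not followed)."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                total += os.lstat(path).st_size
            except OSError:
                # File vanished or is unreadable; skip it
                continue
    return total


def is_near_quota(used_bytes, quota_bytes, threshold=0.95):
    """Return True if usage is at or above `threshold` of the quota."""
    return used_bytes >= threshold * quota_bytes
```

Run on the remote machine, `is_near_quota(directory_usage_bytes(os.path.expanduser("~")), quota_bytes)` gives a quick warning, where `quota_bytes` is the limit reported by the system's own quota command.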
comment:9 Changed 9 years ago by ros
- Description modified (diff)
- Resolution set to fixed
- Status changed from accepted to closed
Hi Peter,
Thanks for letting us know you managed to solve this. I'll close this query now.
Regards,
Ros.
Hi Peter,
What is the job id of one of the failing jobs?
Regards,
Ros.