Changes between Version 28 and Version 29 of RoseCylc/Hints

07/11/15 15:33:53 (2 years ago)


    33= Useful information for running with Rose =  
     5== Links ==  
    57See also (redirects to Collaboration Twiki):   
    810* [ Rose FAQ] 
     12== Hints and tips ==  
    1014=== Switching versions of Rose and/or cylc === 
    7882* Or add {{{args=-vvv}}} at the top of the fcm_make {{{rose-app.conf}}} file. 
    80 === Rose tip of the day ===  
    82 For more Rose hints see the "tip of the day" from the Rose team: 
     84== Troubleshooting common errors ==  
     86=== Rosie go asks for "username for u" === 
     88By default, Rosie is set up to load suites from the local puma repository and the Met Office Science Repository Service (MOSRS).  
     89If your MOSRS password isn't cached, Rosie will prompt for it at startup. Clicking 'cancel' then produces an error:  
     91Traceback (most recent call last): 
     92  File "/home/fcm/rose-2015.04.1/lib/python/rosie/browser/", line 994, in handle_update_treemodel_local_status 
     93    self.display_box.update_treemodel_local_status(local_suites, 
     94AttributeError: 'MainWindow' object has no attribute 'display_box' 
     95get_known_keys: {} 
     98There are two potential solutions:  
     1001. Re-cache your MOSRS password  
     1022. Tell Rosie to only load puma suites:  
     104   {{{rosie go --prefix=puma}}} 
     106   Users who don't have a MOSRS account may wish to set this up as an alias.   
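The alias mentioned above could look like the following sketch. The alias name {{{rosie_puma}}} is a hypothetical choice, and placing the line in {{{~/.bashrc}}} assumes a bash login shell.

```shell
# Hypothetical alias name; add this line to ~/.bashrc (or your shell's equivalent).
# The command itself is the one given above.
alias rosie_puma='rosie go --prefix=puma'
```

After starting a new shell, typing {{{rosie_puma}}} would then launch Rosie with only the puma prefix loaded.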
     108=== Unable to submit jobs (MONSooN) ===  
     110The suite will fail straight away and the following error appears in the {{{log/suite/err}}} file:  
     112Host key verification failed. 
     1132015-01-21T14:56:23Z ERROR - [fcm_make.1] -Failed to construct job submission command 
     1142015-01-21T14:56:23Z WARNING - Command '['ssh', '-oBatchMode=yes', '-oConnectTimeout=10', 'exvmsrose', 'mkdir -p "$HOME/cylc-run/nemovar_build" "$HOME/cylc-run/nemovar_build/log/job"']' returned non-zero exit status 255 
     1172015-01-21T14:56:23Z ERROR - [fcm_make.1] -submission failed  
     120This happens because the Cylc VM cannot ssh into the Rose VM non-interactively: the host key has not yet been accepted into the known_hosts file.  
     122To fix this, log in to the Cylc VM and then back to the Rose VM, specifying the full paths, so that both hosts are added to the known_hosts file.  
     1241. Check whether exvmscylc or exvmsrose already appear in the known_hosts file. If so, delete these entries, especially if you accessed the VMs before their rebuild:  
     126cd .ssh 
     127mv known_hosts known_hosts.OLD 
     128sed '/^exvmsrose/d;/exvmscylc/d' known_hosts.OLD > known_hosts 
     1312. Now from exvmsrose, ssh into exvmscylc using the full path:  
     135  This should produce output like this:  
     137The authenticity of host ' (' can't be established. 
     138RSA key fingerprint is 98:c8:5e:b9:b3:d2:2f:c4:9c:89:78:08:d6:78:70:3a. 
     139Are you sure you want to continue connecting (yes/no)?  
     141  Type {{{yes}}}.   
     1433. Now from exvmscylc, log in to exvmsrose using the full path:  
     147  And again type {{{yes}}} at the prompt.  
     1494. Type {{{exit}}} to get back to the Cylc VM, then ssh into exvmsrose again. This should now succeed without any interactive prompts.  
     1515. Now type {{{exit}}} twice to get back to the original Rose terminal, then try re-submitting the Rose suite.  
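Before editing the real file, the known_hosts filter from step 1 can be tried on a scratch copy. The following is a minimal sketch; the sample entries and key strings are made up for illustration, and only the {{{sed}}} expression comes from the steps above.

```shell
# Try the step-1 filter on a scratch copy rather than on ~/.ssh/known_hosts.
# The three entries below are invented sample data.
scratch=$(mktemp -d)
cat > "$scratch/known_hosts.OLD" <<'EOF'
exvmsrose ssh-rsa AAAAexamplekey1
exvmscylc ssh-rsa AAAAexamplekey2
otherhost ssh-rsa AAAAexamplekey3
EOF

# Same filter as step 1: drop lines starting with exvmsrose
# and any line containing exvmscylc.
sed '/^exvmsrose/d;/exvmscylc/d' "$scratch/known_hosts.OLD" > "$scratch/known_hosts"

# Only the unrelated host should be left.
remaining=$(cat "$scratch/known_hosts")
echo "$remaining"
```

As an alternative to the {{{sed}}} approach, OpenSSH provides {{{ssh-keygen -R <hostname>}}}, which removes all keys belonging to a host from the known_hosts file.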
     153=== No gcylc window ===  
     155When submitting a job, no gcylc window appears.  
     157Sometimes the GUI is slow to load. If it does not appear at all, however, check that you have X11 forwarding set up from your **initial location and the lander**.  
     159To do so, ssh with the {{{-Y}}} option or, alternatively, append the following lines to your {{{~/.ssh/config}}} file:  
     161Host * 
     162ForwardX11 yes 
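One quick way to see whether forwarding is active is to check the {{{DISPLAY}}} variable on each host in the chain. The sketch below assumes a POSIX shell; {{{check_x11}}} is a hypothetical helper name, not part of Rose or cylc.

```shell
# Report whether an X display is visible in the current shell.
# check_x11 is a hypothetical helper, not a Rose or cylc command.
check_x11() {
    if [ -n "${DISPLAY:-}" ]; then
        echo "DISPLAY is set to $DISPLAY"
    else
        echo "DISPLAY is not set; X11 forwarding is probably not active"
        return 1
    fi
}
```

Running {{{check_x11}}} after each ssh hop shows where the forwarding is lost. An empty {{{DISPLAY}}} on the lander usually means the first hop was made without {{{-Y}}}.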
     165=== Rose suite running but can't shutdown === 
     167A rose suite is supposedly running, i.e. {{{rose suite-scan}}} gives something like:  
     169puma-aa046 gmslis@exvmscylc:7767  
     171Or trying to re-run the suite with {{{rose suite-run}}} gives an error:  
     173[FAIL] Suite "puma-aa046" may still be running. 
     174[FAIL] Host "exvmscylc" has process: 
     175[FAIL]     9468 python /home/fcm/cylc-6.1.2/bin/cylc-run puma-aa046 
     176[FAIL]     9469 python /home/fcm/cylc-6.1.2/bin/cylc-run puma-aa046 
     177[FAIL] Try "rose suite-shutdown --name=puma-aa046" first?  
     180However, when trying to shut down the suite, {{{rose suite-stop}}} reports that the suite isn't running:  
     182Really shutdown puma-aa046 at exvmscylc? [y/n] y 
     184'ERROR, remote port file not found'  
     187This is due to orphaned tasks on the Cylc VM, which can occur when exvmscylc and exvmsrose cannot communicate non-interactively.  
     189To fix this, log in to exvmscylc and run {{{cylc scan}}}, which should show the running tasks. To stop them, type:  
     191cylc shutdown --now 
     193This may report something like "Command queued", but re-running {{{cylc scan}}} will show that the tasks are now finished.  
     195=== Can't run rose suite-log on MONSooN ===  
     197On the MONSooN Rose VM (exvmsrose), running {{{rose suite-log}}} may do nothing.  
     199Instead, run:  
     201firefox http://localhost:8080/ 
     203Then search for the suite ID.  
     205=== Device or resource busy when running suite === 
     207Unable to run suite.  
     210exmsrose puma-aa045$ rose suite-run 
     211[INFO] create: log.20150121T164500Z 
     212[INFO] delete: log 
     213[INFO] symlink: log.20150121T164500Z <= log 
     214[INFO] log.20150121T163546Z.tar.gz <= log.20150121T163546Z 
     215[FAIL] [Errno 16] Device or resource busy: 'log.20150121T163546Z/job/1/fcm_make/01/.nfs0000000000451b5d00000065' 
     218One of the output files is open somewhere, which means Rose can't archive the old output. Close the file and run the suite again.  
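The {{{.nfs...}}} name in the error is a placeholder that NFS leaves behind when a file is deleted while something still has it open. A minimal sketch for locating such leftovers is shown below; {{{list_nfs_leftovers}}} is a hypothetical helper name.

```shell
# List .nfs* placeholder files under a directory.
# list_nfs_leftovers is a hypothetical helper, not a Rose command.
list_nfs_leftovers() {
    find "$1" -name '.nfs*' -type f
}
```

For example, {{{list_nfs_leftovers ~/cylc-run/<suite-name>}}} prints the paths of any leftovers. If {{{lsof}}} or {{{fuser}}} is installed (an assumption about the VM), running either on one of those paths shows which process still holds the file open.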
     220=== Warning when opening gcylc ===  
     222A warning appears when the Rose/cylc run-time task manager, called gcylc, opens:  
     225ParseError: File not found: /home/annette/.cylc/gcylc.rc 
     226WARNING: user config parsing failed (continuing) 
     229This is harmless, but to avoid it, create an empty file in your home space:  
     231touch ~/.cylc/gcylc.rc 
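If the {{{~/.cylc}}} directory does not exist yet, {{{touch}}} on its own will fail. The sketch below creates the directory first; {{{ensure_gcylc_rc}}} is a hypothetical helper name, and its optional argument exists only so that it can be tried against a scratch directory.

```shell
# Create ~/.cylc if needed, then make sure gcylc.rc exists.
# ensure_gcylc_rc is a hypothetical helper; pass a directory to use it
# in place of $HOME for a dry run.
ensure_gcylc_rc() {
    home_dir=${1:-$HOME}
    mkdir -p "$home_dir/.cylc" && touch "$home_dir/.cylc/gcylc.rc"
}
```

Calling {{{ensure_gcylc_rc}}} with no argument acts on your real home directory. An existing {{{gcylc.rc}}} keeps its contents, since {{{touch}}} only updates the timestamp.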
     235=== .vimrc error with fcm commit ===  
     237When trying to commit changes to a rose suite the following error occurs: 
     239exmsrose puma-aa045$ fcm commit 
     240[info] vi: starting commit message editor... 
     241Error detected while processing /home/aospre/.vimrc: 
     242line    5: 
     243E518: Unknown option: foldlevelstart=99 
     244Press ENTER or type command to continue 
     245[FAIL] log message is empty 
     248This error occurs with the Cylc syntax highlighting for Vim. Changing the default FCM editor from vi to vim stops it.  
     250In your {{{.profile}}} add the following line: 
     252export SVN_EDITOR=vim 
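To avoid adding the line more than once, it can be appended only when it is missing. This is a minimal sketch; {{{add_svn_editor}}} is a hypothetical helper name, and the optional argument lets it be tried on a scratch file instead of the real {{{.profile}}}.

```shell
# Append the export line to a profile file only if it is not already there.
# add_svn_editor is a hypothetical helper, not part of FCM or Rose.
add_svn_editor() {
    profile=${1:-$HOME/.profile}
    line='export SVN_EDITOR=vim'
    # -x matches whole lines, -F treats the text literally, -q suppresses output.
    if ! grep -qxF "$line" "$profile" 2>/dev/null; then
        printf '%s\n' "$line" >> "$profile"
    fi
}
```

The setting takes effect in new login shells, or immediately after sourcing the profile file.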
     255=== Jinja error from rose suite-run ===  
     257After editing the suite, a cryptic Jinja error message appears from {{{rose suite-run}}}:  
     260[FAIL] cylc validate -v --strict puma-aa069 # return-code=1, stderr= 
     261[FAIL] Jinja2 Error: 
     262[FAIL]   File "<unknown>", line 58, in template 
     263[FAIL] TemplateSyntaxError: expected token 'end of print statement', got '=' 
     266This is caused by an error in the {{{suite.rc}}} file, in either the Jinja syntax or the Rose variables.  
     268To debug, go to {{{~/cylc-run/<suite-name>}}}, open the {{{suite.rc}}} file and navigate to the line number causing the error.  
     270If the {{{suite.rc}}} file uses includes, generate the parsed file by running:  
     272cylc view -i <suite-name> 
     275After identifying the error, fix it in the original {{{suite.rc}}} or {{{rose-suite.conf}}} file in the {{{roses}}} directory.  
     276Editing the file in the {{{cylc-run}}} directory will have no effect!
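To jump straight to the reported line without opening an editor, the lines around it can be printed with their numbers. The sketch below is one way to do this; {{{show_context}}} is a hypothetical helper name, and the default of three lines either side is an arbitrary choice.

```shell
# Print the lines around a given line number, each prefixed with its number.
# show_context is a hypothetical helper: show_context FILE LINE [RADIUS]
show_context() {
    file=$1
    line=$2
    radius=${3:-3}
    start=$((line - radius))
    if [ "$start" -lt 1 ]; then
        start=1
    fi
    end=$((line + radius))
    # Print only the lines inside the window, with their line numbers.
    awk -v s="$start" -v e="$end" \
        'NR >= s && NR <= e { printf "%d: %s\n", NR, $0 }' "$file"
}
```

For the error shown above, {{{show_context ~/cylc-run/<suite-name>/suite.rc 58}}} would print lines 55 to 61 of the installed {{{suite.rc}}}.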