UMUI panel subindep_SubmitMethod and subindep_ResQsub differs XC30 and XC40
|Reported by:||markr||Owned by:||ros|
|Keywords:||aprun, placement, openmp||Cc:|
using the XC40 (Monsoon) I see that the qsub panel offers different choices to that of the XC30 (Archer) I presume that is due to the differing number of cores per node (24 archer and 32 monsoon).
I would like to be able to control this value for Monsoon as well so that I can show a "pure" OpenMP effect for 1,2,3,4,5,6,7,8 threads while keeping the MPI distribution the same.
Also I note that the advice is to activate hyperthreads. I have found this detrimental to the cases I use by about 20% slower.
This also interferes with the OpenMP performance.
When I deactivate HT then the code takes less time and the OpenMP performance is closer to ideal for the sections I am analyzing.
Have you any advice on how I could provide a variation on the aprun command? I presume it is set per version per machine within the um account scripts.