2,684
edits
Line 53: | Line 53: | ||
when run with MAXIMUM_NUMBER_OF_JOBS=16 and MAXIMUM_NUMBER_OF_PROCESSORS=16. These parameters, as well as the KNL setup could still be optimized. | when run with MAXIMUM_NUMBER_OF_JOBS=16 and MAXIMUM_NUMBER_OF_PROCESSORS=16. These parameters, as well as the KNL setup could still be optimized. | ||
Update Feb 21, 2017 using XDS BUILT=20161205 | Update Feb 21, 2017 using XDS BUILT=20161205, and the CentOS-7.3 default kernel 3.10.0-514.6.1.el7: | ||
INIT: elapsed wall-clock time 33.4 sec | INIT: elapsed wall-clock time 33.4 sec | ||
COLSPOT: elapsed wall-clock time 49.3 sec | COLSPOT: elapsed wall-clock time 49.3 sec | ||
INTEGRATE: total elapsed wall-clock time 59.8 sec | INTEGRATE: total elapsed wall-clock time 59.8 sec | ||
Using a pre-release library that makes use of the <code>LIB=</code> [http://homes.mpimf-heidelberg.mpg.de/~kabsch/xds/html_doc/xds_parameters.html#LIB= option] of XDS: | |||
INIT: elapsed wall-clock time 30.4 sec | INIT: elapsed wall-clock time 30.4 sec | ||
COLSPOT: elapsed wall-clock time 40.7 sec | COLSPOT: elapsed wall-clock time 40.7 sec | ||
Line 73: | Line 73: | ||
COLSPOT: elapsed wall-clock time 38.6 sec | COLSPOT: elapsed wall-clock time 38.6 sec | ||
INTEGRATE: total elapsed wall-clock time 53.2 sec | INTEGRATE: total elapsed wall-clock time 53.2 sec | ||
Now setting the KNL to SNC4 mode, and the MCDRAM to cache (using it in flat mode is impractical because the --preferred argument takes only 1 argument | Now setting the KNL to SNC4 mode, and the MCDRAM to cache (using it in flat mode is impractical because the --preferred argument takes only 1 argument; to determine the correct argument requires scripting): | ||
INIT.LP: elapsed wall-clock time 29.6 sec | INIT.LP: elapsed wall-clock time 29.6 sec | ||
COLSPOT.LP: elapsed wall-clock time 37.8 sec | COLSPOT.LP: elapsed wall-clock time 37.8 sec | ||
INTEGRATE: total elapsed wall-clock time | INTEGRATE: total elapsed wall-clock time 49.6 sec | ||
Conclusions: since INIT benefits from more PROCESSORs, one could run XDS twice for fastest turnaround; the first run with JOBS=XYCORR INIT and a high number of processors (99 is maximum). The second run with JOB=COLSPOT IDXREF DEFPIX INTEGRATE CORRECT, and an optimized JOBS/PROCESSORS combination. The SNC4 mode is indeed fastest - to do better than the cache mode of the MCDRAM, one needs to adapt the forkcolspot and forkintegrate script- see [[Performance]]. | |||
== Troubleshooting == | == Troubleshooting == |