Eiger: Difference between revisions

276 bytes added ,  22 February 2017
Line 53: Line 53:
when run with MAXIMUM_NUMBER_OF_JOBS=16 and MAXIMUM_NUMBER_OF_PROCESSORS=16. These parameters, as well as the KNL setup could still be optimized.
when run with MAXIMUM_NUMBER_OF_JOBS=16 and MAXIMUM_NUMBER_OF_PROCESSORS=16. These parameters, as well as the KNL setup could still be optimized.


Update Feb 21, 2017 using XDS BUILT=20161205
Update Feb 21, 2017 using XDS BUILT=20161205, and the CentOS-7.3 default kernel 3.10.0-514.6.1.el7:
  INIT:            elapsed wall-clock time      33.4 sec
  INIT:            elapsed wall-clock time      33.4 sec
  COLSPOT:        elapsed wall-clock time      49.3 sec
  COLSPOT:        elapsed wall-clock time      49.3 sec
  INTEGRATE: total elapsed wall-clock time      59.8 sec
  INTEGRATE: total elapsed wall-clock time      59.8 sec
Now using Dectris' library (v. 20170215) with <code>LIB=/usr/local/lib64/dectris-neggia.so</code>:
Using a pre-release library that makes use of the <code>LIB=</code> [http://homes.mpimf-heidelberg.mpg.de/~kabsch/xds/html_doc/xds_parameters.html#LIB= option] of XDS:
  INIT:            elapsed wall-clock time      30.4 sec
  INIT:            elapsed wall-clock time      30.4 sec
  COLSPOT:        elapsed wall-clock time      40.7 sec
  COLSPOT:        elapsed wall-clock time      40.7 sec
Line 73: Line 73:
  COLSPOT:        elapsed wall-clock time      38.6 sec
  COLSPOT:        elapsed wall-clock time      38.6 sec
  INTEGRATE: total elapsed wall-clock time      53.2 sec
  INTEGRATE: total elapsed wall-clock time      53.2 sec
Now setting the KNL to SNC4 mode, and the MCDRAM to cache (using it in flat mode is impractical because the --preferred argument takes only 1 argument, but 4 would have to be specified unless heavy scripting were done):
Now setting the KNL to SNC4 mode, and the MCDRAM to cache (using it in flat mode is impractical because the --preferred argument takes only 1 argument; to determine the correct argument requires scripting):
  INIT.LP:        elapsed wall-clock time      29.6 sec
  INIT.LP:        elapsed wall-clock time      29.6 sec
  COLSPOT.LP:      elapsed wall-clock time      37.8 sec  
  COLSPOT.LP:      elapsed wall-clock time      37.8 sec  
  INTEGRATE: total elapsed wall-clock time      50.2 sec
  INTEGRATE: total elapsed wall-clock time      49.6 sec


Conclusion: since INIT benefits from more PROCESSORs, one could run XDS twice for fastest turnaround; the first run with JOBS=XYCORR INIT and a high number of processors (99 is maximum). The second run with JOB=COLSPOT IDXREF DEFPIX INTEGRATE CORRECT, and an optimized JOBS/PROCESSORS combination.
Conclusions: since INIT benefits from more PROCESSORs, one could run XDS twice for fastest turnaround; the first run with JOBS=XYCORR INIT and a high number of processors (99 is maximum). The second run with JOB=COLSPOT IDXREF DEFPIX INTEGRATE CORRECT, and an optimized JOBS/PROCESSORS combination. The SNC4 mode is indeed fastest - to do better than the cache mode of the MCDRAM, one needs to adapt the forkcolspot and forkintegrate script- see [[Performance]].


== Troubleshooting ==
== Troubleshooting ==
2,684

edits