2,684
edits
(→Xeon Phi (Knights Landing, KNL): updates) |
|||
Line 49: | Line 49: | ||
The benchmark was run on a single KNL7210 processor (256 cores) set to quadrant mode and using the MCDRAM as cache. XDS was compiled with the -xMIC-AVX512 option of ifort. This gives | The benchmark was run on a single KNL7210 processor (256 cores) set to quadrant mode and using the MCDRAM as cache. XDS was compiled with the -xMIC-AVX512 option of ifort. This gives | ||
COLSPOT: elapsed wall-clock time 48.3 sec | COLSPOT: elapsed wall-clock time 48.3 sec | ||
INTEGRATE: total elapsed wall-clock time 61.2 sec | INTEGRATE: total elapsed wall-clock time 61.2 sec | ||
when run with MAXIMUM_NUMBER_OF_JOBS=16 and MAXIMUM_NUMBER_OF_PROCESSORS=16. These parameters, as well as the KNL setup could still be optimized. | when run with MAXIMUM_NUMBER_OF_JOBS=16 and MAXIMUM_NUMBER_OF_PROCESSORS=16. These parameters, as well as the KNL setup could still be optimized. | ||
Update Feb 21, 2017 using XDS BUILT=20161205 | |||
INIT: elapsed wall-clock time 33.4 sec | |||
COLSPOT: elapsed wall-clock time 49.3 sec | |||
INTEGRATE: total elapsed wall-clock time 59.8 sec | |||
Now using Dectris' library (v. 20170215) with <code>LIB=/usr/local/lib64/dectris-neggia.so</code>: | |||
INIT: elapsed wall-clock time 30.4 sec | |||
COLSPOT: elapsed wall-clock time 40.7 sec | |||
INTEGRATE: total elapsed wall-clock time 52.9 sec | |||
Now additionally running with <code>numactl --preferred=1 xds_par</code> after having modified the forkintegrate script such that it starts mintegrate_par with the same numactl parameters: | |||
INIT.LP: elapsed wall-clock time 29.8 sec | |||
COLSPOT: elapsed wall-clock time 40.0 sec | |||
INTEGRATE: total elapsed wall-clock time 51.3 sec | |||
This was running with a 8GB/8GB split MCDRAM. The same run, but with 8 JOBs and 32 PROCESSORS, takes | |||
INIT.LP: elapsed wall-clock time 25.3 sec | |||
COLSPOT: elapsed wall-clock time 40.1 sec | |||
INTEGRATE: total elapsed wall-clock time 53.1 sec | |||
== Troubleshooting == | == Troubleshooting == |