ISa: Difference between revisions

From XDSwiki
Jump to navigation Jump to search
No edit summary
Practical considerations: electron dffraction
 
(21 intermediate revisions by the same user not shown)
Line 1: Line 1:
CORRECT is the scaling step of XDS.
CORRECT is the scaling step of XDS. It also re-refines the geometry of the experiment (cell parameters, crystal orientation, distance, beam direction and spindle direction) and gives all relevant statistics. The main outputs of this step are XDS_ASCII.HKL (reflection file) and CORRECT.LP (logfile), but there is a number of other [http://xds.mpimf-heidelberg.mpg.de/html_doc/xds_files.html files] that are also produced for diagnostic purposes.


==Assessing the overall quality of an experimental setup==
==An estimate for the overall quality of an experimental setup==
A useful Unix command for evaluating the overall quality of an experimental setup is
awk '/CORRECTION PARAMETERS/{for (i=0;i<9;++i) getline;print 1/($1*$2)^0.5}' CORRECT.LP
Why? Because this will give you the best (maximum) value of I/sigma(I) for any reflection in your dataset - even if your crystal is great, all reflections are bound to be worse than that.


Why does the command give you such a useful value? It just finds the line "CORRECTION PARAMETERS FOR THE STANDARD ERROR OF REFLECTION INTENSITIES", skips the next 9 lines, and grabs the values of "a" and "b" from [[CORRECT.LP]]. These values appear in the formula v(I)=a*(v0(I)+b*I^2) which is used by CORRECT to adjust the variances of the intensities, to match their experimental spread. For strong and well-measured reflections, the variance is dominated by the systematic error that is introduced by any beam or detector instability. For weak reflections, v0(I), the variance from counting statistics, dominates. The minimum value for v(I) this formula gives
A single number that depends on the overall quality of an experimental setup (beam, crystal, spindle, detector, cryo, software, ...) is the upper limit of I/sigma(I) for any reflection in your dataset - even if your crystal is great, all reflections are bound to be worse than that.  


If your crystal is badly split or broken, then of course a and b will be strongly influenced by the crystal. If your crystal is good (and no matter how good your crystal is!), then a and b will reflect the quality of the beamline.
This number is called '''I/Sigma(I)<sup>asymptotic</sup>''' ([Diederichs, Acta Cryst. (2010). D66, 733-740 http://dx.doi.org/10.1107/S0907444910014836])


I personally have seen values lower than 20 for good crystals at bad beamlines. On the bright side, I have also seen a value of 87 for Z. Dauter's 0.98A Proteinase K sulfur-SAD data from J. Holton's APS/22-ID beamline.
What is that number? Scaling procedures (like the ones used in XDS, SCALA, SCALEPACK and DSCALEAVERAGE) scale (or rather, inflate) the variances of individual observations such that they match the experimental spread of symmetry-related observations. To this end, two contributions to the variance v(I) of a reflection are considered: the first component is random error, and the other component is systematic error. The two values a and b appearing in the variance-scaling formula v(I)=a*(v0(I)+b*I<sup>2</sup>) are printed out by CORRECT. a scales the random error component, a*b scales the systematic error component. For strong and well-measured reflections, the variance is dominated by the systematic error a*b* I<sup>2</sup> that is introduced by any beam /spindle / detector /cryo or other instability or malfunction. For weak reflections, a*v0(I), the variance from counting statistics, dominates.
 
Versions of XDS since May 10, 2010 print out I/Sigma(I)<sup>asymptotic</sup> = 1/&radic;(a*b) as "ISa". ISa is the I/sigma of an infinitely strong reflection. If there were no systematic error, ISa would be infinite. In the presence of systematic error, ISa is finite and is the upper limit of I/sigma of any observation in your dataset.
 
As you can see from the formula, low values of a and b are ''good'' in the sense that a high upper limit of I/sigma(I) results. If e.g. the crystal is badly split or broken, or reflections are too close on the detector, or the data reduction is not good (wrong parameters), then the values of a and b are elevated.
 
If your crystal is good, then a and b will reflect the quality of the other components of the experimental setup (e.g. beamline stability).
 
ISa is well suited to judge the quality of the experimental setup, because its value does not depend on random error, whereas the low-resolution R<sub>meas</sub> does, and is thus influenced by crystal size and exposure. If you see a high value of the low-resolution R<sub>meas</sub>, you don't know if it is high because the crystal diffracted weakly, or because the beamline was broken. Conversely, a low value of ISa indicates that something is broken, no matter how big the crystal is or how it was exposed.
 
== Practical considerations ==
In practice, both crystal quality and beamline quality limit the value of I/Sigma(I)<sup>asymptotic</sup>. A good crystal (even with elevated mosaicity and medium resolution) should give a high value on a good beamline.
I have seen values around 15-20 for good crystals that allowed me to solve a MAD structure, but that required high multiplicity of observations. Values around 30 allowed me to solve a sulfur-SAD structure at medium resolution (diffraction to 2.3 A, anomalous signal to 3 A).
I have also seen a value around 40 for Z. Dauter's 0.98A Proteinase K (2ID8) sulfur-SAD data from the APS/22-ID beamline, and recently even higher values were obtained at the SLS, beamline X06SA, with a Pilatus detector.
 
On the other hand, I have sometimes obtained values less than 10 with good test crystals, clearly indicating strong systematic errors. It is always good to discuss this with the people who are responsible for the beamline. They might know what is broken, or might be able to find out what went wrong.
 
'''Values of ISa below 5 are almost always a sign that something is severely wrong, and that the data are useless. Possible reasons are: wrong geometry specified in XDS.INP, wrong spacegroup (too high symmetry), high radiation damage, mechanical deficiencies during measurement.'''
 
A high systematic error may be partly compensated by high multiplicity. Conversely, high multiplicity is not needed to solve a structure, if the data have a high I/Sigma(I)<sup>asymptotic</sup>.
For molecular replacement and refinement, a high value of I/Sigma(I)<sup>asymptotic</sup> is not strictly needed (but the maps are better with better data, and the model R values lower!).
 
Please note that for electron diffraction, there is a systematic disagreement between symmetry-related reflections due to dynamic diffraction. Therefore one expects lower values of ISa for electron diffraction than for X-ray and neutron diffraction.

Latest revision as of 07:54, 30 May 2025

CORRECT is the scaling step of XDS. It also re-refines the geometry of the experiment (cell parameters, crystal orientation, distance, beam direction and spindle direction) and gives all relevant statistics. The main outputs of this step are XDS_ASCII.HKL (reflection file) and CORRECT.LP (logfile), but there is a number of other files that are also produced for diagnostic purposes.

An estimate for the overall quality of an experimental setup

A single number that depends on the overall quality of an experimental setup (beam, crystal, spindle, detector, cryo, software, ...) is the upper limit of I/sigma(I) for any reflection in your dataset - even if your crystal is great, all reflections are bound to be worse than that.

This number is called I/Sigma(I)asymptotic ([Diederichs, Acta Cryst. (2010). D66, 733-740 http://dx.doi.org/10.1107/S0907444910014836])

What is that number? Scaling procedures (like the ones used in XDS, SCALA, SCALEPACK and DSCALEAVERAGE) scale (or rather, inflate) the variances of individual observations such that they match the experimental spread of symmetry-related observations. To this end, two contributions to the variance v(I) of a reflection are considered: the first component is random error, and the other component is systematic error. The two values a and b appearing in the variance-scaling formula v(I)=a*(v0(I)+b*I2) are printed out by CORRECT. a scales the random error component, a*b scales the systematic error component. For strong and well-measured reflections, the variance is dominated by the systematic error a*b* I2 that is introduced by any beam /spindle / detector /cryo or other instability or malfunction. For weak reflections, a*v0(I), the variance from counting statistics, dominates.

Versions of XDS since May 10, 2010 print out I/Sigma(I)asymptotic = 1/√(a*b) as "ISa". ISa is the I/sigma of an infinitely strong reflection. If there were no systematic error, ISa would be infinite. In the presence of systematic error, ISa is finite and is the upper limit of I/sigma of any observation in your dataset.

As you can see from the formula, low values of a and b are good in the sense that a high upper limit of I/sigma(I) results. If e.g. the crystal is badly split or broken, or reflections are too close on the detector, or the data reduction is not good (wrong parameters), then the values of a and b are elevated.

If your crystal is good, then a and b will reflect the quality of the other components of the experimental setup (e.g. beamline stability).

ISa is well suited to judge the quality of the experimental setup, because its value does not depend on random error, whereas the low-resolution Rmeas does, and is thus influenced by crystal size and exposure. If you see a high value of the low-resolution Rmeas, you don't know if it is high because the crystal diffracted weakly, or because the beamline was broken. Conversely, a low value of ISa indicates that something is broken, no matter how big the crystal is or how it was exposed.

Practical considerations

In practice, both crystal quality and beamline quality limit the value of I/Sigma(I)asymptotic. A good crystal (even with elevated mosaicity and medium resolution) should give a high value on a good beamline.

I have seen values around 15-20 for good crystals that allowed me to solve a MAD structure, but that required high multiplicity of observations. Values around 30 allowed me to solve a sulfur-SAD structure at medium resolution (diffraction to 2.3 A, anomalous signal to 3 A). I have also seen a value around 40 for Z. Dauter's 0.98A Proteinase K (2ID8) sulfur-SAD data from the APS/22-ID beamline, and recently even higher values were obtained at the SLS, beamline X06SA, with a Pilatus detector.

On the other hand, I have sometimes obtained values less than 10 with good test crystals, clearly indicating strong systematic errors. It is always good to discuss this with the people who are responsible for the beamline. They might know what is broken, or might be able to find out what went wrong.

Values of ISa below 5 are almost always a sign that something is severely wrong, and that the data are useless. Possible reasons are: wrong geometry specified in XDS.INP, wrong spacegroup (too high symmetry), high radiation damage, mechanical deficiencies during measurement.

A high systematic error may be partly compensated by high multiplicity. Conversely, high multiplicity is not needed to solve a structure, if the data have a high I/Sigma(I)asymptotic. For molecular replacement and refinement, a high value of I/Sigma(I)asymptotic is not strictly needed (but the maps are better with better data, and the model R values lower!).

Please note that for electron diffraction, there is a systematic disagreement between symmetry-related reflections due to dynamic diffraction. Therefore one expects lower values of ISa for electron diffraction than for X-ray and neutron diffraction.