ISa: Difference between revisions

From XDSwiki
Jump to navigation Jump to search
m moved CORRECT to ISa: seems more appropriate
Practical considerations: electron dffraction
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
CORRECT is the scaling step of XDS. It also re-refines the geometry of the experiment (cell parameters, crystal orientation, distance, beam direction and spindle direction) and gives all relevant statistics. The main outputs of this step are XDS_ASCII.HKL (reflection file) and CORRECT.LP (logfile), but there is a number of other [http://www.mpimf-heidelberg.mpg.de/~kabsch/xds/html_doc/xds_files.html files] that are also produced for diagnostic purposes.
CORRECT is the scaling step of XDS. It also re-refines the geometry of the experiment (cell parameters, crystal orientation, distance, beam direction and spindle direction) and gives all relevant statistics. The main outputs of this step are XDS_ASCII.HKL (reflection file) and CORRECT.LP (logfile), but there is a number of other [http://xds.mpimf-heidelberg.mpg.de/html_doc/xds_files.html files] that are also produced for diagnostic purposes.


==An estimate for the overall quality of an experimental setup==
==An estimate for the overall quality of an experimental setup==
Line 5: Line 5:
A single number that depends on the overall quality of an experimental setup (beam, crystal, spindle, detector, cryo, software, ...) is the upper limit of I/sigma(I) for any reflection in your dataset - even if your crystal is great, all reflections are bound to be worse than that.  
A single number that depends on the overall quality of an experimental setup (beam, crystal, spindle, detector, cryo, software, ...) is the upper limit of I/sigma(I) for any reflection in your dataset - even if your crystal is great, all reflections are bound to be worse than that.  


This number is called <math>{I/Sigma(I)}^{asymptotic}</math> ([Diederichs, Acta Cryst. (2010). D66, 733-740 http://dx.doi.org/10.1107/S0907444910014836])
This number is called '''I/Sigma(I)<sup>asymptotic</sup>''' ([Diederichs, Acta Cryst. (2010). D66, 733-740 http://dx.doi.org/10.1107/S0907444910014836])


What is that number? Scaling procedures (like the ones used in XDS, SCALA, SCALEPACK and DSCALEAVERAGE) scale (or rather, inflate) the variances of individual observations such that they match the experimental spread of symmetry-related observations. To this end, two contributions to the variance of a reflection are added: the first component is random error, and the other component is systematic error. The two values a and b appearing in the formula v(I)=a*(v0(I)+b*I^2) are printed out by CORRECT. a scales the random error component, a*b scales the systematic error component. For strong and well-measured reflections, the variance is dominated by the systematic error a*b*I^2 that is introduced by any beam /spindle / detector /cryo or other instability or malfunction. For weak reflections, a*v0(I), the variance from counting statistics, dominates.  
What is that number? Scaling procedures (like the ones used in XDS, SCALA, SCALEPACK and DSCALEAVERAGE) scale (or rather, inflate) the variances of individual observations such that they match the experimental spread of symmetry-related observations. To this end, two contributions to the variance v(I) of a reflection are considered: the first component is random error, and the other component is systematic error. The two values a and b appearing in the variance-scaling formula v(I)=a*(v0(I)+b*I<sup>2</sup>) are printed out by CORRECT. a scales the random error component, a*b scales the systematic error component. For strong and well-measured reflections, the variance is dominated by the systematic error a*b* I<sup>2</sup> that is introduced by any beam /spindle / detector /cryo or other instability or malfunction. For weak reflections, a*v0(I), the variance from counting statistics, dominates.  


=== Versions of XDS before May 10, 2010 ===
Versions of XDS since May 10, 2010 print out I/Sigma(I)<sup>asymptotic</sup> = 1/&radic;(a*b) as "ISa". ISa is the I/sigma of an infinitely strong reflection. If there were no systematic error, ISa would be infinite. In the presence of systematic error, ISa is finite and is the upper limit of I/sigma of any observation in your dataset.  
A Unix command to obtain <math>{I/Sigma(I)}^{asymptotic}</math> from CORRECT.LP is
awk '/a *b *I/{getline;print ($1*($2+4e-4))^-0.5}' CORRECT.LP


The command just finds the line " a         b             INPUT DATA SET" in [[CORRECT.LP]], grabs the values of "a" and "b" from the next line, and prints out 1/sqrt(a*(b+0.0004)).  
As you can see from the formula, low values of a and b are ''good'' in the sense that a high upper limit of I/sigma(I) results. If e.g. the crystal is badly split or broken, or reflections are too close on the detector, or the data reduction is not good (wrong parameters), then the values of a and b are elevated.


The 0.0004 stems from the fact that the INTEGRATE step does its own adjustment of the variances, and the two adjustments (in INTEGRATE and CORRECT) have to be combined.
If your crystal is good, then a and b will reflect the quality of the other components of the experimental setup (e.g. beamline stability).  


=== Versions of XDS since May 10, 2010 ===
ISa is well suited to judge the quality of the experimental setup, because its value does not depend on random error, whereas the low-resolution R<sub>meas</sub> does, and is thus influenced by crystal size and exposure. If you see a high value of the low-resolution R<sub>meas</sub>, you don't know if it is high because the crystal diffracted weakly, or because the beamline was broken. Conversely, a low value of ISa indicates that something is broken, no matter how big the crystal is or how it was exposed.


Newer versions print out <math>{I/Sigma(I)}^{asymptotic}</mathas "ISa".  
== Practical considerations ==
In practice, both crystal quality and beamline quality limit the value of I/Sigma(I)<sup>asymptotic</sup>. A good crystal (even with elevated mosaicity and medium resolution) should give a high value on a good beamline.
   
I have seen values around 15-20 for good crystals that allowed me to solve a MAD structure, but that required high multiplicity of observations. Values around 30 allowed me to solve a sulfur-SAD structure at medium resolution (diffraction to 2.3 A, anomalous signal to 3 A).
I have also seen a value around 40 for Z. Dauter's 0.98A Proteinase K (2ID8) sulfur-SAD data from the APS/22-ID beamline, and recently even higher values were obtained at the SLS, beamline X06SA, with a Pilatus detector.


As you can see from the formula, low values of a and b are ''good'' in the sense that a high upper limit of I/sigma(I) results. If e.g. the crystal is badly split or broken, or reflections are too close on the detector, or the data reduction is not good (wrong parameters), then the values of a and b are elevated.
On the other hand, I have sometimes obtained values less than 10 with good test crystals, clearly indicating strong systematic errors. It is always good to discuss this with the people who are responsible for the beamline. They might know what is broken, or might be able to find out what went wrong.


If your crystal is good (and no matter ''how'' good your crystal is!), then a and b will reflect the quality of the other components of the experimental setup (e.g. beamline stability).
'''Values of ISa below 5 are almost always a sign that something is severely wrong, and that the data are useless. Possible reasons are: wrong geometry specified in XDS.INP, wrong spacegroup (too high symmetry), high radiation damage, mechanical deficiencies during measurement.'''
 
== Practical considerations ==
In practice, both crystal quality and beamline quality limit the value of <math>{I/Sigma(I)}^{asymptotic}</math> . A good crystal (even with elevated mosaicity and medium resolution) should give a high value on a good beamline.
I have seen values around 15-20 for good crystals that still allowed my to solve a MAD structure, but that required high multiplicity of observations. Values around 30 allowed me to solve a sulfur-SAD structure at medium resolution (diffraction to 2.3 A, anomalous signal to 3 A).  
I have also seen a value around 40 for Z. Dauter's 0.98A Proteinase K (2ID8) sulfur-SAD data from J. Holton's APS/22-ID beamline, and recently even higher values were obtained at the SLS, beamline X06SA, with a Pilatus detector.


On the other hand, I have sometimes obtained values less than 10 with good test crystals. It is always good to discuss this with the people who are responsible for the beamline. They might know what is broken, or might be able to find out what went wrong.
A high systematic error may be partly compensated by high multiplicity. Conversely, high multiplicity is not needed to solve a structure, if the data have a high I/Sigma(I)<sup>asymptotic</sup>.
For molecular replacement and refinement, a high value of I/Sigma(I)<sup>asymptotic</sup> is not strictly needed (but the maps are better with better data, and the model R values lower!).


A low <math>{I/Sigma(I)}^{asymptotic}</math> may be compensated by high multiplicity, at the expense of radiation damage. Conversely, high multiplicity is not needed to solve a structure, if the data have a high <math>{I/Sigma(I)}^{asymptotic}</math>.  
Please note that for electron diffraction, there is a systematic disagreement between symmetry-related reflections due to dynamic diffraction. Therefore one expects lower values of ISa for electron diffraction than for X-ray and neutron diffraction.
For molecular replacement and refinement, a high value of <math>{I/Sigma(I)}^{asymptotic}</math> is not strictly needed (but the maps are better with better data!).

Latest revision as of 07:54, 30 May 2025

CORRECT is the scaling step of XDS. It also re-refines the geometry of the experiment (cell parameters, crystal orientation, distance, beam direction and spindle direction) and gives all relevant statistics. The main outputs of this step are XDS_ASCII.HKL (reflection file) and CORRECT.LP (logfile), but there is a number of other files that are also produced for diagnostic purposes.

An estimate for the overall quality of an experimental setup

A single number that depends on the overall quality of an experimental setup (beam, crystal, spindle, detector, cryo, software, ...) is the upper limit of I/sigma(I) for any reflection in your dataset - even if your crystal is great, all reflections are bound to be worse than that.

This number is called I/Sigma(I)asymptotic ([Diederichs, Acta Cryst. (2010). D66, 733-740 http://dx.doi.org/10.1107/S0907444910014836])

What is that number? Scaling procedures (like the ones used in XDS, SCALA, SCALEPACK and DSCALEAVERAGE) scale (or rather, inflate) the variances of individual observations such that they match the experimental spread of symmetry-related observations. To this end, two contributions to the variance v(I) of a reflection are considered: the first component is random error, and the other component is systematic error. The two values a and b appearing in the variance-scaling formula v(I)=a*(v0(I)+b*I2) are printed out by CORRECT. a scales the random error component, a*b scales the systematic error component. For strong and well-measured reflections, the variance is dominated by the systematic error a*b* I2 that is introduced by any beam /spindle / detector /cryo or other instability or malfunction. For weak reflections, a*v0(I), the variance from counting statistics, dominates.

Versions of XDS since May 10, 2010 print out I/Sigma(I)asymptotic = 1/√(a*b) as "ISa". ISa is the I/sigma of an infinitely strong reflection. If there were no systematic error, ISa would be infinite. In the presence of systematic error, ISa is finite and is the upper limit of I/sigma of any observation in your dataset.

As you can see from the formula, low values of a and b are good in the sense that a high upper limit of I/sigma(I) results. If e.g. the crystal is badly split or broken, or reflections are too close on the detector, or the data reduction is not good (wrong parameters), then the values of a and b are elevated.

If your crystal is good, then a and b will reflect the quality of the other components of the experimental setup (e.g. beamline stability).

ISa is well suited to judge the quality of the experimental setup, because its value does not depend on random error, whereas the low-resolution Rmeas does, and is thus influenced by crystal size and exposure. If you see a high value of the low-resolution Rmeas, you don't know if it is high because the crystal diffracted weakly, or because the beamline was broken. Conversely, a low value of ISa indicates that something is broken, no matter how big the crystal is or how it was exposed.

Practical considerations

In practice, both crystal quality and beamline quality limit the value of I/Sigma(I)asymptotic. A good crystal (even with elevated mosaicity and medium resolution) should give a high value on a good beamline.

I have seen values around 15-20 for good crystals that allowed me to solve a MAD structure, but that required high multiplicity of observations. Values around 30 allowed me to solve a sulfur-SAD structure at medium resolution (diffraction to 2.3 A, anomalous signal to 3 A). I have also seen a value around 40 for Z. Dauter's 0.98A Proteinase K (2ID8) sulfur-SAD data from the APS/22-ID beamline, and recently even higher values were obtained at the SLS, beamline X06SA, with a Pilatus detector.

On the other hand, I have sometimes obtained values less than 10 with good test crystals, clearly indicating strong systematic errors. It is always good to discuss this with the people who are responsible for the beamline. They might know what is broken, or might be able to find out what went wrong.

Values of ISa below 5 are almost always a sign that something is severely wrong, and that the data are useless. Possible reasons are: wrong geometry specified in XDS.INP, wrong spacegroup (too high symmetry), high radiation damage, mechanical deficiencies during measurement.

A high systematic error may be partly compensated by high multiplicity. Conversely, high multiplicity is not needed to solve a structure, if the data have a high I/Sigma(I)asymptotic. For molecular replacement and refinement, a high value of I/Sigma(I)asymptotic is not strictly needed (but the maps are better with better data, and the model R values lower!).

Please note that for electron diffraction, there is a systematic disagreement between symmetry-related reflections due to dynamic diffraction. Therefore one expects lower values of ISa for electron diffraction than for X-ray and neutron diffraction.