2QVO.xds: Difference between revisions

628 bytes removed ,  14 March 2011
no edit summary
No edit summary
No edit summary
Line 3: Line 3:
===dataset 1===
===dataset 1===


Using "generate_XDS.INP ../../APS/22-ID/2qvo/ACA10_AF1382_1.0???" we obtain:
Using "[[generate_XDS.INP]] ../../APS/22-ID/2qvo/ACA10_AF1382_1.0???" we obtain:
<pre>
JOB= XYCORR INIT COLSPOT IDXREF DEFPIX INTEGRATE CORRECT
JOB= XYCORR INIT COLSPOT IDXREF DEFPIX INTEGRATE CORRECT
ORGX= 1996.00 ORGY= 2028.00  ! check these values with adxv !
ORGX= 1996.00 ORGY= 2028.00  ! check these values with adxv !
Line 18: Line 19:
UNIT_CELL_CONSTANTS= 70 80 90 90 90 90 ! put correct values if known
UNIT_CELL_CONSTANTS= 70 80 90 90 90 90 ! put correct values if known
INCLUDE_RESOLUTION_RANGE=50 0  ! after CORRECT, insert high resol limit; re-run CORRECT
INCLUDE_RESOLUTION_RANGE=50 0  ! after CORRECT, insert high resol limit; re-run CORRECT


FRIEDEL'S_LAW=FALSE    ! This acts only on the CORRECT step
FRIEDEL'S_LAW=FALSE    ! This acts only on the CORRECT step
Line 50: Line 50:
FRACTION_OF_POLARIZATION=0.98  ! better value is provided by beamline staff!
FRACTION_OF_POLARIZATION=0.98  ! better value is provided by beamline staff!
POLARIZATION_PLANE_NORMAL=0 1 0
POLARIZATION_PLANE_NORMAL=0 1 0
</pre>


Now we run xds_par. This runs to completion. We should at least inspect, using XDS-Viewer, the file FRAME.cbf since this shows us the last frame of the dataset, with boxes superimposed which correspond to the expected locations of reflections.  
Now we run "xds_par". This runs to completion. We should at least inspect, using [[XDS-Viewer]], the file FRAME.cbf since this shows us the last frame of the dataset, with boxes superimposed which correspond to the expected locations of reflections.  


The automatic spacegroup determination (CORRECT.LP) comes up with
The automatic spacegroup determination (CORRECT.LP) comes up with
Line 156: Line 157:
==SHELXC/D/E structure solution==
==SHELXC/D/E structure solution==


This is done in a subdirectory of the XDS data reduction directory. Here, we generate XDSCONV.INP (I used MERGE=TRUE, sometimes the results are better that way) and run xdsconv and [[ccp4com:SHELX_C/D/E|SHELXC]]:
This is done in a subdirectory of the XDS data reduction directory (either dataset "1" or "2", and we can also try it in a xscale subdirectory). Here, we generate XDSCONV.INP (I used MERGE=TRUE, sometimes the results are better that way) and run xdsconv and [[ccp4com:SHELX_C/D/E|SHELXC]]:
<pre>
<pre>
#!/bin/csh -f
#!/bin/csh -f
Line 176: Line 177:
end
end
   
   
This writes j.hkl, j_fa.hkl and j_fa.ins. However, we overwrite j_fa.ins now:
This writes j.hkl, j_fa.hkl and j_fa.ins. However, we overwrite j_fa.ins now (these lines are just the ones that [[ccp4com:hkl2map|hkl2map]] would write):
<pre>
cat > j_fa.ins <<end
cat > j_fa.ins <<end
TITL j_fa.ins SAD in P42
TITL j_fa.ins SAD in P42
Line 197: Line 199:
END
END
end
end
   
</pre>
shelxd j_fa
and then
  shelxd j_fa


This gives best CC All/Weak of 35.61 / 26.03 for dataset 2, and best CC All/Weak of 36.74 / 21.55 for dataset 1.  
This gives best CC All/Weak of 36.74 / 21.55 for dataset 1, and best CC All/Weak of 35.61 / 26.03 for dataset 2, and .  


Next we run G. Sheldrick's beta-Version of [[ccp4com:SHELX_C/D/E|SHELXE]] Version 2009/4:
Next we run G. Sheldrick's beta-Version of [[ccp4com:SHELX_C/D/E|SHELXE]] Version 2009/4:


  shelxe.beta j j_fa -a6 -q -h -s0.55 -m20 -b  
  shelxe.beta j j_fa -a -q -h -s0.55 -m20 -b  
and the the inverse hand:
shelxe.beta j j_fa -a -q -h -s0.55 -m20 -b -i


Some important lines in the output: for dataset 2, I get
One of these solves the structure, the other gives bad statistics.
    79 residues left after pruning, divided into chains as follows:
A:  20  B:  22  C:  37
CC for partial structure against native data =  50.42 %
...
  <wt> = 0.300, Contrast = 0.731, Connect. = 0.817 for dens.mod. cycle 20
...
Estimated mean FOM = 0.659  Pseudo-free CC = 68.71 %


for dataset 1, I get
Some important lines in the output: for dataset 1, I get
    80 residues left after pruning, divided into chains as follows:
A:  23  B:  57


CC for partial structure against native data =  45.79 %
...
<wt> = 0.300, Contrast = 0.711, Connect. = 0.812 for dens.mod. cycle 20
...
Estimated mean FOM = 0.611  Pseudo-free CC = 63.70 %




'''clearly indicating that the structure can be solved with each of the two datasets individually.'''
'''clearly indicating that the structure can be solved with each of the two datasets individually.'''
For completeness, we run the inverse hand:
shelxe.beta j j_fa -a6 -q -h -s0.55 -m20 -b -i
but of course this gives much worse statistics.


==Optimization of data reduction==
==Optimization of data reduction==


The only safe way to optimize the data reduction is to look at external quality indicators. Internal R-factors, and even the correlation coefficient of the anomalous signal are of comparatively little value. A readily available external quality indicator is CC All/CC Weak as obtained by [[ccp4com:SHELX_C/D/E|SHELXD]].
The safest way to optimize the data reduction is to look at external quality indicators. Internal R-factors, and even the correlation coefficient of the anomalous signal are of comparatively little value. A readily available external quality indicator is CC All/CC Weak as obtained by [[ccp4com:SHELX_C/D/E|SHELXD]].


WFAC1 was already discussed above. Another candidate for optimization is MAXIMUM_ERROR_OF_SPOT_POSITION. By default this is 3.0 . In the case of these data, this default appears to be too small, because the STANDARD DEVIATION OF SPOT   POSITION (PIXELS) (as reported by IDXREF, INTEGRATE and CORRECT after refinement) is quite high (1.5 and more). This prevents XDS from using all the reflections for geometry refinement.
WFAC1 was already discussed above. Another candidate for optimization is MAXIMUM_ERROR_OF_SPOT_POSITION. By default this is 3.0 . In the case of these data, this default appears to be too small, because the STANDARD DEVIATION OF SPOT POSITION (PIXELS) (as reported by IDXREF, INTEGRATE and CORRECT after refinement) is quite high (1.5 and more). This prevents XDS from using all the reflections for geometry refinement. In general, it makes sense to use MAXIMUM_ERROR_OF_SPOT_POSITION= (at least 3 times the STANDARD DEVIATION OF SPOT POSITION (PIXELS)) 


I found that MAXIMUM_ERROR_OF_SPOT_POSITION=6.0 significantly improved the internal statistics (mostly the r-factors, but not so much the correlation coefficient of the anom signal), and improved CC All/CC Weak indicators (to more than 40). SHELXE then produces significantly better and more complete models. Try for yourself!
I found that MAXIMUM_ERROR_OF_SPOT_POSITION=6.0 significantly improved the internal statistics (mostly the r-factors, but not so much the correlation coefficient of the anom signal), and improved CC All/CC Weak indicators (to more than 40). SHELXE then produces significantly better and more complete models. Try for yourself!


One thing I noticed that if I specify the known spacegroup in IDXREF then the results are worse than if the integration is performed in P1. Likewise, [[optimization]] did not work: recycling of GXPARM.XDS to use as XPARM.XDS, and thus imposing the lattice symmetry in the geometry refinement in INTEGRATE. These findings my correspond to the fact that in P1 the angles do not refine to 90.0xx or 89.9xx degrees. In other words, the metric symmetry is not as well fulfilled as it should be. This results in fairly large deviations from the ideal P42 positions; the refinement of cell parameters in P1 partly compensates for this. I have however no idea why this deviation from metric symmetry could occur.  
[[Optimization]] does improve things as much as it often does: recycling of GXPARM.XDS to use as XPARM.XDS, and thus imposing the lattice symmetry in the geometry refinement in INTEGRATE. These findings my correspond to the fact that in P1 the angles do not refine to 90.0xx or 89.9xx degrees. In other words, the metric symmetry is not as well fulfilled as it should be. This results in fairly large deviations from the ideal P42 positions; the refinement of cell parameters in P1 partly compensates for this. I have however no idea why this deviation from metric symmetry could occur.  


==Optimization of structure solution==
==Optimization of structure solution==


There are some parameters in the SHELXC/D/E approach above that could be optimized as well: first of all, MERGE=TRUE in XDSCONV.INP turned later out to be the wrong choice (using the default MERGE=FALSE turns out to give a model with 85 consecutive residues for dataset 1). Then of course, the resolution limit for SHELXD could be varied, and the solvent content for SHELXE. For SHELXE in particular, many things could be tried.
The resolution limit for SHELXD could be varied. For SHELXE, the solvent content could be varied, and the number of autobuilding cycles, and probably also the high resolution cutoff.


==Limits==
==Limits==


With dataset 2, I tried to use 270 frames but could not solve the structure using the above SHELXC/D/E approach (not even with MAXIMUM_ERROR_OF_SPOT_POSITION=6.0). With 315 frames, it was possible.
With dataset 2, I tried to use 270 frames but could not solve the structure using the above SHELXC/D/E approach (not even with MAXIMUM_ERROR_OF_SPOT_POSITION=6.0). With 315 frames, it was possible.
2,652

edits