1,330
edits
mNo edit summary |
mNo edit summary |
||
Line 4: | Line 4: | ||
'''SHELXC''' is designed to provide a simple and fast way of setting up the files for the programs '''SHELXD''' (heavy atom location) and '''SHELXE''' (phasing and density modification) for macromolecular phasing by the MAD, SAD, SIR and SIRAS methods. These three programs may be run in batch mode or called from a GUI such as [[CCP4i]] or (better) [[hkl2map]]. SHELXC is much less versatile than the Bruker AXS XPREP program for this purpose, but if you are sure of the space group and there are no problems with the indexing or twinning and the f’ and f” parts of the scattering factors do not need to be refined, SHELXC should be adequate. SHELXC can read either HKL2000 format .sca files or SHELX .hkl files (F-squared unless the -f switch is used to specify F). To transfer data from CCP4 it is advisable to generate .sca files using 'output unmerged polish' from SCALA or to use the program mtz2sca written by Tim Grüne and supplied with SHELX. The current version of SHELXC outputs extra useful diagnostic statistics if fed unmerged data. SHELXC, SHELXD and SHELXE are stand-alone executables that do not require environment variables or parameter files etc., so all that is needed to install them is to put them in a directory that is in the ‘path’ (e.g. /usr/local/bin or ~/bin under Linux). | '''SHELXC''' is designed to provide a simple and fast way of setting up the files for the programs '''SHELXD''' (heavy atom location) and '''SHELXE''' (phasing and density modification) for macromolecular phasing by the MAD, SAD, SIR and SIRAS methods. These three programs may be run in batch mode or called from a GUI such as [[CCP4i]] or (better) [[hkl2map]]. SHELXC is much less versatile than the Bruker AXS XPREP program for this purpose, but if you are sure of the space group and there are no problems with the indexing or twinning and the f’ and f” parts of the scattering factors do not need to be refined, SHELXC should be adequate. SHELXC can read either HKL2000 format .sca files or SHELX .hkl files (F-squared unless the -f switch is used to specify F). To transfer data from CCP4 it is advisable to generate .sca files using 'output unmerged polish' from SCALA or to use the program mtz2sca written by Tim Grüne and supplied with SHELX. The current version of SHELXC outputs extra useful diagnostic statistics if fed unmerged data. SHELXC, SHELXD and SHELXE are stand-alone executables that do not require environment variables or parameter files etc., so all that is needed to install them is to put them in a directory that is in the ‘path’ (e.g. /usr/local/bin or ~/bin under Linux). | ||
SHELXC reads a filename stem on the command line plus some instructions from 'standard input'. It writes some statistics to 'standard output' and prepares the three files needed to run SHELXD and SHELXE. It can be called from a GUI using a single command line such as: | SHELXC reads a filename stem on the command line plus some instructions from 'standard input'. It writes some statistics to 'standard output' and prepares the three files needed to run SHELXD and SHELXE. It can be called from a GUI using a single command line such as: | ||
shelxc xx <t | |||
which would read the instructions from the file t and write the files xx.hkl (h,k,l,I,sig(I) in SHELX HKLF4 format for density modification by SHELXE), xx_fa.ins (cell, symmetry etc. for heavy atom location using SHELXD) and xx_fa.hkl (h,k,l,FA,sig(FA),alpha for both SHELXD and SHELXE). The starting phases for density modification are estimated as (heavy atom phase + alpha) in the simplified approach used by SHELXE, alpha is calculated by SHELXC from the anomalous and dispersive differences. For SAD alpha is 90º (I+ > I–) or 270º (I+ < I–), for SIR and RIP alpha is 0º or 180º and for SIRAS or MAD alpha may be anywhere in the range 0º to 360º. | which would read the instructions from the file t and write the files xx.hkl (h,k,l,I,sig(I) in SHELX HKLF4 format for density modification by SHELXE), xx_fa.ins (cell, symmetry etc. for heavy atom location using SHELXD) and xx_fa.hkl (h,k,l,FA,sig(FA),alpha for both SHELXD and SHELXE). The starting phases for density modification are estimated as (heavy atom phase + alpha) in the simplified approach used by SHELXE, alpha is calculated by SHELXC from the anomalous and dispersive differences. For SAD alpha is 90º (I+ > I–) or 270º (I+ < I–), for SIR and RIP alpha is 0º or 180º and for SIRAS or MAD alpha may be anywhere in the range 0º to 360º. | ||
<p>The above command line could be used under UNIX or Windows; under UNIX the commands to run SHELXC, SHELXD and SHELXE and the instructions for SHELXC may also be combined into a single script file as shown in the following examples. In these scripts, the instructions start on the line after '<<EOF' and are terminated by 'EOF'. The instructions may be given in any order; CELL (unit-cell), SPAG (space group in PDB notation, spaces are ignored) and FIND (followed by the number of heavy atoms) must be given; the optional instructions SFAC, MIND, NTRY, SHEL, ESEL and DSUL, if present, are copied to the SHELXD input file. <br><br> | <p>The above command line could be used under UNIX or Windows; under UNIX the commands to run SHELXC, SHELXD and SHELXE and the instructions for SHELXC may also be combined into a single script file as shown in the following examples. In these scripts, the instructions start on the line after '<<EOF' and are terminated by 'EOF'. The instructions may be given in any order; CELL (unit-cell), SPAG (space group in PDB notation, spaces are ignored) and FIND (followed by the number of heavy atoms) must be given; the optional instructions SFAC, MIND, NTRY, SHEL, ESEL and DSUL, if present, are copied to the SHELXD input file. <br><br> | ||
== MAD example == | == MAD example == | ||
shelxc jia <<EOF | shelxc jia <<EOF | ||
NAT jia_nat.hkl | NAT jia_nat.hkl | ||
HREM jia_hrem.sca | HREM jia_hrem.sca | ||
PEAK jia_peak.sca | PEAK jia_peak.sca | ||
INFL jia_infl.sca | INFL jia_infl.sca | ||
LREM jia_lrem.sca | LREM jia_lrem.sca | ||
CELL 96.00 120.00 166.13 90 90 90 | CELL 96.00 120.00 166.13 90 90 90 | ||
SPAG C2221 | SPAG C2221 | ||
FIND 8 | FIND 8 | ||
NTRY 10 | NTRY 10 | ||
EOF | EOF | ||
shelxd jia_fa | shelxd jia_fa | ||
shelxe jia jia_fa -s0.6 -m20 | shelxe jia jia_fa -s0.6 -m20 | ||
shelxe jia jia_fa -s0.6 -m20 -i | shelxe jia jia_fa -s0.6 -m20 -i | ||
In this example (kindly donated by Zbigniew Dauter; Li et al., Nature Struct. Biol. 7 (2000) 555-559), Se-Met MAD data at four wavelengths are used to calculated the FA-values and phase shifts | In this example (kindly donated by Zbigniew Dauter; Li et al., Nature Struct. Biol. 7 (2000) 555-559), Se-Met MAD data at four wavelengths are used to calculated the FA-values and phase shifts that are written to the file jia_fa.hkl. The native (S-Met) data are read from jia_nat.hkl and written to jia.hkl. The file jia_fa.ins is prepared using the given cell, space group, FIND and NTRY instructions as well as a suitable SHEL command to truncate the resolution. SHELXD then searches for 8 (FIND) selenium atoms using 10 attempts (NTRY), and SHELXE is run for 20 cycles (-m) of density modification for both heavy atom enantiomorphs (-i inverts) with a solvent content (-s) of 0.6. The protein phases are written to jia.phs and jia_i.phs resp. If NAT is not specified, SHELXC would analyze the four MAD datasets to generate the (SeMet) native data jia.hkl, in which case -h should be specified for SHELXE since the selenium atoms are present in the ‘native’ structure. For MAD at least two wavelengths are required, at least one of which should be PEAK or INFL. | ||
If the MAD experiment fails, one should insert the line 'SMAD' somewhere in the SHELXC input instructions and run the job again. This makes a MAD experiment into a SAD experiment in which a suitably weighted mean of the anomalous differences is employed and the dispersive differences are ignored. If the CC values in SHELXD come out better, this SAD approach is likely to give a better solution, but it may be then worth trying commenting out one or more of the PEAK, INFL, HREM and LREM commands to see if there is a further improvement (if just one remains, it should be renamed SAD).<br><br> | If the MAD experiment fails, one should insert the line 'SMAD' somewhere in the SHELXC input instructions and run the job again. This makes a MAD experiment into a SAD experiment in which a suitably weighted mean of the anomalous differences is employed and the dispersive differences are ignored. If the CC values in SHELXD come out better, this SAD approach is likely to give a better solution, but it may be then worth trying commenting out one or more of the PEAK, INFL, HREM and LREM commands to see if there is a further improvement (if just one remains, it should be renamed SAD).<br><br> | ||
Line 32: | Line 31: | ||
== SAD Example == | == SAD Example == | ||
This example of thaumatin phasing by means of the native sulfur anomalous signal (Debreczeni et al., Acta Cryst. D59 (2003) 688-696) uses 1.55 Å in-house CuKalpha data: | This example of thaumatin phasing by means of the native sulfur anomalous signal (Debreczeni et al., Acta Cryst. D59 (2003) 688-696) uses 1.55 Å in-house CuKalpha data: | ||
shelxc thau <<EOF | shelxc thau <<EOF | ||
SAD thau-nat.hkl | SAD thau-nat.hkl | ||
CELL 58.036 58.036 151.29 90 90 90 | CELL 58.036 58.036 151.29 90 90 90 | ||
SPAG P41212 | SPAG P41212 | ||
FIND 9 | FIND 9 | ||
DSUL 8 | DSUL 8 | ||
MIND –3.5 | MIND –3.5 | ||
NTRY 100 | NTRY 100 | ||
EOF | EOF | ||
shelxd thau_fa | shelxd thau_fa | ||
shelxe thau thau_fa -h -s0.5 -m20 | shelxe thau thau_fa -h -s0.5 -m20 | ||
shelxe thau thau_fa -h -s0.5 -m20 –i | shelxe thau thau_fa -h -s0.5 -m20 –i | ||
The anomalous differences are extracted from the native data so only one data file is required. The sites specified by FIND consist of one methionine and 8 super-sulfurs, which are then resolved into disulfides using the DSUL instruction that is passed on to SHELXD (Debreczeni et al., Acta Cryst. D59 (2003) 2125-2132). Alternatively one could try to find the individual sulfurs with:<br> | The anomalous differences are extracted from the native data so only one data file is required. The sites specified by FIND consist of one methionine and 8 super-sulfurs, which are then resolved into disulfides using the DSUL instruction that is passed on to SHELXD (Debreczeni et al., Acta Cryst. D59 (2003) 2125-2132). Alternatively one could try to find the individual sulfurs with:<br> | ||
SHEL 999 2.0 | |||
SHEL 999 2.0 | FIND 17 | ||
FIND 17 | MIND –1.7 | ||
MIND –1.7 | |||
Here the resolution cutoff has been reduced from 2.1 Å (which SHELXC would have suggested) to 2.0 Å to improve the chances of resolving the sulfurs. The SHEL, FIND, MIND and NTRY instructions are transferred to the file thau_fa.ins for the sulfur atom location with SHELXD. Note that the phases can be improved further in this case by using more SHELXE cycles than the usual 20. <br><br> | Here the resolution cutoff has been reduced from 2.1 Å (which SHELXC would have suggested) to 2.0 Å to improve the chances of resolving the sulfurs. The SHEL, FIND, MIND and NTRY instructions are transferred to the file thau_fa.ins for the sulfur atom location with SHELXD. Note that the phases can be improved further in this case by using more SHELXE cycles than the usual 20. <br><br> | ||
Line 60: | Line 58: | ||
This involves the solution of the thaumatin structure using the above 1.55 Å data as native and 2.0 Å CuKalpha data from a quick iodide soak. SIRAS usually gives the best results for iodide soaks, but it is also possible in this case to use SIR (change ‘SIRA’ to ‘SIR’) or iodine SAD (change ‘SIRA’ to ‘SAD’). <br> | This involves the solution of the thaumatin structure using the above 1.55 Å data as native and 2.0 Å CuKalpha data from a quick iodide soak. SIRAS usually gives the best results for iodide soaks, but it is also possible in this case to use SIR (change ‘SIRA’ to ‘SIR’) or iodine SAD (change ‘SIRA’ to ‘SAD’). <br> | ||
shelxc thaui <<EOF | |||
shelxc thaui <<EOF | NAT thau-nat.hkl | ||
NAT thau-nat.hkl | SIRA thau-iod.hkl | ||
SIRA thau-iod.hkl | CELL 58.036 58.036 151.29 90 90 90 | ||
CELL 58.036 58.036 151.29 90 90 90 | SPAG P41212 | ||
SPAG P41212 | FIND 17 | ||
FIND 17 | NTRY 10 | ||
NTRY 10 | MIND –3.5 –0.1 | ||
MIND –3.5 –0.1 | EOF | ||
EOF | shelxd thaui_fa | ||
shelxd thaui_fa | shelxe thaui thaui_fa -s0.5 –m20 | ||
shelxe thaui thaui_fa -s0.5 –m20 | shelxe thaui thaui_fa -s0.5 –m20 -i | ||
shelxe thaui thaui_fa -s0.5 –m20 -i | |||
Line 100: | Line 97: | ||
SHELXE has following modes of action (xx and yy are filename stems):<br> | SHELXE has following modes of action (xx and yy are filename stems):<br> | ||
shelxe xx [reads xx.hkl and xx.ins, phases from atoms] | |||
shelxe xx [reads xx.hkl and xx.ins, phases from atoms] | shelxe xx yy [normal mode: reads xx.hkl, yy.hkl, yy.res] | ||
shelxe xx yy [normal mode: reads xx.hkl, yy.hkl, yy.res] | shelxe xx.phi [reads xx.phi, xx.hkl, xx.ins] | ||
shelxe xx.phi [reads xx.phi, xx.hkl, xx.ins] | shelxe xx.fcf [reads only xx.fcf] | ||
shelxe xx.fcf [reads only xx.fcf] | shelxe xx.phi yy [reads xx.phi, xx.hkl, xx.ins, yy.hkl] | ||
shelxe xx.phi yy [reads xx.phi, xx.hkl, xx.ins, yy.hkl] | shelxe xx.fcf yy [reads xx.fcf, yy.hkl, yy.res] | ||
shelxe xx.fcf yy [reads xx.fcf, yy.hkl, yy.res] | |||
xx.hkl contains native data, yy.hkl contains FA and alpha and should have been created using SHELXC or XPREP. xx.phi has .phs format (h,k,l,F,fom,phi in free format) and can be made by renaming a .phs output file from SHELXE, but only the starting phases are read from it; if a .phi file is read, the cell and symmetry are read from xx.ins and the native F-values are read from xx.hkl. xx.fcf (from a SHELXL structure refinement) provides cell, symmetry and starting phases. The output phases are written to xx.phs, the log file is written to xx.lst and, if -b is set, improved substructure phases are output to xx.pha and revised heavy atoms to xx.hat.<br> | xx.hkl contains native data, yy.hkl contains FA and alpha and should have been created using SHELXC or XPREP. xx.phi has .phs format (h,k,l,F,fom,phi in free format) and can be made by renaming a .phs output file from SHELXE, but only the starting phases are read from it; if a .phi file is read, the cell and symmetry are read from xx.ins and the native F-values are read from xx.hkl. xx.fcf (from a SHELXL structure refinement) provides cell, symmetry and starting phases. The output phases are written to xx.phs, the log file is written to xx.lst and, if -b is set, improved substructure phases are output to xx.pha and revised heavy atoms to xx.hat.<br> | ||
Line 116: | Line 112: | ||
SHELXE normally requires a few command line switches, e.g.<br> | SHELXE normally requires a few command line switches, e.g.<br> | ||
shelxe xx yy -m20 -s0.45 -h8 -b | shelxe xx yy -m20 -s0.45 -h8 -b | ||
would do 20 cycles density modification with a solvent content of 0.45, phasing from the first 8 heavy atoms in the yy.res file from SHELXD assuming that they are also present in the native structure (-h8), and then use the modified density to generate improved heavy atoms (-b). The switch -i may be added to invert the substructure (and if necessary the space group), this writes xx_i.phs instead of xx.phs etc., and so may be run in parallel. <br> | would do 20 cycles density modification with a solvent content of 0.45, phasing from the first 8 heavy atoms in the yy.res file from SHELXD assuming that they are also present in the native structure (-h8), and then use the modified density to generate improved heavy atoms (-b). The switch -i may be added to invert the substructure (and if necessary the space group), this writes xx_i.phs instead of xx.phs etc., and so may be run in parallel. <br> |