Downloading data from PDB and setting up for SAD structure determination with phenix.sad_data_from_pdb

Author(s)

phenix.sad_data_from_pdb: Tom Terwilliger

Purpose

phenix.sad_data_from_pdb is a command line tool for downloading coordinates and anomalous structure factor data for a PDB deposition and setting up for running one or more wavelengths of data as SAD datasets.

Usage

How phenix.sad_data_from_pdb works:

phenix.sad_data_from_pdb first uses phenix.fetch_pdb to download the data from the PDB.

Then it converts the structure factor data to Scalepack unmerged original index format, calculates model map coefficients, and identifies the anomalously-scattering atoms in the PDB file, the wavelengths of data collection, and the resolution of the data. If these are not present in the PDB deposition then the procedure will not work.

Examples

Standard run of phenix.sad_data_from_pdb:

Running phenix.sad_data_from_pdb is easy. You can just type:

phenix.sad_data_from_pdb pdb_ids="2etd"

and phenix.sad_data_from_pdb will download the data for 2etd and set up a directory with this data and with subdirectories for each wavelength of data present (w1, w2, and w3 in this case). In each subdirectory a file run_autosol_std.csh will be created that is suitable for running phenix.autosol using that anomalous data.

Possible Problems

Specific limitations and problems:

Literature

Additional information

List of all available keywords

base_directory = None Directory where files are to be located
pdb_ids = None PDB ID to fetch and use to create SAD datasets
wavelength_id = None Select only the wavelength specified by wavelength_id
ha_list = None List of anomalously-scattering atoms (used instead of list extracted from PDB file )
steps = *fetch *cif_to_sca *fmodel *ha *autosol Steps to carry out: fetch downloads files from the PDB, cif_to_sca converts PDB CIF files to Scalepack unmerged original index, fmodel calculates model structure factors, ha identifies anomalously-scattering atoms in the PDB file, autosol sets up for a run of autosol. To select several steps put them in quotes with the stars: steps='*ha *autosol'
initialize = None If set, remove any existing data in the PDB directories
best_only = True Keep only atom with strongest anomalous scattering and ignore the rest
merge = None If set, merge multiple observations
replace_full_directory = None You can change the full directory name for a single dataset