Quick model-building of a single model with resolve using build_one_model
Author(s)
- build_one_model: Tom Terwilliger
Purpose
If you have an mtz map coefficients file and a sequence file, you can
use build_one_model to just build a single model with resolve. You can
also extend an existing model.
Usage
How build_one_model works:
build_one_model runs resolve model-building, by default in
superquick_build mode. If you supply a model, then resolve will try to
extend the ends of each chain in your model.
Output files from build_one_model
build_one_model.pdb: A PDB file with the resulting model
Examples
Standard run of build_one_model:
Running build_one_model is easy. From the command-line you can type:
phenix.build_one_model map_coeffs.mtz \
sequence.dat free_in=exptl_fobs_phases_freeR_flags.mtz
where exptl_fobs_phases_freeR_flags.mtz has your free R flags for
refinement. If you want to supply a PDB file to extend instead you can
do that:
phenix.build_one_model map_coeffs.mtz \
sequence.dat free_in=exptl_fobs_phases_freeR_flags.mtz \
model.pdb
Possible Problems
Specific limitations and problems:
Literature
Additional information
List of all available keywords
- build_one_model
- input_files
- seq_file = None File with 1-letter code sequence of molecule.
Chains separated by blank line or greater-than sign
- pdb_in = None Optional starting PDB file (ends will be extended if present)
- map_coeffs = None MTZ file with coefficients for a map
- map_coeffs_labels = None Map coeffs labels (like: FWT,PHFWT). Alternative to labin_map_coeffs
- labin_map_coeffs = None Labin line for MTZ file with map coefficients.
This is optional if build_one_model
can guess the correct coefficients
for FP PHI . Otherwise specify:
LABIN FP=myFP PHIB=myPHI
where myFP is your column label for FP
- ncs_info_file = None ncs_spec file with NCS information
(written by simple_ncs_from_pdb or find_ncs)
- remove_free
- free_in = None MTZ file containing FreeR_flags NOTE free_in only used in build_one_model. Ignored by phase_and_build Used as source of freeR information for real_space refinement. Note other columns of data may be present and can be used in reciprocal-space refinement. A suitable file is exptf_fobs_phases_freeR_flags.mtz from autosol/autobuild or my_model_refine_data.mtz from phenix.refine
- labin_free = None Labin line for MTZ file with FreeR_flags.
This is optional if build_one_model
can guess the correct coefficients
for FreeR_flags.Otherwise specify:
FreeR_flags==myFreeR_flags
- map_coeffs_no_free = None Optional MTZ file with coefficients for a map with freeR set removed. Use instead of free_in. This map will be used for real-space refinement Not needed if use_all_in_refine is True
- map_coeffs_labels_no_free = None Map coeffs labels (like: FWT,PHFWT). Alternative to labin_map_coeffs_no_free
- labin_no_free = None Labin line for MTZ file with map coefficients and
freeR set removed.
This is optional if build_one_model
can guess the correct coefficients
for FP PHI . Otherwise specify:
LABIN FP=myFP PHIB=myPHI
where myFP is your column label for FP
- test_flag_value = None This parameter sets the value of the test set that is to be free. Normally phenix sets up test sets with values of 0 and 1 with 1 as the free set. The CCP4 convention is values of 0 through 19 with 0 as the free set. Either of these is recognized by default in Phenix. If you have any other convention (for example values of 0 to 19 and test set is 1) then you can specify this with test_flag_value.
- output_files
- target_output_format = *None pdb mmcif Desired output format (if possible). Choices are None ( try to use input format), pdb, mmcif. If output model does not fit in pdb format, mmcif will be used. Default is pdb.
- pdb_out = build_one_model.pdb Output PDB file
- log = build_one_model.log Output log file
- params_out = build_one_model_params.eff Parameters file to rerun build_one_model
- model_building_log = model_building.log Log file for model-building
- refinement
- rs_refine = True You can run real-space refinement after model-building NOTE: real_space refinement requires a source of FreeR_flag and standard requires Fobs SigFobs and a source of FreeR_flag For real-space refinement you can supply either an mtz file with a FreeR_flag column or an mtz map file that has all the FreeR reflections removed
- use_all_in_refine = False You can use all your data in refinement (i.e., for cryo-EM data) if you want to. Not suitable for X-ray data.
- macro_cycles = None Macro cycles in real-space refinement (None=default)
- rs_refine_adp = False Refine with ADP in real-space refinement
- real_space_refine_before_merge = False Run real-space refinement on partial models before merging
- macro_cycles_in_refine_before_merge = 3 Macro cycles in real_space_refine_before_merge
- model_building
- quick = False You can run quickly (superquick_build/delta_phi=30.) or more thoroughly (default, thorough_build/delta_phi=20.)
- extend_only = True If input model is supplied, just extend it (do not build a new model from scratch).
- extend = True If input model is supplied, extend it. (Set to False to not extend input model or helices_strands_only model).
- trace_chain_only = None You can find just a trace_chain model.
- target_time_for_trace_chain = 15 Target length of time to work on trace_chain (sec)
- helices_strands_only = None You can find just helices and strands and extend them. Same as insert_helices=True and include_strands=True. This is useful at resolutions of 3. A and worse.
- insert_helices = None You can find helices and use them as a starting point for model-building. This is useful if your resolution is worse than 3 A. Same as helices_strands_only=True include_strands=False.
- include_strands = None You can find strands and use them as a starting point for model-building. This is useful if your resolution is worse than 3 A. Same as helices_strands_only=True insert_helices=False.
- build_rna_helices = None Use build_rna_helices tool to build RNA if chain_type=RNA
- n_build_repeat = 0 Iterations of building
- n_random_frag = None Tries of fragment extension
- delta_phi = None Rotation angle for secondary structure search (set with quick/thorough to 30/20 degrees by default)
- rna_rho_min_main_base = -1. starting minimum density at main-chain atoms for RNA building
- rna_rho_min_main_low = -1. final minimum density at main-chain atoms for RNA building
- rna_rho_min_main_seg = -1. minimum density at main-chain atoms in segment ID in RNA building
- rna_rho_min_side_seg = -1. minimum density at side-chain atoms in segment ID in RNA building
- cc_min = 0.40 Minimum map-model correlation to keep a segment after refinement
- cc_min_rna = 0.25 Minimum map-model correlation (RNA building)
- merge_with_combine_models = True Merge using parallel combine_models method. Can include standard merge. Alternative is merging with resolve.
- merge_by_segment_correlation = True Merge using segment correlation if merge_with_combine_models is selected. Can be used along with merge_remainder=True/False and standard_merge=True/False
- merge_remainder = True Merge remainder in merge_by_segment_correlation
- standard_merge = True If merge_with_combine_models is set, merge by reading all chains and running resolve merging.
- use_cc_in_combine_extend = False You can choose to use the correlation of density rather than density at atomic positions to score models in the merge_second_model or merge_both_models step. This may be useful at lower resolution (> 3 A)
- directories
- temp_dir = "temp_dir" Optional temporary work directory
- output_dir = "" Output directory where files are to be written
- top_output_dir = "" Top output directory for control files
- crystal_info
- resolution = 0. high-resolution limit for map calculation
- solvent_fraction = None You can specify the solvent fraction Normally it is set automatically
- chain_type = *PROTEIN DNA RNA Chain type (for identifying main-chain and side-chain atoms)
- scattering_table = *n_gaussian wk1995 it1992 electron neutron Choice of scattering table for structure factor calculations. Standard for X-ray is n_gaussian, for cryoEM is electron.
- semet = False You can specify that your protein contains selenomethionine
- control
- verbose = False Verbose output
- raise_sorry = False Raise sorry if problems
- debug = False Debugging output
- carry_on = False Try to continue from previous run. For phase_and_build only
- dry_run = False Just read in and check parameter names
- do_only_one_thing = False Just do one thing
- write_run_directory_to_file = None The working directory name is written to this file
- multiprocessing = *multiprocessing sge lsf pbs condor pbspro slurm Choices are multiprocessing (single machine) or queuing systems
- queue_run_command = None run command for queue jobs. For example qsub.
- parallel_step = *final_assembly assembly peak_search all Last step to carry out in parallel in model-building. peak_search means find the secondary structure in parallel. assembly means find ss and extend in parallel. final_assembly means find ss, extend and create final model. all means do entire model-building nproc times.
- nproc = 1 Number of processors to use (None is do not use multiprocessing)
- i_ran_seed = 712341 Random seed for model-building
- remove_pdb_block = None Clear out any .pdb_block files written by previous runs of map_to_model. Normally only set to False when you are running jobs with split_up_job. Default is True unless do_only_one_thing is set. Used in phase_and_build only.
- model_start = None You can use this to specify skipping model-building of model numbers below this (used to start in the middle). Used in phase_and_build only.
- model_end = None You can use this to specify skipping model-building of model numbers above this (used to start in the middle). Used in phase_and_build only.
- resolve_size = 12 Size for resolve
- resolve_command_list = None You can supply any resolve command here NOTE: for command-line usage you need to enclose the whole set of commands in double quotes (") and each individual command in single quotes (') like this: resolve_command_list="'no_build' 'b_overall 23' "