<div dir="ltr">Hi Zhu,<div><br></div><div>I would suggest starting by stepping back and looking for problems all along the way.</div><div><br></div><div>I notice your space group is shown as C222.  This is an uncommon space group, occurring only 0.25% of the time in the PDB.  A much more common but related space group is C222(1) which occurs 20 times more frequently (5%).  Have a very close look at your systematic absences and rerun pointless and xtriage to check and see if you have the right space group.  Depending on your cell dimensions there are other possibilities for your space group as well (listed for you by these programs as well).</div><div><br></div><div>Look over your data processing output and your xtriage output carefully.  Is there translational non-crystallographic symmetry? Twinning? Are the intensity statistics as expected? If there are any unusual situations or characteristics, follow them up.</div><div><br></div><div>When you run autosol it will test the space group you specify and its enantiomer (if any).  If there are other possibilities you would need to run them separately.</div><div><br></div><div>For autobuilding you have two choices.  You can run autobuild, and as Randy pointed out, autobuild will build only one type of chain at a time (see below).</div><div><br></div><div>Your other option is running map_to_model. This tool is intended for cryo-EM but you can (sometimes) use it with crystallographic data. You can just give it a try. It can build multiple chain types, uses multiprocessing, and can take a long time.</div><div><br></div><div>With autobuild when you have DNA and protein the suggested approach is (see <a href="https://www.phenix-online.org/documentation/reference/autobuild.html">https://www.phenix-online.org/documentation/reference/autobuild.html</a>): </div><div><br></div><div><ul class="gmail-simple" style="margin-bottom:1em;color:rgb(0,0,0);font-family:Verdana,Helvetica,Arial,sans-serif;font-size:14.4px"><li>The AutoBuild model-building can only build one type of chain at a time (default chain_type=&#39;PROTEIN&#39;; other choices are RNA and DNA). If you supply a PDB file containing more than one type of chain for rebuilding, then all the residues that are not that type of chain are treated as ligands and are (by default, keep_input_ligands=True) included in refinement but not in rebuilding. Any input solvent molecules are (by default, keep_input_waters=False) ignored.</li></ul><p style="color:rgb(0,0,0);font-family:Verdana,Helvetica,Arial,sans-serif;font-size:14.4px">You can include more than one type of chain in rebuilding by supplying one type of chains as ligands with input_lig_file_list and rebuilding another type:</p><pre class="gmail-literal-block" style="margin-left:2em;margin-right:2em;color:rgb(0,0,0)">chain_type=PROTEIN  # build only protein

input_lig_file_list=MyDNA.pdb  # just read in DNA coordinates and include in refinement

</pre><p style="color:rgb(0,0,0);font-family:Verdana,Helvetica,Arial,sans-serif;font-size:14.4px">In this case only protein chains will be built, but the DNA coordinates in MyDNA.pdb will be included in all refinements and will be written out to the final coordinate file. You may wish to add the keyword:</p><pre class="gmail-literal-block" style="margin-left:2em;margin-right:2em;color:rgb(0,0,0)">keep_pdb_atoms=False  #keep the ligand atoms if model (pdb) and ligand overlap

</pre><p style="color:rgb(0,0,0);font-family:Verdana,Helvetica,Arial,sans-serif;font-size:14.4px">which will tell AutoBuild that the ligand (DNA) atoms are to be kept if the model that is being built (protein) overlaps with it. (The default is to keep the model that is being built and to discard any ligand atoms that overlap).</p><p style="color:rgb(0,0,0);font-family:Verdana,Helvetica,Arial,sans-serif;font-size:14.4px">This whole process is likely to require substantial editing of the PDB files by hand because when you build DNA, a lot of chains are going to be built into the protein region, and when you build protein, it is going to be accidentally built into the DNA.</p><p style="color:rgb(0,0,0);font-family:Verdana,Helvetica,Arial,sans-serif;font-size:14.4px"><br></p><p style="color:rgb(0,0,0);font-family:Verdana,Helvetica,Arial,sans-serif;font-size:14.4px">All the best,</p><p style="color:rgb(0,0,0);font-family:Verdana,Helvetica,Arial,sans-serif;font-size:14.4px">Tom T</p></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Mar 24, 2020 at 2:55 AM Randy Read &lt;<a href="mailto:rjr27@cam.ac.uk" target="_blank">rjr27@cam.ac.uk</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Dear Zhu,<br>

<br>

Questions specific to Phenix should really go to the Phenix-BB, so I am cross-posting my reply there.  Here I’ll focus more on generic issues.  There are also CCP4 tools that you could consider and presumably other people will offer advice on those.<br>

<br>

One point you raise comes up so much that we have an entry in our FAQ (<a href="https://www.phaser.cimr.cam.ac.uk/index.php/FAQ" rel="noreferrer" target="_blank">https://www.phaser.cimr.cam.ac.uk/index.php/FAQ</a>) about it: “Can I use Phaser to check the correctness of a model I have already built and refined?”  The answer is no, because once you’ve refined the model it becomes better than random at predicting the data and therefore achieves a high LLG score, regardless of whether or not it is correct.<br>

<br>

In this case, you might be able to use Phaser to help complete the model if one of the two copies of the complex is more completely modelled than the other.  If, say, you had a model for one copy you could fix that and search for a second copy: this should work because the refinement didn’t know anything about the second copy.<br>

<br>

Phenix.autosol attempts to determine the NCS operators, so you need to check whether that has succeeded.  If not, you might need to try some more manual approaches to defining the NCS, which would help a great deal in map improvement.  Tom Terwilliger might answer this in more detail (perhaps on the Phenix-BB), but I don’t think you can build protein and nucleic acid in the same job, so you should look at the documentation to see how to do that.<br>

<br>

Good luck!<br>

<br>

Randy Read<br>

-----<br>

Randy J. Read<br>

Department of Haematology, University of Cambridge<br>

Cambridge Institute for Medical Research     Tel: +44 1223 336500<br>

The Keith Peters Building                               Fax: +44 1223 336827<br>

Hills Road                                                       E-mail: <a href="mailto:rjr27@cam.ac.uk" target="_blank">rjr27@cam.ac.uk</a><br>

Cambridge CB2 0XY, U.K.                              <a href="http://www-structmed.cimr.cam.ac.uk" rel="noreferrer" target="_blank">www-structmed.cimr.cam.ac.uk</a><br>

<br>

&gt; On 24 Mar 2020, at 04:31, Zhu Qiao &lt;<a href="mailto:jasonqiao03@GMAIL.COM" target="_blank">jasonqiao03@GMAIL.COM</a>&gt; wrote:<br>

&gt; <br>

&gt; Dear All<br>

&gt; <br>

&gt; I am sorry for the long context. <br>

&gt; <br>

&gt; I have one protein (252 AAs, 2 Met) bound to double-stranded DNA (24 bp) crystalized.  I collected the Se-Met data of the crystal in C222 up to 2.8 angstrom. the space group is confirmed by running the pointless. <br>

&gt; <br>

&gt; I used the Phenix.Autosol to find the heavy atoms and get a quite nice map after the density modification.  It seems there are two proteins and two DNA duplex are in one ASU. Phenix.Autobuild can only build less than half of the protein sequence into the map and fill in the potential DNA map with amino acids. The Rwork/Rfree is 0.40 and 0.46, with the map CC=0.60. If I do the MR with the initial model built by Autobuild, the result TFZ=40, LLG=200+, which suggests the partial correction of the initial model. <br>

&gt; <br>

&gt; Here is the problem. From the map, I can see one of my protein domain and a clear feature of DNA double helix. But whatever I go further for manual build using coot, like building the DNA double-strand into the map and building the resolved domain, the refinement statistics go bad with R free ~0.50. <br>

&gt; <br>

&gt; I am wondering what&#39;s going wrong and how come the refinement can&#39;t improve the R factor. <br>

&gt; <br>

&gt; I have attached the relevant photos. <br>

&gt; <a href="https://drive.google.com/drive/folders/1dJ4kn7CEHkCL3sMcCJtBqa5OsVFQngBx?usp=sharing" rel="noreferrer" target="_blank">https://drive.google.com/drive/folders/1dJ4kn7CEHkCL3sMcCJtBqa5OsVFQngBx?usp=sharing</a><br>

&gt; <br>

&gt; <br>

&gt; Sincerely<br>

&gt; Zhu <br>

&gt; <br>

&gt; ########################################################################<br>

&gt; <br>

&gt; To unsubscribe from the CCP4BB list, click the following link:<br>

&gt; <a href="https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=CCP4BB&amp;A=1" rel="noreferrer" target="_blank">https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=CCP4BB&amp;A=1</a><br>

<br>

<br>

_______________________________________________<br>

phenixbb mailing list<br>

<a href="mailto:phenixbb@phenix-online.org" target="_blank">phenixbb@phenix-online.org</a><br>

<a href="http://phenix-online.org/mailman/listinfo/phenixbb" rel="noreferrer" target="_blank">http://phenix-online.org/mailman/listinfo/phenixbb</a><br>

Unsubscribe: <a href="mailto:phenixbb-leave@phenix-online.org" target="_blank">phenixbb-leave@phenix-online.org</a></blockquote></div><br clear="all"><div><br></div>-- <br><div dir="ltr"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div dir="ltr">Thomas C Terwilliger<div>Laboratory Fellow, Los Alamos National Laboratory</div><div>Senior Scientist, New Mexico Consortium</div><div>100 Entrada Dr, Los Alamos, NM 87544</div><div>Email: <a href="mailto:tterwilliger@newmexicoconsortium.org" target="_blank">tterwilliger@newmexicoconsortium.org</a></div><div>Tel: 505-431-0010</div><div><br></div></div></div></div></div></div></div></div></div></div>