<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body bgcolor="#ffffff" text="#000000">

    I agree with Phil about UNK - it seems to be good indeed to call

    unknown (undefined) residue as it appears on the map rather than

    call ALA something that in fact is TYR, and then later on getting

    confused about the mismatch between actual sequence and the one

    derived from PDB file. This is actually what I get confused all the

    time looking at results of model building programs, because the

    first thing I always do is I compare the real actual sequence with

    the one derived from PDB file - just to validate the result of model

    building.<br>

    <br>

    However, I agree with Tom too about loosing identity in cases where

    we really do know what to expect: polypeptide or rna/dna.<br>

    <br>

    Hm... interesting situation -:)<br>

    <br>

    I guess UNK is may be still better, ONLY IF you go one level deeper

    and look at atom names (or make sure you do that consistently). Say

    you name a "residue" as UNK and name corresponding atoms within this

    residue as CA, N, C, O (kind of peptide pattern) - then you have a

    chance to guess what it is. Of course how you then know where you

    place those CA,N,C and O...<br>

    <br>

    Pavel.<br>

    <br>

    <br>

    <br>

    On 7/28/10 3:19 PM, Tom Terwilliger wrote:

    <blockquote cite="mid:051EFF69-34F6-4F8D-B9B5-C47DB38DF393@lanl.gov"

      type="cite">One disadvantage of using UNK is that it is often a

      loss of information. For example in the case Phil mentions...we do

      think that we have a polypeptide. &nbsp;By labelling protein residues

      UNK we no longer distinguish them from DNA, or depending on HETATM

      vs ATOM identification, from ligands.

      <div>-Tom T</div>

      <div><br>

        <div>

          <div>On Jul 28, 2010, at 4:01 PM, Phil Evans wrote:</div>

          <br class="Apple-interchange-newline">

          <blockquote type="cite">

            <div>UNK residues have another valid use where you can see

              peptide but not assign a sequence register. A poly-Ala

              model in that case is better labelled UNK than ALA, since

              it isn't ALA<br>

              <br>

              Phil <br>

              <br>

              <br>

              On 28 Jul 2010, at 19:12, Pavel Afonine wrote:<br>

              <br>

              <blockquote type="cite">Dear Ed,<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">I think it is very important to

                  be able to include unknown atoms<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">in a deposited pdb file (with

                  echoing the caveat about flooding<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">the structure with UNK's to

                  lower the R-factor).<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">yes, as I wrote in original reply,

                including these atoms may improve the map and in turn

                may reveal or improve some its other important

                (biologically) places. The only point is: please define

                these dummy atoms properly, providing all the

                information, such as scattering element type that you or

                your program used for such an approximation.<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">For one thing, these structures

                  are produced not just for structure-factor<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">calculation and validation. Many

                  of the end users will never even<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">bother to do a structure factor

                  calculation.<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">The ability to reproduce the

                R-factor is not only for someones pleasure but for the

                validation purposes at least. If I've got a PDB file for

                which I can't compute the R-factors (and, by the way,

                even the map too), then I don't need the deposited Fobs

                too, unless I'm going to re-determine the structure from

                scratch.<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">It important for the<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">depositor to be able to refer to

                  an unknown but likely significant<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">ligand and for the reader to be

                  able to go and look at that position<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">(ideally surrounded by electron

                  density).<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">Sure, it is important.<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">For another thing, the structure

                  factor calculation will give exactly<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">the same result whether the

                  dummy atoms are omitted or are flagged<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite">

                <blockquote type="cite">with zero occupancy or atom-type

                  X to be ignored in sf calculation.<br>

                </blockquote>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">If you look in PDB you will find

                that very often the occupancies are not set up to 1.

                Plus, as I mentioned, often the B-factors for these

                atoms are set to some funny numbers (looks like they

                were refined).<br>

              </blockquote>

              <blockquote type="cite">Are we sure that these programs

                were ignoring these dummies in Fcalc calculations? If so

                how the B-factor were refined, or they were made up?<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">Again, if it is defined properly,

                for example, like this:<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">ATOM &nbsp;&nbsp;1959 &nbsp;O &nbsp;&nbsp;DUM A &nbsp;&nbsp;1

                &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;-8.762 &nbsp;&nbsp;8.060 &nbsp;25.324 &nbsp;1.00 31.23 &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;O<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">or<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">ATOM &nbsp;&nbsp;1959 &nbsp;O &nbsp;&nbsp;UNK A &nbsp;&nbsp;1

                &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;-8.762 &nbsp;&nbsp;8.060 &nbsp;25.324 &nbsp;1.00 31.23 &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;O<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">then it is absolutely OK to have

                such entries, because it is completely defined and can

                be used in any calculations without any unnecessary

                guesswork. But if you start masking things with X or

                blanks then I (and the software I write) will start

                asking all these nasty questions...<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">All the best!<br>

              </blockquote>

              <blockquote type="cite">Pavel.<br>

              </blockquote>

              <blockquote type="cite"><br>

              </blockquote>

              <blockquote type="cite">_______________________________________________<br>

              </blockquote>

              <blockquote type="cite">phenixbb mailing list<br>

              </blockquote>

              <blockquote type="cite"><a moz-do-not-send="true"

                  href="mailto:phenixbb@phenix-online.org">phenixbb@phenix-online.org</a><br>

              </blockquote>

              <blockquote type="cite"><a moz-do-not-send="true"

                  href="http://phenix-online.org/mailman/listinfo/phenixbb">http://phenix-online.org/mailman/listinfo/phenixbb</a><br>

              </blockquote>

              <br>

              _______________________________________________<br>

              phenixbb mailing list<br>

              <a moz-do-not-send="true"

                href="mailto:phenixbb@phenix-online.org">phenixbb@phenix-online.org</a><br>

              <a class="moz-txt-link-freetext" href="http://phenix-online.org/mailman/listinfo/phenixbb">http://phenix-online.org/mailman/listinfo/phenixbb</a><br>

            </div>

          </blockquote>

        </div>

        <br>

        <div apple-content-edited="true"> <span

            class="Apple-style-span" style="border-collapse: separate;

            color: rgb(0, 0, 0); font-family: Helvetica; font-size:

            12px; font-style: normal; font-variant: normal; font-weight:

            normal; letter-spacing: normal; line-height: normal;

            orphans: 2; text-indent: 0px; text-transform: none;

            white-space: normal; widows: 2; word-spacing: 0px;">

            <div style="word-wrap: break-word;">

              <div>

                <div>

                  <div>

                    <div>

                      <div><br>

                        Thomas C. Terwilliger<br>

                        Mail Stop M888<br>

                        Los Alamos National Laboratory<br>

                        Los Alamos, NM 87545<br>

                        <br>

                        Tel:&nbsp; 505-667-0072&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; email: <a

                          moz-do-not-send="true"

                          href="mailto:terwilliger@LANL.gov">terwilliger@LANL.gov</a><br>

                        Fax: 505-665-3024&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; SOLVE web

                        site:&nbsp;<a moz-do-not-send="true"

                          href="http://solve.lanl.gov">http://solve.lanl.gov</a><br>

                        PHENIX web site: http:<a moz-do-not-send="true"

                          href="http://www.phenix-online.org">www.phenix-online.org</a><br>

                        ISFI Integrated Center for Structure and

                        Function Innovation web&nbsp;site:&nbsp;<a

                          moz-do-not-send="true"

                          href="http://techcenter.mbi.ucla.edu">http://techcenter.mbi.ucla.edu</a><br>

                        TB Structural Genomics Consortium web site:&nbsp;<a

                          moz-do-not-send="true"

                          href="http://www.doe-mbi.ucla.edu/TB">http://www.doe-mbi.ucla.edu/TB</a></div>

                      <div>CBSS Center for Bio-Security Science web

                        site: <a moz-do-not-send="true"

                          href="http://www.lanl.gov/cbss">http://www.lanl.gov/cbss</a><br>

                        <br>

                      </div>

                    </div>

                  </div>

                </div>

                <br>

              </div>

            </div>

          </span><br class="Apple-interchange-newline">

        </div>

        <br>

      </div>

      <pre wrap="">

<fieldset class="mimeAttachmentHeader"></fieldset>

_______________________________________________

phenixbb mailing list

<a class="moz-txt-link-abbreviated" href="mailto:phenixbb@phenix-online.org">phenixbb@phenix-online.org</a>

<a class="moz-txt-link-freetext" href="http://phenix-online.org/mailman/listinfo/phenixbb">http://phenix-online.org/mailman/listinfo/phenixbb</a>

</pre>

    </blockquote>

    <br>

  </body>

</html>