regarding number of unique reflections
Dear all, First of all sorry for the novice question. I *processed* one of my datasets using xds. Since I was not expecting an anomalous signal, I kept Friedel's Law = True. The number of unique reflections I got: *17,000* I used xds_ASCII.HKL after correct.LP and converted it to mtz using Phenix with 10% reflections for Rfree calculations. Now, when I am trying to* refine *the structure in Phenix, the structure is being refined against *32,300* reflections with *3230* reflections for Rfree calculation. Looks like refinement is treating Friedel's pair as two different reflections. When I am trying to upload PDB on rcsb, on the refinement tab, the number of reflections used for refinement and Rfree calculations are shown as 32,300 and 3220 respectively, while in the validation report, no. of reflections used for Rfree calculation are shown as *1700*. I would really appreciate if someone can guide me to resolve this discrepancy. Thanks in advance. Viney
This is a common problem. I use ccp4i CAD to remove all the columns from the .mtz file except for F, SIGF, and FreeR_flag, and then use this new .mtz file in refinement before depositing. For the next structure, remember to do this in the early stages.
--
John J. Tanner
Professor of Biochemistry and Chemistry
Associate Chair of Biochemistry
Department of Biochemistry
University of Missouri
117 Schweitzer Hall
503 S College Avenue
Columbia, MO 65211
Phone: 573-884-1280
Email: [email protected]mailto:[email protected]
https://cafnrfaculty.missouri.edu/tannerlab/
Lab: Schlundt Annex rooms 3,6,9, 203B, 203C
Office: Schlundt Annex 203A
From: [email protected]
Thanks a lot, Dr .Tanner. I will try the suggestion. Currently, I tried
running xds_ASCII.HKL on pointless followed by truncate. Seems like mtz is
now all good. But I am wondering if this is the right way to go about it?
On Wed, Jan 12, 2022 at 8:33 PM Tanner, John J.
This is a common problem. I use ccp4i CAD to remove all the columns from the .mtz file except for F, SIGF, and FreeR_flag, and then use this new .mtz file in refinement before depositing. For the next structure, remember to do this in the early stages.
--
John J. Tanner
Professor of Biochemistry and Chemistry
Associate Chair of Biochemistry
Department of Biochemistry
University of Missouri 117 Schweitzer Hall
503 S College Avenue Columbia, MO 65211 Phone: 573-884-1280
Email: [email protected]
https://cafnrfaculty.missouri.edu/tannerlab/ Lab: Schlundt Annex rooms 3,6,9, 203B, 203C
Office: Schlundt Annex 203A
*From: *[email protected] < [email protected]> on behalf of Viney Singh < [email protected]> *Date: *Wednesday, January 12, 2022 at 6:52 AM *To: *[email protected]
*Subject: *[phenixbb] regarding number of unique reflections *WARNING:* This message has originated from an External Source. This may be a phishing expedition that can result in unauthorized access to our IT System. Please use proper judgment and caution when opening attachments, clicking links, or responding to this email.
Dear all,
First of all sorry for the novice question.
I *processed* one of my datasets using xds. Since I was not expecting an anomalous signal, I kept Friedel's Law = True.
The number of unique reflections I got: *17,000*
I used xds_ASCII.HKL after correct.LP and converted it to mtz using Phenix with 10% reflections for Rfree calculations.
Now, when I am trying to* refine *the structure in Phenix, the structure is being refined against *32,300* reflections with *3230* reflections for Rfree calculation. Looks like refinement is treating Friedel's pair as two different reflections.
When I am trying to upload PDB on rcsb, on the refinement tab, the number of reflections used for refinement and Rfree calculations are shown as 32,300 and 3220 respectively, while in the validation report, no. of reflections used for Rfree calculation are shown as *1700*.
I would really appreciate if someone can guide me to resolve this discrepancy.
Thanks in advance.
Viney
No, I disagree strongly with that suggestion about pruning the data! The conversion of intensities to amplitudes loses information that cannot be recovered, so the intensities should *always* be included in the deposition. Depositing unmerged intensities (now possible, as Gerard says) is even better and, if you do that, I won’t complain about only depositing amplitudes in the merged data. Removing Bijvoet pairs when you’re sure there is no anomalous signal is fine (but if anyone disagrees and you’ve deposited unmerged intensities, they can get them back). Best wishes, Randy Read
On 12 Jan 2022, at 15:03, Tanner, John J.
wrote: This is a common problem. I use ccp4i CAD to remove all the columns from the .mtz file except for F, SIGF, and FreeR_flag, and then use this new .mtz file in refinement before depositing. For the next structure, remember to do this in the early stages.
--
John J. Tanner
Professor of Biochemistry and Chemistry
Associate Chair of Biochemistry
Department of Biochemistry
University of Missouri 117 Schweitzer Hall
503 S College Avenue Columbia, MO 65211 Phone: 573-884-1280
Email: [email protected] https://cafnrfaculty.missouri.edu/tannerlab/
Lab: Schlundt Annex rooms 3,6,9, 203B, 203C
Office: Schlundt Annex 203A
From: [email protected]
on behalf of Viney Singh Date: Wednesday, January 12, 2022 at 6:52 AM To: [email protected] Subject: [phenixbb] regarding number of unique reflections WARNING: This message has originated from an External Source. This may be a phishing expedition that can result in unauthorized access to our IT System. Please use proper judgment and caution when opening attachments, clicking links, or responding to this email.
Dear all,
First of all sorry for the novice question.
I processed one of my datasets using xds. Since I was not expecting an anomalous signal, I kept Friedel's Law = True.
The number of unique reflections I got: 17,000
I used xds_ASCII.HKL after correct.LP and converted it to mtz using Phenix with 10% reflections for Rfree calculations.
Now, when I am trying to refine the structure in Phenix, the structure is being refined against 32,300 reflections with 3230 reflections for Rfree calculation. Looks like refinement is treating Friedel's pair as two different reflections.
When I am trying to upload PDB on rcsb, on the refinement tab, the number of reflections used for refinement and Rfree calculations are shown as 32,300 and 3220 respectively, while in the validation report, no. of reflections used for Rfree calculation are shown as 1700.
I would really appreciate if someone can guide me to resolve this discrepancy.
Thanks in advance.
Viney
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb Unsubscribe: [email protected]
----- Randy J. Read Department of Haematology, University of Cambridge Cambridge Institute for Medical Research Tel: +44 1223 336500 The Keith Peters Building Fax: +44 1223 336827 Hills Road E-mail: [email protected] Cambridge CB2 0XY, U.K. www-structmed.cimr.cam.ac.uk
Dear Randy, I absolutely and very strongly agree with you on this point. There needs to be a return to considering where the genuinely experimental data are. Short of archiving the raw diffraction images with all the instrumental metadata that would enable their reprocessing with improved or more expertly used software, the closest approximation to what the experiment delivered lies in the unmerged intensities. This is why it seemed so important to do the extra work on the mmCIF dictionary (and, on the purely wwPDB side, on the deposition software) enabling their deposition into the PDB, along with the refined model itself. From this perspective, the stripping operation recommended by John is extremely counter-productive, even if expedient, and I join you in strongly advising not to use it. John, Viney: perhaps not all processing programs are yet producing the appropriate mmCIF files for the orderly deposition of unmerged data , but autoPROC does - see: https://www.globalphasing.com/autoproc/ With best wishes, Gerard. -- On Thu, Jan 13, 2022 at 10:21:23AM +0000, Randy John Read wrote:
No, I disagree strongly with that suggestion about pruning the data! The conversion of intensities to amplitudes loses information that cannot be recovered, so the intensities should *always* be included in the deposition. Depositing unmerged intensities (now possible, as Gerard says) is even better and, if you do that, I won’t complain about only depositing amplitudes in the merged data. Removing Bijvoet pairs when you’re sure there is no anomalous signal is fine (but if anyone disagrees and you’ve deposited unmerged intensities, they can get them back).
Best wishes,
Randy Read
On 12 Jan 2022, at 15:03, Tanner, John J.
wrote: This is a common problem. I use ccp4i CAD to remove all the columns from the .mtz file except for F, SIGF, and FreeR_flag, and then use this new .mtz file in refinement before depositing. For the next structure, remember to do this in the early stages.
--
John J. Tanner
Professor of Biochemistry and Chemistry
Associate Chair of Biochemistry
Department of Biochemistry
University of Missouri 117 Schweitzer Hall
503 S College Avenue Columbia, MO 65211 Phone: 573-884-1280
Email: [email protected] https://cafnrfaculty.missouri.edu/tannerlab/
Lab: Schlundt Annex rooms 3,6,9, 203B, 203C
Office: Schlundt Annex 203A
From: [email protected]
on behalf of Viney Singh Date: Wednesday, January 12, 2022 at 6:52 AM To: [email protected] Subject: [phenixbb] regarding number of unique reflections WARNING: This message has originated from an External Source. This may be a phishing expedition that can result in unauthorized access to our IT System. Please use proper judgment and caution when opening attachments, clicking links, or responding to this email.
Dear all,
First of all sorry for the novice question.
I processed one of my datasets using xds. Since I was not expecting an anomalous signal, I kept Friedel's Law = True.
The number of unique reflections I got: 17,000
I used xds_ASCII.HKL after correct.LP and converted it to mtz using Phenix with 10% reflections for Rfree calculations.
Now, when I am trying to refine the structure in Phenix, the structure is being refined against 32,300 reflections with 3230 reflections for Rfree calculation. Looks like refinement is treating Friedel's pair as two different reflections.
When I am trying to upload PDB on rcsb, on the refinement tab, the number of reflections used for refinement and Rfree calculations are shown as 32,300 and 3220 respectively, while in the validation report, no. of reflections used for Rfree calculation are shown as 1700.
I would really appreciate if someone can guide me to resolve this discrepancy.
Thanks in advance.
Viney
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb Unsubscribe: [email protected]
----- Randy J. Read Department of Haematology, University of Cambridge Cambridge Institute for Medical Research Tel: +44 1223 336500 The Keith Peters Building Fax: +44 1223 336827 Hills Road E-mail: [email protected] Cambridge CB2 0XY, U.K. www-structmed.cimr.cam.ac.uk
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb Unsubscribe: [email protected]
On 1/13/22 5:21 AM, Randy John Read wrote:
No, I disagree strongly with that suggestion about pruning the data!
The Original Poster's question arises from a manifestation of phenix.refine usurping user intent over which data to use during refinement. Pruning the data - which I do routinely - is a reaction to phenix.refine's very ill-advised behavior to pull intensities rather than structure factors from a MTZ file that contains both. It seems to then convert the intensities to F's and refine against those F's. phenix.refine also has an unhealthy tendency of grabbing the anomalous data and using that in refinement, even when there's little present. This ignores user intent as it is a conscious additional step to convert I to F after data processing. As long as phenix.refine maintains that default behavior, I will remove intensities from the reflection file prior to refinement. Often (as, I imagine, today) this will result in me not depositing intensities since I invariably deposit the files used in refinement. I can specify the reflection data used, but my command line for that program is already long, and the default behavior should not be to ignore user intent. I would expect that this is a significant source of discrepancy upon PDB deposition, and frequently at odds with what the user had intended to do. I don't think BUSTER behaves like that. I doubt that's true for REFMAC but I don't often run it from the command line. Fix the program to avoid second-guessing the experimenter, and you'll probably get more intensities deposited in PDB. Phil Jeffrey Princeton
Hi Viney, if your data are anomalous then you expect to have Fobs+, Fobs- and singletons. If your data are not anomalous, you should not have +/- mates in your data set. So then the real question is: since you do not expect any anomalous signal and thus assume your data set isn't anomalous (as you said), then how come you ended up with a data set that contains Fobs+, Fobs- and singletons? Perhaps you should go back to data processing step and make sure you don't get anomalous data set in the first place. Now as to counting... Fobs+ and Fobs- are the two entries in the array of float values, so they are counted as two, not one (in Phenix). Pavel On 1/12/22 04:41, Viney Singh wrote:
Dear all,
First of all sorry for the novice question.
I _processed_ one of my datasets using xds. Since I was not expecting an anomalous signal, I kept Friedel's Law = True. The number of unique reflections I got: *17,000* I used xds_ASCII.HKL after correct.LP and converted it to mtz using Phenix with 10% reflections for Rfree calculations.
Now, when I am trying to_refine _the structure in Phenix, the structure is being refined against *32,300* reflections with *3230* reflections for Rfree calculation. Looks like refinement is treating Friedel's pair as two different reflections.
When I am trying to upload PDB on rcsb, on the refinement tab, the number of reflections used for refinement and Rfree calculations are shown as 32,300 and 3220 respectively, while in the validation report, no. of reflections used for Rfree calculation are shown as *1700*.
I would really appreciate if someone can guide me to resolve this discrepancy.
Thanks in advance.
Viney
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb Unsubscribe:[email protected]
Hi Pavel, hi Viney,
you should NOT go back to data processing. Data processing should
result in unmerged data, including I+ and I-. The refinement program
(possibly under guidance of the user) should decide whether and how
data should be merged for refinement purposes. This decision should
also be reflected during data deposition when counting reflections,
however, it is best practice to deposit unmerged data. Whether or not
the PDB support deposition of unmerged data may be a different question.
Cheers,
Tim
On Wed, 12 Jan 2022 09:19:35 -0800 Pavel Afonine
Hi Viney,
if your data are anomalous then you expect to have Fobs+, Fobs- and singletons. If your data are not anomalous, you should not have +/- mates in your data set. So then the real question is: since you do not expect any anomalous signal and thus assume your data set isn't anomalous (as you said), then how come you ended up with a data set that contains Fobs+, Fobs- and singletons? Perhaps you should go back to data processing step and make sure you don't get anomalous data set in the first place.
Now as to counting... Fobs+ and Fobs- are the two entries in the array of float values, so they are counted as two, not one (in Phenix).
Pavel
On 1/12/22 04:41, Viney Singh wrote:
Dear all,
First of all sorry for the novice question.
I _processed_ one of my datasets using xds. Since I was not expecting an anomalous signal, I kept Friedel's Law = True. The number of unique reflections I got: *17,000* I used xds_ASCII.HKL after correct.LP and converted it to mtz using Phenix with 10% reflections for Rfree calculations.
Now, when I am trying to_refine _the structure in Phenix, the structure is being refined against *32,300* reflections with *3230* reflections for Rfree calculation. Looks like refinement is treating Friedel's pair as two different reflections.
When I am trying to upload PDB on rcsb, on the refinement tab, the number of reflections used for refinement and Rfree calculations are shown as 32,300 and 3220 respectively, while in the validation report, no. of reflections used for Rfree calculation are shown as *1700*.
I would really appreciate if someone can guide me to resolve this discrepancy.
Thanks in advance.
Viney
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb Unsubscribe:[email protected]
-- -- Tim Gruene Head of the Centre for X-ray Structure Analysis Faculty of Chemistry University of Vienna Phone: +43-1-4277-70202 GPG Key ID = A46BEE1A
Dear Tim, Your final question suggests that you might have missed the answer contained in the following announcement https://www.wwpdb.org/news/news?year=2021#60638da1931d5660393084c3 that clearly states that the deposition of unmerged data is now supported. These extensions of the mmCIF dictionary included the tidying up of the definitions related to the deposition of anomalous data. Your active participation in the on-going efforts to promote such depositions into the PDB would be of great benefit to the project. More information available on request :-)) . With best wishes, Gerard. -- On Wed, Jan 12, 2022 at 07:54:57PM +0100, Tim Gruene wrote:
Hi Pavel, hi Viney,
you should NOT go back to data processing. Data processing should result in unmerged data, including I+ and I-. The refinement program (possibly under guidance of the user) should decide whether and how data should be merged for refinement purposes. This decision should also be reflected during data deposition when counting reflections, however, it is best practice to deposit unmerged data. Whether or not the PDB support deposition of unmerged data may be a different question.
Cheers, Tim
On Wed, 12 Jan 2022 09:19:35 -0800 Pavel Afonine
wrote: Hi Viney,
if your data are anomalous then you expect to have Fobs+, Fobs- and singletons. If your data are not anomalous, you should not have +/- mates in your data set. So then the real question is: since you do not expect any anomalous signal and thus assume your data set isn't anomalous (as you said), then how come you ended up with a data set that contains Fobs+, Fobs- and singletons? Perhaps you should go back to data processing step and make sure you don't get anomalous data set in the first place.
Now as to counting... Fobs+ and Fobs- are the two entries in the array of float values, so they are counted as two, not one (in Phenix).
Pavel
On 1/12/22 04:41, Viney Singh wrote:
Dear all,
First of all sorry for the novice question.
I _processed_ one of my datasets using xds. Since I was not expecting an anomalous signal, I kept Friedel's Law = True. The number of unique reflections I got: *17,000* I used xds_ASCII.HKL after correct.LP and converted it to mtz using Phenix with 10% reflections for Rfree calculations.
Now, when I am trying to_refine _the structure in Phenix, the structure is being refined against *32,300* reflections with *3230* reflections for Rfree calculation. Looks like refinement is treating Friedel's pair as two different reflections.
When I am trying to upload PDB on rcsb, on the refinement tab, the number of reflections used for refinement and Rfree calculations are shown as 32,300 and 3220 respectively, while in the validation report, no. of reflections used for Rfree calculation are shown as *1700*.
I would really appreciate if someone can guide me to resolve this discrepancy.
Thanks in advance.
Viney
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb Unsubscribe:[email protected]
-- -- Tim Gruene Head of the Centre for X-ray Structure Analysis Faculty of Chemistry University of Vienna
Phone: +43-1-4277-70202
GPG Key ID = A46BEE1A
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb Unsubscribe: [email protected]
Dear Tim and others,
Thanks for your valuable insights.
Will it be okay if I treat Friedel's pair as separate reflections, even if
there is no anomalous signal for structure refinement. Can such reflections
(Friedel's pair) be considered unique reflections in that case?
If I treat them separately I still have a multiplicity of 6 and completion
of more than 95%.
Thanks again.
On Thu, Jan 13, 2022, 00:24 Tim Gruene
Hi Pavel, hi Viney,
you should NOT go back to data processing. Data processing should result in unmerged data, including I+ and I-. The refinement program (possibly under guidance of the user) should decide whether and how data should be merged for refinement purposes. This decision should also be reflected during data deposition when counting reflections, however, it is best practice to deposit unmerged data. Whether or not the PDB support deposition of unmerged data may be a different question.
Cheers, Tim
On Wed, 12 Jan 2022 09:19:35 -0800 Pavel Afonine
wrote: Hi Viney,
if your data are anomalous then you expect to have Fobs+, Fobs- and singletons. If your data are not anomalous, you should not have +/- mates in your data set. So then the real question is: since you do not expect any anomalous signal and thus assume your data set isn't anomalous (as you said), then how come you ended up with a data set that contains Fobs+, Fobs- and singletons? Perhaps you should go back to data processing step and make sure you don't get anomalous data set in the first place.
Now as to counting... Fobs+ and Fobs- are the two entries in the array of float values, so they are counted as two, not one (in Phenix).
Pavel
On 1/12/22 04:41, Viney Singh wrote:
Dear all,
First of all sorry for the novice question.
I _processed_ one of my datasets using xds. Since I was not expecting an anomalous signal, I kept Friedel's Law = True. The number of unique reflections I got: *17,000* I used xds_ASCII.HKL after correct.LP and converted it to mtz using Phenix with 10% reflections for Rfree calculations.
Now, when I am trying to_refine _the structure in Phenix, the structure is being refined against *32,300* reflections with *3230* reflections for Rfree calculation. Looks like refinement is treating Friedel's pair as two different reflections.
When I am trying to upload PDB on rcsb, on the refinement tab, the number of reflections used for refinement and Rfree calculations are shown as 32,300 and 3220 respectively, while in the validation report, no. of reflections used for Rfree calculation are shown as *1700*.
I would really appreciate if someone can guide me to resolve this discrepancy.
Thanks in advance.
Viney
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb Unsubscribe:[email protected]
-- -- Tim Gruene Head of the Centre for X-ray Structure Analysis Faculty of Chemistry University of Vienna
Phone: +43-1-4277-70202
GPG Key ID = A46BEE1A
participants (7)
-
Gerard Bricogne
-
Pavel Afonine
-
Phil Jeffrey
-
Randy John Read
-
Tanner, John J.
-
Tim Gruene
-
Viney Singh