Phenix.ensemble_refinment questions
Dear Phenix community, I was hoping that some of you may be able to clarify some questions about Phenix.ensemble_refinement: 1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT ("The overall correlation coefficient between the electron-density map from the ensemble model (obtained without experimental phases) and the experimentally phased electron-density map was 0.903, compared with 0.873 and 0.895 for the published and re-refined single structures. These seemingly small improvements in over- all quality indicators allow for significant local improvements."). I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this? In addition, the phenix.ensemble_refinement keywords include: * experimental_phases * file_name = None * labels = None * Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)? 2. Is the correct procedure in parameter optimisation to optimise pTLS, then temperature offest and then tx? Or is it possible that the best ensemble may be generated from any (non-directional) combination of the above parameters? 3. What is the function of the "nproc" keyword? If this is the number of CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster? 4. Can anybody suggest a good way to compare the ensembles generated by phenix.ensemble_refinement with the trajectories of traditional MD simulations (for e.g. produced by NAMD? ;) Finally, I noticed that I cannot run phenix.ensemble_refinement using a "my_parameters.eff" file, it is necessary to type on the command line. Alternatively, the queueing system for the supercomputer I use is managed by SLURM, however, I cannot submit batch jobs via the appropriate command sbatch with anything other than default parameters (keywords in a parameters.eff or that are typed explicitly following the mycoords.pdb mydata.mtz myrestraints.cif files are not incorporated into the resulting parameters of the .log file). Could these issues be due to my local environment setup? I am running Phenix 1.9-1692 Platform: intel-linux-2.6-x86_64 redhat-e6.5. Sorry for the long email and many thanks in advance for you help! Best regards, Joseph. Joseph Brock | PhD Division of Physiological Chemistry II Department of Medical Biochemistry and Biophysics Karolinska Institutet Scheeles väg 2 SE-171 77 Stockholm, Sweden
Hi Joseph,
1) IIRC the fully phased SFs weren't deposited and we asked the
authors for them. Whilst I was developing ER we hadn't allowed for
phase information to be used in the target function.
2) It is possible that a optimal parameters are found by grid
searching all three parameters at once. If no value for tx is
supplied it will estimate Tx based on resolution and this is typically
very reliable.
3) With a cluster I would suggest using the auto Tx setting as above
and running a grid of temperature offsets and ptls values: e.g 12
parallel of 1, 2.5, 5 (temp) and 0.95, 0.9, 0.8, 0.6 (ptls).
More details can be found here:
http://www.phenix-online.org/newsletter/CCN_2013_07.pdf
I haven't been actively developing ensemble refinement for a while now
so other may have more up-to-date answers....
Hope that helps,
Tom
On 13 November 2014 14:46, Joseph Brock
Dear Phenix community,
I was hoping that some of you may be able to clarify some questions about Phenix.ensemble_refinement:
1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT ("The overall correlation coefficient between the electron-density map from the ensemble model (obtained without experimental phases) and the experimentally phased electron-density map was 0.903, compared with 0.873 and 0.895 for the published and re-refined single structures. These seemingly small improvements in over- all quality indicators allow for significant local improvements."). I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this?
In addition, the phenix.ensemble_refinement keywords include:
experimental_phases
file_name = None labels = None
Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)?
2. Is the correct procedure in parameter optimisation to optimise pTLS, then temperature offest and then tx? Or is it possible that the best ensemble may be generated from any (non-directional) combination of the above parameters?
3. What is the function of the "nproc" keyword? If this is the number of CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster?
4. Can anybody suggest a good way to compare the ensembles generated by phenix.ensemble_refinement with the trajectories of traditional MD simulations (for e.g. produced by NAMD? ;)
Finally, I noticed that I cannot run phenix.ensemble_refinement using a "my_parameters.eff" file, it is necessary to type on the command line.
Alternatively, the queueing system for the supercomputer I use is managed by SLURM, however, I cannot submit batch jobs via the appropriate command sbatch with anything other than default parameters (keywords in a parameters.eff or that are typed explicitly following the mycoords.pdb mydata.mtz myrestraints.cif files are not incorporated into the resulting parameters of the .log file).
Could these issues be due to my local environment setup?
I am running Phenix 1.9-1692 Platform: intel-linux-2.6-x86_64 redhat-e6.5.
Sorry for the long email and many thanks in advance for you help!
Best regards,
Joseph.
Joseph Brock | PhD Division of Physiological Chemistry II Department of Medical Biochemistry and Biophysics Karolinska Institutet Scheeles väg 2 SE-171 77 Stockholm, Sweden
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
Hi Tom, thanks for keeping eye on tools you have developed even though you are not working on this any more!
I haven't been actively developing ensemble refinement for a while now so other may have more up-to-date answers....
There was no major changes to the code since you left, so I'm sure your answers are up-to-date! Pavel
On Thu, Nov 13, 2014 at 6:46 AM, Joseph Brock
1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT... I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this?
AutoSol can be used to re-solve such datasets, although in the case of 1YTT it requires additional information that wasn't deposited.
Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)?
It is possible, but completely untested aside from verifying that it doesn't crash. I added this a year ago at the request of another user but haven't looked into it since. 3. What is the function of the "nproc" keyword? If this is the number of
CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster?
The only parallelization is in the optimization of the ptls parameter - i.e. if you try N values for ptls, you can run N jobs at once. For a single ptls it will run in serial. So on a cluster, you are better off running N different jobs separately. Finally, I noticed that I cannot run phenix.ensemble_refinement using a
"my_parameters.eff" file, it is necessary to type on the command line.
That sounds like a bug... -Nat
Hi All,
As I am (I think) the other user that Nat is referring to, I'll comment.
I requested support for experimental phase information in the PHENIX
ensemble refinement target and can verify that it is accepted (in my case
as HL coeffs) and will run. The result in my test case was not
dramatically different than an amplitude-based target, but obviously a
great many factors could affect this. What differences I saw were minor
improvements in R/Rfree (~1%) with Se-Met SAD phases in a 1.05 Å
resolution structure of a flexible protein. I've not dug too much more
into this, but I can verify that phases are accepted and do influence the
final ensemble.
Best regards,
Mark
Mark A. Wilson
Associate Professor
Department of Biochemistry/Redox Biology Center
University of Nebraska
N118 Beadle Center
1901 Vine Street
Lincoln, NE 68588
(402) 472-3626
[email protected]
On 11/13/14 9:53 AM, "Nathaniel Echols"
On Thu, Nov 13, 2014 at 6:46 AM, Joseph Brock
wrote: 1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT... I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this?
AutoSol can be used to re-solve such datasets, although in the case of 1YTT it requires additional information that wasn't deposited.
Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)?
It is possible, but completely untested aside from verifying that it doesn't crash. I added this a year ago at the request of another user but haven't looked into it since.
3. What is the function of the "nproc" keyword? If this is the number of CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster?
The only parallelization is in the optimization of the ptls parameter - i.e. if you try N values for ptls, you can run N jobs at once. For a single ptls it will run in serial. So on a cluster, you are better off running N different jobs separately.
Finally, I noticed that I cannot run phenix.ensemble_refinement using a "my_parameters.eff" file, it is necessary to type on the command line.
That sounds like a bug...
-Nat
Dear Mark, for a 1A structure, at least when it is near completeness, I would expect the phase information from the model much, much more accurate than from a SAD experiment, i.e. I'd rather expect the R value to get worse when you include SAD phasing information. For example, when developers show the phase errors of their methods for phasing experiments, the error is generally calculated with respect to the final model. Did you observe the drop at an early stage of refinement with a low complete model? Regards, Tim On 11/13/2014 05:03 PM, Mark Wilson wrote:
Hi All, As I am (I think) the other user that Nat is referring to, I'll comment. I requested support for experimental phase information in the PHENIX ensemble refinement target and can verify that it is accepted (in my case as HL coeffs) and will run. The result in my test case was not dramatically different than an amplitude-based target, but obviously a great many factors could affect this. What differences I saw were minor improvements in R/Rfree (~1%) with Se-Met SAD phases in a 1.05 Å resolution structure of a flexible protein. I've not dug too much more into this, but I can verify that phases are accepted and do influence the final ensemble. Best regards, Mark
Mark A. Wilson Associate Professor Department of Biochemistry/Redox Biology Center University of Nebraska N118 Beadle Center 1901 Vine Street Lincoln, NE 68588 (402) 472-3626 [email protected]
On 11/13/14 9:53 AM, "Nathaniel Echols"
wrote: On Thu, Nov 13, 2014 at 6:46 AM, Joseph Brock
wrote: 1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT... I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this?
AutoSol can be used to re-solve such datasets, although in the case of 1YTT it requires additional information that wasn't deposited.
Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)?
It is possible, but completely untested aside from verifying that it doesn't crash. I added this a year ago at the request of another user but haven't looked into it since.
3. What is the function of the "nproc" keyword? If this is the number of CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster?
The only parallelization is in the optimization of the ptls parameter - i.e. if you try N values for ptls, you can run N jobs at once. For a single ptls it will run in serial. So on a cluster, you are better off running N different jobs separately.
Finally, I noticed that I cannot run phenix.ensemble_refinement using a "my_parameters.eff" file, it is necessary to type on the command line.
That sounds like a bug...
-Nat
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
-- Dr Tim Gruene Institut fuer anorganische Chemie Tammannstr. 4 D-37077 Goettingen GPG Key ID = A46BEE1A
Hi Tim,
That is a fair point, although the observed (very modest) improvement with
phases was for the final ensemble. My motivation for looking into this is
that some of our unpublished work has indicated that even weak phase
information that is properly handled with a maximum likelihood target
(hence my use of HL coeffs with MLHL) can improve the modeling of disorder
using "CNS-style" non-interacting multicopy ensemble models. These are
distinct from the ensembles implemented in PHENIX in that they are not
time-averaged MD models but rather a set of n full replicates of the model
that do not interact with each other and are allowed to collectively fit
the data. I wanted to see how time-averaged ensembles compared to this
approach when phase information was included.
The overarching rationale for these tests is that phase information
provides a powerful set of additional experimental observations that may
counteract the intrinsic tendency of ensemble approaches to overfit data,
even if the phase information is noisy. In any event, I certainly did not
see a detrimental effect of added phase information on the R values of the
ensemble models with optimum choices of pTLS and using the MLHL target.
Your broader point however, is a good one-perhaps the true value of
including phase data isn't evident unless unusually high quality
experimental phases are being used.
Best regards,
Mark
Mark A. Wilson
Associate Professor
Department of Biochemistry/Redox Biology Center
University of Nebraska
N118 Beadle Center
1901 Vine Street
Lincoln, NE 68588
(402) 472-3626
[email protected]
On 11/14/14 2:50 AM, "Tim Gruene"
Dear Mark,
for a 1A structure, at least when it is near completeness, I would expect the phase information from the model much, much more accurate than from a SAD experiment, i.e. I'd rather expect the R value to get worse when you include SAD phasing information. For example, when developers show the phase errors of their methods for phasing experiments, the error is generally calculated with respect to the final model.
Did you observe the drop at an early stage of refinement with a low complete model?
Regards, Tim
On 11/13/2014 05:03 PM, Mark Wilson wrote:
Hi All, As I am (I think) the other user that Nat is referring to, I'll comment. I requested support for experimental phase information in the PHENIX ensemble refinement target and can verify that it is accepted (in my case as HL coeffs) and will run. The result in my test case was not dramatically different than an amplitude-based target, but obviously a great many factors could affect this. What differences I saw were minor improvements in R/Rfree (~1%) with Se-Met SAD phases in a 1.05 Å resolution structure of a flexible protein. I've not dug too much more into this, but I can verify that phases are accepted and do influence the final ensemble. Best regards, Mark
Mark A. Wilson Associate Professor Department of Biochemistry/Redox Biology Center University of Nebraska N118 Beadle Center 1901 Vine Street Lincoln, NE 68588 (402) 472-3626 [email protected]
On 11/13/14 9:53 AM, "Nathaniel Echols"
wrote: On Thu, Nov 13, 2014 at 6:46 AM, Joseph Brock
wrote: 1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT... I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this?
AutoSol can be used to re-solve such datasets, although in the case of 1YTT it requires additional information that wasn't deposited.
Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)?
It is possible, but completely untested aside from verifying that it doesn't crash. I added this a year ago at the request of another user but haven't looked into it since.
3. What is the function of the "nproc" keyword? If this is the number of CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster?
The only parallelization is in the optimization of the ptls parameter - i.e. if you try N values for ptls, you can run N jobs at once. For a single ptls it will run in serial. So on a cluster, you are better off running N different jobs separately.
Finally, I noticed that I cannot run phenix.ensemble_refinement using a "my_parameters.eff" file, it is necessary to type on the command line.
That sounds like a bug...
-Nat
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
-- Dr Tim Gruene Institut fuer anorganische Chemie Tammannstr. 4 D-37077 Goettingen
GPG Key ID = A46BEE1A
I greatly appreciate all the helpful info everyone.
Thanks!
-Joseph.
Joseph Brock | PhD
Division of Physiological Chemistry II
Department of Medical Biochemistry and Biophysics
Karolinska Institutet
Scheeles väg 2
SE-171 77 Stockholm, Sweden
________________________________________
From: Mark Wilson [[email protected]]
Sent: Thursday, 13 November 2014 5:03 PM
To: Nathaniel Echols; Joseph Brock
Cc: [email protected]
Subject: Re: [phenixbb] Phenix.ensemble_refinment questions
Hi All,
As I am (I think) the other user that Nat is referring to, I'll comment.
I requested support for experimental phase information in the PHENIX
ensemble refinement target and can verify that it is accepted (in my case
as HL coeffs) and will run. The result in my test case was not
dramatically different than an amplitude-based target, but obviously a
great many factors could affect this. What differences I saw were minor
improvements in R/Rfree (~1%) with Se-Met SAD phases in a 1.05 Å
resolution structure of a flexible protein. I've not dug too much more
into this, but I can verify that phases are accepted and do influence the
final ensemble.
Best regards,
Mark
Mark A. Wilson
Associate Professor
Department of Biochemistry/Redox Biology Center
University of Nebraska
N118 Beadle Center
1901 Vine Street
Lincoln, NE 68588
(402) 472-3626
[email protected]
On 11/13/14 9:53 AM, "Nathaniel Echols"
On Thu, Nov 13, 2014 at 6:46 AM, Joseph Brock
wrote: 1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT... I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this?
AutoSol can be used to re-solve such datasets, although in the case of 1YTT it requires additional information that wasn't deposited.
Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)?
It is possible, but completely untested aside from verifying that it doesn't crash. I added this a year ago at the request of another user but haven't looked into it since.
3. What is the function of the "nproc" keyword? If this is the number of CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster?
The only parallelization is in the optimization of the ptls parameter - i.e. if you try N values for ptls, you can run N jobs at once. For a single ptls it will run in serial. So on a cluster, you are better off running N different jobs separately.
Finally, I noticed that I cannot run phenix.ensemble_refinement using a "my_parameters.eff" file, it is necessary to type on the command line.
That sounds like a bug...
-Nat
Hi all,
I hope it is OK if I revive this old thread to ask a few additional questions regarding phenix.ensemble_refinement, specifically regarding its use with data at atomic resolution.
While I had good and immediate results using the software with data sets at around 1.7 Å, I have been having a very hard time with a data set of 1.1 Å resolution, phenix.refine R/Rfree values of ~12/13% (alt conformations removed and ligand occupancy refined).
I have now tried allot of different combinations (~180!) screening:
- ptls values of 0.6, 0.7, 0.8, 0.9 and 0.95,
- wxray_coupled_tbath_offset values of 1, 2.5 and 5,
- and primary_map_cutoff values of 2.5, 3.0 and 3.5
combined with various different TLS definitions and restraints on ligands. I also tried to manually set tx values to 1.5, 2.0 and 2.5 to reflect those of a dataset of similar resolution (1KZK) reported in Burnely et. al., 2012. However, ensemble refinement consistently makes the model considerably worse, with the best Rfree values I have achieved being ~15.
It just occurred to me that increasing the x-ray weight with wxray_coupled_tbath_offset values of ~10 would probably make sense for data of this quality, but I would be greatly appreciate any further tips if anybody has specific experience with using phenix.ensemble_refinement with high resolution datasets.
On a different note, Burnely et. al., 2012, shows some very cool figures in which atoms are coloured by atom probability. If there is an easy way to output these values, to say the occupancy column in the resulting .pdb files, I would love to hear it!
Thanks!
-Joseph.
Joseph Brock | PhD
Division of Physiological Chemistry II
Department of Medical Biochemistry and Biophysics
Karolinska Institutet
Scheeles väg 2
SE-171 77 Stockholm, Sweden
________________________________________
From: Joseph Brock
Sent: Monday, 17 November 2014 9:25 AM
To: Mark Wilson; Nathaniel Echols
Cc: [email protected]
Subject: RE: [phenixbb] Phenix.ensemble_refinment questions
I greatly appreciate all the helpful info everyone.
Thanks!
-Joseph.
Joseph Brock | PhD
Division of Physiological Chemistry II
Department of Medical Biochemistry and Biophysics
Karolinska Institutet
Scheeles väg 2
SE-171 77 Stockholm, Sweden
________________________________________
From: Mark Wilson [[email protected]]
Sent: Thursday, 13 November 2014 5:03 PM
To: Nathaniel Echols; Joseph Brock
Cc: [email protected]
Subject: Re: [phenixbb] Phenix.ensemble_refinment questions
Hi All,
As I am (I think) the other user that Nat is referring to, I'll comment.
I requested support for experimental phase information in the PHENIX
ensemble refinement target and can verify that it is accepted (in my case
as HL coeffs) and will run. The result in my test case was not
dramatically different than an amplitude-based target, but obviously a
great many factors could affect this. What differences I saw were minor
improvements in R/Rfree (~1%) with Se-Met SAD phases in a 1.05 Å
resolution structure of a flexible protein. I've not dug too much more
into this, but I can verify that phases are accepted and do influence the
final ensemble.
Best regards,
Mark
Mark A. Wilson
Associate Professor
Department of Biochemistry/Redox Biology Center
University of Nebraska
N118 Beadle Center
1901 Vine Street
Lincoln, NE 68588
(402) 472-3626
[email protected]
On 11/13/14 9:53 AM, "Nathaniel Echols"
On Thu, Nov 13, 2014 at 6:46 AM, Joseph Brock
wrote: 1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT... I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this?
AutoSol can be used to re-solve such datasets, although in the case of 1YTT it requires additional information that wasn't deposited.
Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)?
It is possible, but completely untested aside from verifying that it doesn't crash. I added this a year ago at the request of another user but haven't looked into it since.
3. What is the function of the "nproc" keyword? If this is the number of CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster?
The only parallelization is in the optimization of the ptls parameter - i.e. if you try N values for ptls, you can run N jobs at once. For a single ptls it will run in serial. So on a cluster, you are better off running N different jobs separately.
Finally, I noticed that I cannot run phenix.ensemble_refinement using a "my_parameters.eff" file, it is necessary to type on the command line.
That sounds like a bug...
-Nat
Hi Joseph,
It may be worth trying as well:
- wxray_coupled_tbath_offset values of 0.5 and 10,
- and primary_map_cutoff values of 2.0.
I'll see if I can find the ensemble probability scripts and send them to you.
Cheers,
Tom
On 4 December 2014 at 10:13, Joseph Brock
Hi all,
I hope it is OK if I revive this old thread to ask a few additional questions regarding phenix.ensemble_refinement, specifically regarding its use with data at atomic resolution.
While I had good and immediate results using the software with data sets at around 1.7 Å, I have been having a very hard time with a data set of 1.1 Å resolution, phenix.refine R/Rfree values of ~12/13% (alt conformations removed and ligand occupancy refined).
I have now tried allot of different combinations (~180!) screening:
- ptls values of 0.6, 0.7, 0.8, 0.9 and 0.95,
- wxray_coupled_tbath_offset values of 1, 2.5 and 5,
- and primary_map_cutoff values of 2.5, 3.0 and 3.5
combined with various different TLS definitions and restraints on ligands. I also tried to manually set tx values to 1.5, 2.0 and 2.5 to reflect those of a dataset of similar resolution (1KZK) reported in Burnely et. al., 2012. However, ensemble refinement consistently makes the model considerably worse, with the best Rfree values I have achieved being ~15.
It just occurred to me that increasing the x-ray weight with wxray_coupled_tbath_offset values of ~10 would probably make sense for data of this quality, but I would be greatly appreciate any further tips if anybody has specific experience with using phenix.ensemble_refinement with high resolution datasets.
On a different note, Burnely et. al., 2012, shows some very cool figures in which atoms are coloured by atom probability. If there is an easy way to output these values, to say the occupancy column in the resulting .pdb files, I would love to hear it!
Thanks!
-Joseph.
Joseph Brock | PhD Division of Physiological Chemistry II Department of Medical Biochemistry and Biophysics Karolinska Institutet Scheeles väg 2 SE-171 77 Stockholm, Sweden
________________________________________ From: Joseph Brock Sent: Monday, 17 November 2014 9:25 AM To: Mark Wilson; Nathaniel Echols Cc: [email protected] Subject: RE: [phenixbb] Phenix.ensemble_refinment questions
I greatly appreciate all the helpful info everyone.
Thanks!
-Joseph.
Joseph Brock | PhD Division of Physiological Chemistry II Department of Medical Biochemistry and Biophysics Karolinska Institutet Scheeles väg 2 SE-171 77 Stockholm, Sweden
________________________________________ From: Mark Wilson [[email protected]] Sent: Thursday, 13 November 2014 5:03 PM To: Nathaniel Echols; Joseph Brock Cc: [email protected] Subject: Re: [phenixbb] Phenix.ensemble_refinment questions
Hi All, As I am (I think) the other user that Nat is referring to, I'll comment. I requested support for experimental phase information in the PHENIX ensemble refinement target and can verify that it is accepted (in my case as HL coeffs) and will run. The result in my test case was not dramatically different than an amplitude-based target, but obviously a great many factors could affect this. What differences I saw were minor improvements in R/Rfree (~1%) with Se-Met SAD phases in a 1.05 Å resolution structure of a flexible protein. I've not dug too much more into this, but I can verify that phases are accepted and do influence the final ensemble. Best regards, Mark
Mark A. Wilson Associate Professor Department of Biochemistry/Redox Biology Center University of Nebraska N118 Beadle Center 1901 Vine Street Lincoln, NE 68588 (402) 472-3626 [email protected]
On 11/13/14 9:53 AM, "Nathaniel Echols"
wrote: On Thu, Nov 13, 2014 at 6:46 AM, Joseph Brock
wrote: 1. In the associated publication (Burnley et al. eLife 2012;), the ensemble refinement is validated by comparing the correlation of the ensemble generated map, with the map generated from the experimental phases for PDB entry 1YTT... I am confused how one computes an experimentally phased from structure factors deposited in the PDB that contain only anomalous intensities/amplitudes and not Hendrickson-Lattman coefficients. Is there a program within the phenix package that can do this?
AutoSol can be used to re-solve such datasets, although in the case of 1YTT it requires additional information that wasn't deposited.
Is it possible to include experimental phases during the rolling average refinement process and could this be beneficial (if the phases were of a sufficient quality)?
It is possible, but completely untested aside from verifying that it doesn't crash. I added this a year ago at the request of another user but haven't looked into it since.
3. What is the function of the "nproc" keyword? If this is the number of CPU cores that can be used in parallel, what is the most efficient way of using phenix.ensemble_refinement on a cluster?
The only parallelization is in the optimization of the ptls parameter - i.e. if you try N values for ptls, you can run N jobs at once. For a single ptls it will run in serial. So on a cluster, you are better off running N different jobs separately.
Finally, I noticed that I cannot run phenix.ensemble_refinement using a "my_parameters.eff" file, it is necessary to type on the command line.
That sounds like a bug...
-Nat
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
participants (6)
-
Joseph Brock
-
Mark Wilson
-
Nathaniel Echols
-
Pavel Afonine
-
Tim Gruene
-
Tom Burnley