Cryo-EM Structure Deposition
Background
Once a protein structure is solved, it is usually deposited
in the Worldwide Protein Data Bank (wwPDB). Deposition is necessary for
publication, as most journals require a PDB ID to be included in any
manuscript under review that describes a new structure.
The wwPDB is an invaluable resource that contains over 150,000 structures
(as of June 2019).
Procedure
The model files (PDB and mmCIF) generated by phenix.real_space_refine contain
information that is required when depositing a structure to the wwPDB. Starting
in July 2019, the wwPDB will only accept model files in mmCIF format for
deposition of crystallographic structures, while cryo-EM structures can still
use the PDB format. To aid in the eventual transition to using only mmCIF for
deposition, we recommend that you start using mmCIF for deposition of cryo-EM
structures as well.
To generate a model file suitable for deposition, a two-stage process is
currently recommended:
- By default, phenix.real_space_refine will output model files in mmCIF and PDB format.
If you turned off the option for mmCIF output, run a final cycle of
phenix.real_space_refine that writes mmCIF files for model and data (this can be set in
the Output section of the GUI).
- The model file (mmCIF) and a sequence file can then be processed with the
mmtbx.prepare_pdb_deposition program to create an mmCIF file with the sequence.
This program requires the full sequence for the macromolecule to be provided.
In the GUI, this program is in the "PDB Deposition" section of tools.
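Before uploading, it can be useful to confirm that the second stage actually added sequence information to the mmCIF file. The following is a minimal sketch, not part of Phenix: it assumes the sequence is stored in the standard PDBx/mmCIF categories _entity_poly and _entity_poly_seq, and it only scans the text for data names in those categories rather than parsing the file. The function names are hypothetical.

```python
import re

# PDBx/mmCIF categories that carry polymer sequence information
# (assumption: these are the categories to look for in the deposition file).
SEQUENCE_CATEGORIES = ("_entity_poly", "_entity_poly_seq")


def sequence_categories_present(cif_text):
    """Return a dict mapping each sequence category to True or False."""
    found = {}
    for category in SEQUENCE_CATEGORIES:
        # A data name has the form "_category.item" at the start of a line.
        pattern = r"^\s*" + re.escape(category) + r"\.\w+"
        match = re.search(pattern, cif_text, flags=re.MULTILINE)
        found[category] = match is not None
    return found


def has_sequence(cif_text):
    """Return True only if every sequence category appears in the text."""
    return all(sequence_categories_present(cif_text).values())
```

To check a file, read its contents into a string and pass it to has_sequence. A result of False suggests that the file has not yet been processed with a sequence file; a result of True only shows that the categories exist, not that their contents are correct.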
Related programs
- phenix.validation_cryoem: This program can be used to generate the table of statistics.
- mmtbx.prepare_pdb_deposition:
This program takes the mmCIF output from phenix.real_space_refine and adds the sequence
information necessary for deposition into the Protein Data Bank.
- phenix.get_pdb_validation_report:
This program submits your model and data in CIF format to the PDB OneDep
server to get a validation report in PDF and XML formats.
References
- Announcing mandatory submission of PDBx/mmCIF format files for crystallographic depositions to the Protein Data Bank (PDB). P.D. Adams, P.V. Afonine, K. Baskaran, H.M. Berman, J. Berrisford, G. Bricogne, D.G. Brown, S.K. Burley, M. Chen, Z. Feng, C. Flensburg, A. Gutmanas, J.C. Hoch, Y. Ikegawa, Y. Kengaku, E. Krissinel, G. Kurisu, Y. Liang, D. Liebschner, L. Mak, J.L. Markley, N.W. Moriarty, G.N. Murshudov, M. Noble, E. Peisach, I. Persikova, B.K. Poon, O.V. Sobolev, E.L. Ulrich, S. Velankar, C. Vonrhein, J. Westbrook, M. Wojdyr, M. Yokochi, and J.Y. Young. Acta Cryst. D75, 451-454 (2019).