Dear CCTBX developers,

 

I am a postdoc at EPFL working with HHpred for homology modeling of membrane proteins.

 

I have been trying to write my own HHpred alignment parser until I found the python script under “cctbx_fork/iotbx/bioinformatics/__init__.py/” that contains an HHpred parser.

 

My goal is to correctly parse the raw HHpred output file (.hhr), which involves unwrapping every alignment, parsing out a lot of text to finally obtain something like this:

 

>pdb_name

query-sequence

column score

 

Example:

 

>4U15

VYGFIGGIFGFMSIMTMAMISIDRYNVIGRPMAASKKMSHRRAFIMIIFVWLWS

+........+..+..++|+++|++++.++.+.++++ +..+.++.+|+++|++.++...+........     +...|..

 

 

Being somewhat new to python, I was wondering whether the people who wrote this script are still around and could help me figure out whether the parser could be implemented in such a way.

 

Thanks for any help you can provide!

 

Best,

 

Louis D