Protein Eng.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Institution: Gunma University Library || Sign In as Personal Subscriber
Full Text of this Article
Reprint (PDF) Version of this Article
Similar articles found in:
Protein Eng. Online
PubMed
PubMed Citation
This Article has been cited by:
other online articles
Search Medline for articles by:
Miyazawa, S. || Jernigan, R. L.
Alert me when:
new articles cite this article
Download to Citation Manager
Protein Engineering, Vol. 13, No. 7, 459-475, July 2000
¸¢ķ 2000 Oxford University Press

Identifying sequence–structure pairs undetected by sequence alignments

Sanzo Miyazawa1,2 and Robert L. Jernigan3

1 Faculty of Technology, Gunma University, Kiryu, Gunma 376, Japan and
3 Room B-116, Bldg 12B, MSC 5677, Laboratory of Experimental and Computational Biology, DBS, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-5677, USA

We examine how effectively simple potential functions previously developed can identify compatibilities between sequences and structures of proteins for database searches. The potential function consists of pairwise contact energies, repulsive packing potentials of residues for overly dense arrangement and short-range potentials for secondary structures, all of which were estimated from statistical preferences observed in known protein structures. Each potential energy term was modified to represent compatibilities between sequences and structures for globular proteins. Pairwise contact interactions in a sequence–structure alignment are evaluated in a mean field approximation on the basis of probabilities of site pairs to be aligned. Gap penalties are assumed to be proportional to the number of contacts at each residue position, and as a result gaps will be more frequently placed on protein surfaces than in cores. In addition to minimum energy alignments, we use probability alignments made by successively aligning site pairs in order by pairwise alignment probabilities. The results show that the present energy function and alignment method can detect well both folds compatible with a given sequence and, inversely, sequences compatible with a given fold, and yield mostly similar alignments for these two types of sequence and structure pairs. Probability alignments consisting of most reliable site pairs only can yield extremely small root mean square deviations, and including less reliable pairs increases the deviations. Also, it is observed that secondary structure potentials are usefully complementary to yield improved alignments with this method. Remarkably, by this method some individual sequence–structure pairs are detected having only 5–20% sequence identity.


This article has been cited by other articles:


Home page
Protein SciHome page
F. Melo, R. Sanchez, and A. Sali
Statistical potentials for fold assessment
Protein Sci., February 1, 2002; 11(2): 430 - 448.
[Abstract] [Full Text] [PDF]


Home page
Protein SciHome page
C. Geourjon, C. Combet, C. Blanchet, and G. Del¸«±age
Identification of related proteins with weak sequence identity using secondary structure information
Protein Sci., April 1, 2001; 10(4): 788 - 797.
[Abstract] [Full Text]





HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright ¸¢ķ 2000 Oxford University Press.