Description

Protein threading, also known as fold recognition, is a method for the computational prediction of protein structure from amino acid sequence. Homology modelling serves the same purpose.

Classification of protein structure

The Structural Classification of Proteins (SCOP) database provides a detailed and comprehensive description of the structural and evolutionary relationships between proteins of known structure. Proteins are classified to reflect both structural and evolutionary relatedness. Many levels exist in the hierarchy, but the principal levels are family, superfamily and fold.

Protein threading

Protein threading, or fold recognition, is used for targets that have the same fold as proteins of known structure but have no homologous proteins of known structure. Protein threading predicts protein structures by using statistical knowledge of the relationship between structure and sequence. The prediction is made by "threading" (i.e., placing, aligning) each amino acid in the target sequence onto a position in the template structure and evaluating how well the target fits the template. After the best-fit template is selected, the structural model of the sequence is built from the alignment with the chosen template.

The protein threading method rests on two basic observations. One is that the number of different folds in nature is fairly small (approximately 1000); the other is that, according to PDB statistics, 90% of the new structures submitted to the PDB in the past three years have folds similar to ones already in the PDB.

Steps involved in protein threading

A general paradigm of protein threading consists of the following four steps:

Difference between protein threading and homology modelling

Homology modelling and protein threading are both template-based methods, and there is no rigorous boundary between them in terms of prediction techniques. The protein structures they target, however, are different. Homology modelling is for targets that have homologous proteins of known structure. As mentioned, protein threading is for targets for which only fold-level homology is found. In other words, homology modelling is for easy targets and protein threading is for hard targets.

Homology modelling treats the template in an alignment as a sequence, and only sequence homology is used for prediction. Protein threading treats the template in an alignment as a structure, and both sequence and structure information extracted from the alignment are used for prediction. When no significant homology is found, protein threading can still make a prediction based on the structure information. This also explains why protein threading may be more effective than homology modelling in many cases.

In practice, when the sequence identity in a sequence-sequence alignment is low (i.e. <25%), homology modelling may not produce a significant prediction. In this case, if distant homology is found for the target, protein threading can generate a good prediction.

More about threading

Fold recognition methods can be broadly divided into two types:
1. methods that derive a 1-D profile for each structure in the fold library and align the target sequence to these profiles;
2. methods that consider the full 3-D structure of the protein template.

A simple example of a profile representation would be to take each amino acid in the structure and label it according to whether it is buried in the core of the protein or exposed on the surface. More elaborate profiles might take into account the local secondary structure (e.g.
whether the amino acid is part of an alpha helix) or even evolutionary information (how conserved the amino acid is). In the 3-D representation, the structure is modelled as a set of inter-atomic distances, i.e. the distances are calculated between some or all of the atom pairs in the structure. This is a much richer and far more flexible description of the structure, but it is much harder to use in calculating an alignment.

The profile-based fold recognition approach was first described by Bowie, Lüthy and Eisenberg in 1991. The term threading was coined by Jones, Taylor and Thornton in 1992, and originally referred specifically to the use of a full 3-D atomic representation of the protein template in fold recognition. Today, the terms threading and fold recognition are frequently (though somewhat incorrectly) used interchangeably.

Fold recognition methods are widely used and effective because it is believed that there is a strictly limited number of different protein folds in nature, mostly as a result of evolution but also due to constraints imposed by the basic physics and chemistry of polypeptide chains. There is, therefore, a good chance (currently 70-80%) that a protein with a fold similar to that of the target protein has already been studied by X-ray crystallography or NMR spectroscopy and can be found in the PDB (Protein Data Bank). Currently just over 1100 different protein folds are known (see the CATH database statistics for the latest view), but new folds are still being discovered every year, thanks in part to the ongoing structural genomics projects.

Many different algorithms have been proposed for finding the correct threading of a sequence onto a structure, and many of them make use of dynamic programming in some form.
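
The 1-D profile approach with dynamic programming can be illustrated with a minimal sketch. It assumes a two-state profile ('B' for buried, 'E' for exposed), a toy scoring rule that rewards hydrophobic residues in buried positions and polar residues in exposed ones, and a fixed gap penalty. The function names, the residue grouping and the score values are all hypothetical choices made for illustration; they are not taken from any published threading method.

```python
# Toy 1-D profile threading: global alignment of a sequence to a
# buried/exposed profile. All names and score values are illustrative.

# Assumed (simplified) set of hydrophobic residues.
HYDROPHOBIC = set("AVLIMFWC")


def env_score(residue: str, env: str) -> int:
    """Score one residue at one template position ('B' buried, 'E' exposed)."""
    prefers_buried = residue in HYDROPHOBIC
    # +1 if the residue type suits the environment, -1 otherwise.
    return 1 if prefers_buried == (env == "B") else -1


def thread_profile(sequence: str, profile: str, gap: int = -2) -> int:
    """Best global alignment score of `sequence` against a 1-D `profile`."""
    n, m = len(sequence), len(profile)
    # dp[i][j] = best score aligning sequence[:i] with profile[:j]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * gap
    for j in range(1, m + 1):
        dp[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dp[i][j] = max(
                dp[i - 1][j - 1] + env_score(sequence[i - 1], profile[j - 1]),
                dp[i - 1][j] + gap,  # residue left unaligned
                dp[i][j - 1] + gap,  # template position left unaligned
            )
    return dp[n][m]


def best_template(sequence: str, library: dict) -> str:
    """Name of the library profile that the sequence fits best."""
    return max(library, key=lambda name: thread_profile(sequence, library[name]))
```

Calling `best_template("LKVDL", {"fold_a": "BEBEB", "fold_b": "EBEBE"})` scores the sequence against each profile in a small hypothetical fold library and returns the name of the best-scoring one. A realistic profile would carry richer per-position information, such as secondary structure or conservation, but the recurrence would have the same shape.
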
For full 3-D threading, the problem of identifying the best alignment is very difficult (it is an NP-hard problem), and researchers have used many combinatorial optimization methods, such as simulated annealing or branch-and-bound search, to arrive at heuristic solutions. It is interesting to compare threading methods with methods that attempt to align two protein structures (protein structural alignment); indeed, many of the same algorithms have been applied to both problems.
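
To make the 3-D case more concrete, the following sketch scores a placement of sequence residues onto template positions using pairwise contacts derived from inter-residue distances, and searches over placements with simulated annealing. It is a toy under stated assumptions: the template is given as a list of 3-D coordinates (one per position), the pair energy simply counts hydrophobic-hydrophobic contacts, and the cooling schedule is linear. The names `contacts`, `energy` and `anneal`, the 8.0 cutoff and the residue grouping are hypothetical and do not reproduce any particular published method.

```python
import math
import random

# Assumed (simplified) set of hydrophobic residues.
HYDROPHOBIC = set("AVLIMFWC")


def contacts(coords, cutoff=8.0):
    """Template position pairs (i, j), j >= i + 2, closer than `cutoff`."""
    pairs = []
    for i in range(len(coords)):
        for j in range(i + 2, len(coords)):
            if math.dist(coords[i], coords[j]) < cutoff:
                pairs.append((i, j))
    return pairs


def energy(sequence, placement, contact_pairs):
    """Toy pair energy: -1 per hydrophobic-hydrophobic contact."""
    # placement[k] is the template position occupied by sequence[k].
    at = {pos: sequence[k] for k, pos in enumerate(placement)}
    total = 0
    for i, j in contact_pairs:
        if at.get(i) in HYDROPHOBIC and at.get(j) in HYDROPHOBIC:
            total -= 1
    return total


def anneal(sequence, n_positions, contact_pairs, steps=2000, t0=1.0, seed=0):
    """Heuristic search for a low-energy, order-preserving placement."""
    rng = random.Random(seed)
    placement = sorted(rng.sample(range(n_positions), len(sequence)))
    current = energy(sequence, placement, contact_pairs)
    best, best_e = placement[:], current
    for step in range(steps):
        temp = t0 * (1 - step / steps) + 1e-3  # linear cooling, kept positive
        k = rng.randrange(len(placement))
        # Move residue k anywhere between its neighbours to keep the order.
        lo = placement[k - 1] + 1 if k > 0 else 0
        hi = placement[k + 1] - 1 if k < len(placement) - 1 else n_positions - 1
        candidate = placement[:]
        candidate[k] = rng.randint(lo, hi)
        cand_e = energy(sequence, candidate, contact_pairs)
        # Metropolis rule: always accept improvements, sometimes accept worse.
        if cand_e <= current or rng.random() < math.exp((current - cand_e) / temp):
            placement, current = candidate, cand_e
            if current < best_e:
                best, best_e = placement[:], current
    return best, best_e
```

Because the energy of each residue depends on which residues occupy the contacting positions, the score no longer decomposes position by position, which is why a simple dynamic-programming recurrence does not apply directly and a heuristic search such as this one is used instead. The result is not guaranteed to be optimal.
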