JUL

Article: Identification of Subfamily-specific Sites based on Active Sites Modeling and Clustering

Abstract : ASMC method (Active Sites Modeling and Clustering) is a novel unsupervised method to classify sequences using structural information of protein pockets. The method predicts functional amino-acids by proposing active site SDP residues (Specificity Determining Position) and active site CP residues (Conserved Positions) profiles. ASMC combines homology modeling of family members, structural alignment of modeled active sites and a subsequent hierarchical conceptual classification of obtained alignments. Comparison of profiles obtained from computed clusters allows the identification of the residues correlated to sub-families function divergence.

Supplementary material

ASMC method has been validated on a benchmark of 42 Pfam families for which previous resolved holo-structures were available.

Supplementary material Download contains:

Table I and Table II: The test set benchmark

Table III: Average distance comparison

et al.

Figure I: ASMC tree of Family PF02274 (Amidinotransferase).

Figure II: Influence of the % of identity between targets and templates on ASMC performance.

Table IV: Comparison of ASMC performance with a similar procedure that uses only information on sequences

Article: Identification of Subfamily-specific Sites based on Active Sites Modeling and Clustering

Supplementary material

Home

About

An example

Contacts