Article: Identification of Subfamily-specific Sites based on Active Sites Modeling and Clustering
Abstract : ASMC method (Active Sites Modeling and Clustering) is a novel unsupervised method to classify sequences using structural information of protein pockets. The method predicts functional amino-acids by proposing active site SDP residues (Specificity Determining Position) and active site CP residues (Conserved Positions) profiles. ASMC combines homology modeling of family members, structural alignment of modeled active sites and a subsequent hierarchical conceptual classification of obtained alignments. Comparison of profiles obtained from computed clusters allows the identification of the residues correlated to sub-families function divergence.