Using partitioning and non-partitioning clustering algorithms for included proteins sequences in esophagus, stomach and colon cancer

Yalda Zarnegar Nia, Hamid Alavi Majd, Mona Azodi, Nasibeh Khayyer

Abstract


158

A thorough recognition of the nature and duties of the genes is based upon having adequate information about the proteins. However, the proteomic projects follow a slow trend; therefore, solving the protein-related problems has become as one of the most important challenges in bio-informatics. Consequently, the presence of tools which can enhance the structural recognition, classification, and interpretation of proteins will be advantageous. Statistical methods are among the tools to help solve bio-informatics problems. These methods may be used to help predict the third structures of proteins, study proteins collectively, as well as extract new interactions among the protein collections. One of the very efficient and useful methods in the collective study of protein subsets is the cluster analysis. In the present study, the recognized protein sequences related to esophagus, stomach, and colon cancers are analyzed through partitioning, non-partitioning, and fuzzy clustering methods. Needleman-Wunsch global alignment algorithm was used to determine pair-wise similarities. The evaluations have shown that the clusters obtained through using the AGNES method have produced more powerful structures; yet, it can be said that the PAM clustering method, compared to other ones, has produced the best results in predicting ability of the 3D structure of the unknown protein sequences.

Keywords


Clustering; Protein Sequence; 3D structure; Gastrointestinal Cancers

Full Text:

PDF

129



DOI: https://doi.org/10.22037/jps.v2i2.2339

Refbacks

  • There are currently no refbacks.


"Journal of Paramdedical Sciences", is a publication of "School of Paramedical Sciences, Shahid Beheshti University of Medical Sciences" and "Iranian Society of Medical Proteomics".

"Journal of Paramdedical Sciences" is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

 

EISSN: 2008-4978

PISSN: 2008-496X