Friday 19th of April 2024
 

A Novel Entropy Based Segment Selection Technique for Extraction of Protein Sequence Motifs


M Chitralegha and K Thangavel

Bioinformatics is the combination of Biology, Mathematics and Information Technology. It is a study of management and analysis of De-oxyribo Nucleic Acid, Ribo Nucleic Acid and protein sequence data. In Bioinformatics, motif finding is one of the most popular problems which have got lot of applications in diagnosing the diseases, drug designing and protein classification. It is essential to have an efficient technique to explore sequence motif from protein sequences. Data mining is one such technique. Bioinformatics dataset frequently contains large volume of segments generated from protein sequences. However, all the generated protein segments may not yield potential motif patterns. The segments have no labels or classes. Hence, one has to apply unsupervised segment selection method to select the potential segments. In this paper, two novel unsupervised segment selection methods are proposed for first time based on Shannon Entropy and Singular Value Decomposition (SVD) based - Entropy. The proposed methods are evaluated using the benchmark K-Means clustering method. It is found that the proposed SVD-Entropy based segment selection produces more number of highly structurally similar clusters, through which we are able to generate significant motif patterns.

Keywords: Clustering, Data mining, protein sequence, Motif, SVD – Entropy.

Download Full-Text


ABOUT THE AUTHORS

M Chitralegha
Research Scholar, Department of Computer Science, Periyar University, Salem, Tamil Nadu, India-636 011

K Thangavel
Professor and Head, Department of Computer Science, Periyar University, Salem, Tamil Nadu, India-636 011


IJCSI Published Papers Indexed By:

 

 

 

 
+++
About IJCSI

IJCSI is a refereed open access international journal for scientific papers dealing in all areas of computer science research...

Learn more »
Join Us
FAQs

Read the most frequently asked questions about IJCSI.

Frequently Asked Questions (FAQs) »
Get in touch

Phone: +230 911 5482
Email: info@ijcsi.org

More contact details »