A pattern recognition model to distinguish cancerous DNA sequences via signal processing methods

  • PDF / 3,498,548 Bytes
  • 20 Pages / 595.276 x 790.866 pts Page_size
  • 57 Downloads / 159 Views

DOWNLOAD

REPORT


(0123456789().,-volV)(0123456789(). ,- volV)

METHODOLOGIES AND APPLICATION

A pattern recognition model to distinguish cancerous DNA sequences via signal processing methods Amin Khodaei1 • Mohammad-Reza Feizi-Derakhshi1 • Behzad Mozaffari-Tazehkand1

 Springer-Verlag GmbH Germany, part of Springer Nature 2020

Abstract Cancer is one of the life-threatening diseases caused by changes in the structure of genetic components of the cell. DNA sequences are one of the most important factors in the formation and spread of this disease. The signal processing approach is one of the scientific fields that has been developed in the last two decades in the analysis of DNA sequences. In this research, a hybrid model of discrete Fourier transform and anti-notch digital filter has been used for this purpose. The aim of using these techniques is to model an approach that can distinguish cancerous samples from non-cancerous ones. In other words, a pattern recognition model is designed to discriminate cancerous cell samples based on the features of protein coding regions of DNA sequences. Some computational and statistical techniques have been used in feature extraction and feature selection stages. Despite the proposed model simplicity, it doesn’t face conventional challenges such as high computational complexity or memory dissipation. Case studies have been tested with the least possible feature, depending on the nature of the features. Experimental results and features relationship led to the proposal of the SVM classifier to discriminate two categories. The output features and classification show good discrimination results among the cancerous and non-cancerous samples. One of the main advantages of the proposed model is the independence of its performance over the data length. Evaluation and validation results indicate the high accuracy and precision of the proposed method which emphasizes the biological genetic mutation nature of cancer. Keywords Cancer  DNA sequence  Anti-notch filter  Discrete Fourier transform  Pattern recognition

1 Introduction Despite the successes that have been done in recent decades in terms of control and prevention of many diseases, cancer is considered as one of the life-threatening diseases in every culture. The increasing number and variety of cancers in recent decades confirm this issue. According to published reports and statistics, from each four deaths in America, one of them is referred to cancer disease. The

Communicated by V. Loia. & Amin Khodaei [email protected] Mohammad-Reza Feizi-Derakhshi [email protected] Behzad Mozaffari-Tazehkand [email protected] 1

main origin of cancer is genetically formed by the mutation of some intracellular components associated with DNA, which is developed by the uncontrolled division of cells. Large numbers of human cells that are estimated over 3 billion and the internal structure of cells on the other hand have troubled to analyze and process modules and cellular components (Siegel et al. 2011; Zainal Ariffin and Nor Saleha 2011;