Power spectrum and dynamic time warping for DNA sequences classification

  • PDF / 1,979,139 Bytes
  • 10 Pages / 595.276 x 790.866 pts Page_size
  • 0 Downloads / 265 Views

DOWNLOAD

REPORT


ORIGINAL PAPER

Power spectrum and dynamic time warping for DNA sequences classification Abdesselem Dakhli1 · Chokri Ben Amar2 Received: 13 August 2018 / Accepted: 29 September 2019 © Springer-Verlag GmbH Germany, part of Springer Nature 2019

Abstract Similarity and alignment and are often used to classify DNA sequences. We have developed a new classifier to classify DNA sequence. First, our approach is used to extract the features of DNA strands. Second, the goal of our approach is to classify DNA strands according to the similarity elaborated by the alignment. Frequently, the performance of the classification of DNA sequences depends on the method that allows to extract the characteristics and calculation of the genomic similarity. Particularly, our approach consists of three different methods for improving the classification of the DNA sequences. This paper presents a new approach of classification of DNA sequence based on dynamic time warping (DTW) method. First, the binary indicator is used to code each nucleotide and the power spectrum is used to extract the characteristics. Secondly, the DNA sequence similarity matrix is evaluated by the dynamic temporal Warping. Third, pairwise comparison is used to classify DNA strands. Our approach solves the complex problem of presentation and structure of different groups of organisms. The experimental results of our classifier obtained are compared with other approaches based on the alignment and similarity of the DNA sequences. These results showed that our approach outperformed other approaches in terms of classification and running time. Here is a summary of the main contributions of this article: (1) Convert nucleotides from DNA sequences by applying binary coding. (2) Using power spectrum our approach extracts the characteristics of DNA sequences. (3) Elaborate the similarity matrix of the DNA strand signal by the Dynamic Time Warping method. (4) Use pairwise comparison to classify DNA sequences. The approach developed is efficient to solve the problems of classification of DNA sequences. Keywords  DNA sequences · Power spectrum · Dynamic time warping · Binary · Pairwise comparison · Discrete Fourier transform

1 Introduction Bioinformatics studies the classification of DNA strands as a fundamental problem in modern genomics. Methods for classifying DNA sequences can be divided into three major types. The first category is the feature-based classification, which converts a DNA sequence into vector functionality and then uses classical classification methods to classify * Abdesselem Dakhli [email protected] Chokri Ben Amar [email protected] 1



Hail University, Community College, Hail, Kingdom of Saudi Arabia



REGIM: Research Groups on Intelligent Machines, National Engineering School of Sfax (ENIS), University of Sfax, 3038 Sfax, Tunisia

2

the DNA sequences. The second type is the classification based on the distance between the DNA strands. Similarity was used to group the DNA sequences. The similarity assessment is done using a distance fu