Gene selection of non-small cell lung cancer data for adjuvant chemotherapy decision using cell separation algorithm

  • PDF / 1,888,259 Bytes
  • 15 Pages / 595.276 x 790.866 pts Page_size
  • 8 Downloads / 143 Views

DOWNLOAD

REPORT


Gene selection of non-small cell lung cancer data for adjuvant chemotherapy decision using cell separation algorithm Najmeh Sadat Jaddi 1 & Mohammad Saniee Abadeh 1

# Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract Since recommended treatment for Non-small cell lung cancer (NSCLC) after surgery is chemotherapy, the prediction of effectiveness or futileness of adjuvant chemotherapy (ACT) in early stage is important for future decision. Classification of NSCLC in gene expression data is performed to predict effectiveness or futileness of ACT. Selection of genes highly correlated with the class attribute, affects the classification accuracy. In this paper, a new cell separation algorithm is proposed which it imitates the action of cell separation using differential centrifugation process involving multiple centrifugation steps and increasing the rotor speed in each step. The CSA uses the application of centrifugal force to separate the solutions based on their objective function in different steps while the velocity is increased in each step. The CSA contributes to automatic trade-off between exploration and exploitation by control of selection rate during the search process. To examine the CSA, 25 test functions were used first and then the CSA was applied to predict effectiveness or futileness of ACT. The number of genes in candidate subsets is handled by increasing the subset size if after a certain number of iterations there is no improvement in fitness of the subset. This contributes to less time consideration and memory usage. In this experiment, the NSCLC data contain 280 samples collected from four institutes are used. As results, the minimum number of five genes with dependency degree equal to one and classification accuracy of higher than 94% for SVM, KNN and MLP classifiers is obtained. Keywords Gene selection . Classification . Non-small cell lung cancer . Adjuvant chemotherapy decision . Cell separation algorithm

1 Introduction The most common type of cancer and the most important reason of death in the worldwide is lung cancer [1]. Lung cancer contains of two main subtypes: About 10–15% are small cell lung cancer (SCLC) and around 85–90% of lung cancers are non-small cell lung cancer (NSCLC) [1]. The most commonly used treatment for NSCLC is the adjuvant chemotherapy following by surgery. The use of adjuvant chemotherapy is to avoid recurrence or metastases. Since chemotherapy is not only a painful treatment but also an expensive process, prediction of effectiveness or futileness of ACT turn out to be a significant subject to prevent unnecessary application of * Mohammad Saniee Abadeh [email protected] Najmeh Sadat Jaddi [email protected] 1

Faculty of Electrical and Computer Engineering, Tarbiat Modares University, Tehran, Iran

chemotherapy for futile cases. Many studies have been attempted to investigate the effectiveness of chemotherapy for NSCLC patients [2–4]. In opposite, several studies report futileness by ACT in NSCLC [5–7]. Since ACT has significant toxici