Evaluation of train and test performance of machine learning algorithms and Parkinson diagnosis with statistical measure

  • PDF / 1,122,390 Bytes
  • 14 Pages / 595.276 x 790.866 pts Page_size
  • 98 Downloads / 191 Views

DOWNLOAD

REPORT


ORIGINAL ARTICLE

Evaluation of train and test performance of machine learning algorithms and Parkinson diagnosis with statistical measurements Emre Avuçlu 1 & Abdullah Elen 2 Received: 5 March 2020 / Accepted: 29 August 2020 # International Federation for Medical and Biological Engineering 2020

Abstract Parkinson’s disease is a neurological disorder that causes partial or complete loss of motor reflexes and speech and affects thinking, behavior, and other vital functions affecting the nervous system. Parkinson’s disease causes impaired speech and motor abilities (writing, balance, etc.) in about 90% of patients and is often seen in older people. Some signs (deterioration of vocal cords) in medical voice recordings from Parkinson’s patients are used to diagnose this disease. The database used in this study contains biomedical speech voice from 31 people of different age and sex related to this disease. The performance comparison of the machine learning algorithms k-Nearest Neighborhood (k-NN), Random Forest, Naive Bayes, and Support Vector Machine classifiers was performed with the used database. Moreover, the best classifier was determined for the diagnosis of Parkinson’s disease. Eleven different training and test data (45 × 55, 50 × 50, 55 × 45, 60 × 40, 65 × 35, 70 × 30, 75 × 25, 80 × 20, 85 × 15, 90 × 10, 95 × 5) were processed separately. The data obtained from these training and tests were compared with statistical measurements. The training results of the k-NN classification algorithm were generally 100% successful. The best test result was obtained from Random Forest classifier with 85.81%. All statistical results and measured values are given in detail in the experimental studies section. Keywords Medical voice recordings . Machine learning . Parkinson’s disease . Performance comparison

1 Introduction Neurological diseases in the world cause more and more human deaths for people. Parkinson’s disease is a neurodegenerative disease of the central nervous system that causes loss of motor reflex and speech and affects behavior, mental process, and other vital functions [1]. Parkinson’s disease was described and named as shaky paralysis by Doctor James Parkinson in 1817 [2]. It is generally seen in elderly people and causes loss of speech and motor abilities (balance, etc.) in 90% of patients [3]. Parkinson’s disease is the second most

* Emre Avuçlu [email protected] Abdullah Elen [email protected] 1

Department of Computer Technology, Aksaray University, Aksaray, Turkey

2

Department of Computer Technology, Karabuk University, Karabuk, Turkey

common neurological health problem following Alzheimer’s disease [4]. The incidence and prevalence of the disease vary in different studies. In general, approximately 10 million people in the world complain of this disease [5, 6]. In a recent comprehensive study, the incidence of the disease was reported to be 20/100000 [7]. It is known that there are more than a million Parkinson’s patients in North America [8]. In Europe, the prevalence of the disease i