GEML: A Grammatical Evolution, Machine Learning Approach to Multi-class Classification

In this paper, we propose a hybrid approach to solving multi-class problems which combines evolutionary computation with elements of traditional machine learning. The method, Grammatical Evolution Machine Learning (GEML) adapts machine learning concepts f

PDF / 676,965 Bytes
22 Pages / 439.37 x 666.142 pts Page_size
47 Downloads / 169 Views

DOWNLOAD

REPORT

Abstract. In this paper, we propose a hybrid approach to solving multiclass problems which combines evolutionary computation with elements of traditional machine learning. The method, Grammatical Evolution Machine Learning (GEML) adapts machine learning concepts from decision tree learning and clustering methods and integrates these into a Grammatical Evolution framework. We investigate the eﬀectiveness of GEML on several supervised, semi-supervised and unsupervised multiclass problems and demonstrate its competitive performance when compared with several well known machine learning algorithms. The GEML framework evolves human readable solutions which provide an explanation of the logic behind its classiﬁcation decisions, oﬀering a signiﬁcant advantage over existing paradigms for unsupervised and semi-supervised learning. In addition we also examine the possibility of improving the performance of the algorithm through the application of several ensemble techniques. Keywords: Multi-class classiﬁcation · Grammatical evolution tionary computation · Machine learning

1

· Evolu-

Introduction

Evolutionary algorithms (EAs) are algorithms which are inspired by biological evolution and which are constructed to emulate aspects of evolution, such as genetic mutation and recombination and the notion of natural selection. Genetic Programming (GP) [29] is an evolutionary algorithm which has been successful on a wide range of problems from various diverse domains [19], achieving many human competitive results [4]. However, a signiﬁcant proportion of previous work has concentrated on supervised learning tasks and, aside from some notable exceptions, studies on unsupervised and semi-supervised learning have been left to the wider machine learning (ML) community. Two of the most important problems types which beneﬁt from the application of ML techniques are regression and classiﬁcation, and GP has proven itself as an eﬀective learner on each of these: achieving particularly competitive results on symbolic regression and binary classiﬁcation tasks. Although many studies have c Springer International Publishing AG 2017 J.J. Merelo et al. (eds.), Computational Intelligence, Studies in Computational Intelligence 669, DOI 10.1007/978-3-319-48506-5 7

114

J.M. Fitzgerald et al.

been undertaken, multi-class classiﬁcation (MCC) remains a problem which is considered challenging for traditional tree based GP [11]. While we are concerned with multi-class classiﬁcation generally, an important motivation for the current investigation is the requirement for an algorithm which can be applied to multi-class grouping/categorisation tasks involving both labelled and unlabelled inputs from the medical domain, where the unsupervised algorithm must be able to supply human interpretable justiﬁcation for categorisation decisions. Clustering is a natural choice for this type of task, but standard clustering algorithms generally fail to satisfy the requirement of providing the reasoning behind cluster allocations in a human readable form. In the med

Data Loading...

GEML: A Grammatical Evolution, Machine Learning Approach to Multi-class Classification

Recommend Documents

Machine Learning Approach on Steel Microstructure Classification

Machine Learning Approach Towards Satellite Image Classification

Binary Classification of Proteins by a Machine Learning Approach

Empirical Approach to Machine Learning

Bow Gesture Classification to Identify Three Different Expertise Levels: A Machine Learning Approach

Machine Learning for Evolution Strategies

A Novel Approach to Detect Emergency Using Machine Learning

A Structured Approach to Risk Assessment of Machine Learning Applications

A Machine Learning Approach to Dataset Imputation for Software Vulnerabilities

Combinatorial Machine Learning A Rough Set Approach

Machine Learning Approach for Feature Interpretation and Classification of Genetic Mutations Leading to Tumor and Cancer

An Output Grouping Based Approach to Multiclass Classification Using Support Vector Machines