Decision regions and decision boundaries of generalized K mean algorithm based on various norm criteria

PDF / 2,758,496 Bytes
16 Pages / 439.37 x 666.142 pts Page_size
94 Downloads / 212 Views

Decision regions and decision boundaries of generalized K mean algorithm based on various norm criteria Xinpeng Wang 1 & Bingo Wing-Kuen Ling 1 Received: 26 April 2020 / Revised: 10 July 2020 / Accepted: 21 July 2020 # Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract

This paper derives the decision regions and the decision boundaries of the generalized K mean algorithms based on the L1 norm criterion, the L2 norm criterion and the L∞ norm criterion. The decision boundaries of these three generalized K mean algorithms are all linear hyperplanes. However, the total numbers of the decision boundaries of the generalized K mean algorithms based on both the L1 norm criterion and the L∞ norm criterion are more than that based on the L2 norm criterion. On the other hand, the decision regions of the generalized K mean algorithm based on the L2 norm criterion are convex while that based on both the L1 norm criterion and the L∞ norm criterion are in general nonconvex. The computer numerical simulations on a toy example demonstrate the above phenomena. Besides, two examples are illustrated. The first example on the patent image retrieval system shows that the recognition accuracies of using the generalized K mean algorithms based on the L2 norm criterion, the L1 norm criterion and the L∞ norm criterion are 58.25%, 61% and 58.75%, respectively. The second example on the electromyogram based Parkinson’s disease detection system shows that the recognition accuracies of using the generalized K mean algorithms based on the L2 norm criterion, the L1 norm criterion and the L∞ norm criterion are 60%, 60% and 67%, respectively, if the signals are classified directly in the time domain. On the other hand, the recognition accuracies of the generalized K mean algorithms based on the L2 norm criterion, the L1 norm criterion and the L∞ norm criterion are 60%, 87% and 60%, respectively, if the signals are classified directly in the discrete cosine transform domain. The improvements are due to the nonconvexity of the decision regions of the generalized K mean algorithm based on the L1 norm criterion and the L∞ norm criterion. Keywords Generalized K mean algorithms . Decision regions . Decision boundaries . Nonconvex set . Mixed integer quadratic programming . Mixed integer linear programming

* Xinpeng Wang [email protected]

1

Faculty of Information Engineering, Guangdong University of Technology, Guangzhou 510006, China

Multimedia Tools and Applications

1 Introduction The K mean algorithm [7, 14, 15, 19] is the common approach to perform the unsupervised clustering [2, 6]. First, a set of representative vectors is initialized where each representative vector corresponds to a particular cluster of objects. Then, assign each feature vector to a particular cluster in such a way that the L2 norm error between the feature vector and the corresponding representative vector is the smallest. Once all the feature vectors are assigned to their corresponding clusters, the mean of the feature vectors in the same

Data Loading...

Decision regions and decision boundaries of generalized K mean algorithm based on various norm criteria

Recommend Documents

New Reduction Algorithm Based on Decision Power of Decision Table

MULTI-CRITERIA DECISION MODELS

Multiple criteria decision making

Hesitant Fuzzy Multiple Criteria Decision Analysis Based on TOPSIS

Multi-criteria decision making (MCDM)

Generalized Maclaurin symmetric mean aggregation operators based on Archimedean t-norm of the intuitionistic fuzzy soft

Research on Leisure Sports Activities Based on Decision Tree Algorithm

The Multi-Criteria Approach Decision

A labeling algorithm based on a forest of decision trees

Multiple Criteria Decision Analysis An Integrated Approach

Decision Support Systems with Multiple Criteria

A special issue on multi-criteria decision aiding