Multi-label optimal margin distribution machine

PDF / 1,153,918 Bytes
20 Pages / 439.37 x 666.142 pts Page_size
58 Downloads / 232 Views

Multi-label optimal margin distribution machine Zhi-Hao Tan1

· Peng Tan1 · Yuan Jiang1 · Zhi-Hua Zhou1

Received: 6 May 2019 / Revised: 30 July 2019 / Accepted: 6 September 2019 © The Author(s), under exclusive licence to Springer Science+Business Media LLC, part of Springer Nature 2019

Abstract Multi-label support vector machine (Rank-SVM) is a classic and effective algorithm for multi-label classification. The pivotal idea is to maximize the minimum margin of label pairs, which is extended from SVM. However, recent studies disclosed that maximizing the minimum margin does not necessarily lead to better generalization performance, and instead, it is more crucial to optimize the margin distribution. Inspired by this idea, in this paper, we first introduce margin distribution to multi-label learning and propose multi-label Optimal margin Distribution Machine (mlODM), which optimizes the margin mean and variance of all label pairs efficiently. Extensive experiments in multiple multi-label evaluation metrics illustrate that mlODM outperforms SVM-style multi-label methods. Moreover, empirical study presents the best margin distribution and verifies the fast convergence of our method. Keywords Optimal margin distribution machine · Multi-label learning · Support vector machine · Margin theory

1 Introduction In contrast to traditional supervised learning, multi-label classification purports to build classification models for objects assigned with multiple labels simultaneously, which is a common learning paradigm in real-world tasks. In the past decades, it has attracted much attention (Zhang and Zhou 2014a). To name a few, in image classification, a scene image is usually annotated with several tags (Boutell et al. 2004); in text categorization, a docu-

Editors: Kee-Eung Kim and Jun Zhu.

B

Yuan Jiang [email protected] Zhi-Hao Tan [email protected] Peng Tan [email protected] Zhi-Hua Zhou [email protected]

1

National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China

123

Machine Learning

ment may present multiple topics (McCallum 1999; Schapire and Singer 2000); in music information retrieval, a piece of music can convey various messages (Turnbull et al. 2008). To solve the multi-label tasks, a variety of methods have been proposed (Zhang and Zhou 2014a, Zhang et al. 2018), among which Rank-SVM (Elisseeff and Weston 2002) is one of the most eminent methods. It extended the idea of maximizing minimum margin in support vector machine (SVM) (Cortes and Vapnik 1995) to multi-label classification and achieved impressive performance. Specifically, the central idea of SVM is to search a large margin separator, i.e., maximizing the smallest distance from the instances to the classification boundary in a RKHS (reproducing kernel Hilbert space). Rank-SVM modified the definition of margin for label pairs and adapted maximizing margin strategy to deal with multi-label data, where a set of classifiers are optimized simultaneously. Benefiting from kernel tricks and co

Data Loading...

Multi-label optimal margin distribution machine

Recommend Documents

Multi-parameter safe screening rule for hinge-optimal margin distribution machine

Multilabel Toxic Comment Classification Using Supervised Machine Learning Algorithms

A density-based maximum margin machine classifier

Twin support vector machine based on adjustable large margin distribution for pattern classification

Multilabel Classification

Optimal Energy Distribution in Smart Grid

Vegetable Distribution System Using Machine Learning Approach

PRODUCT MARGIN

Multilabel graph-based classification for missing labels

Surgical Margin Assessment

CONTRIBUTION MARGIN ANALYSIS

Margin Classifier