Maximum Entropy Linear Manifold for Learning Discriminative Low-Dimensional Representation
Wojciech Marian Czarnecki (1), Rafal Jozefowicz (2), and Jacek Tabor (1)
1 Faculty of Mathematics and Computer Science, Jagiellonian University, Krakow, Poland
{wojciech.czarnecki,jacek.tabor}@uj.edu.pl
2 Google, New York, USA
[email protected]
Abstract. Representation learning is currently a very hot topic in modern machine learning, mostly due to the great success of deep learning methods. In particular, a low-dimensional representation which discriminates classes can not only enhance the classification procedure, but also make it faster, while, contrary to high-dimensional embeddings, it can be efficiently used for visual exploratory data analysis. In this paper we propose the Maximum Entropy Linear Manifold (MELM), a multidimensional generalization of the Multithreshold Entropy Linear Classifier model, which is able to find a low-dimensional linear data projection maximizing the discriminativeness of the projected classes. As a result we obtain a linear embedding which can be used for classification, class-aware dimensionality reduction and data visualization. MELM provides highly discriminative 2D projections of the data which can be used as a method for constructing robust classifiers. We provide both an empirical evaluation and some interesting theoretical properties of our objective function, such as scale and affine transformation invariance, connections with PCA, and bounding of the expected balanced accuracy error.

Keywords: Dense representation learning · Data visualization · Entropy · Supervised dimensionality reduction
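To make the construction concrete, the sketch below illustrates the general idea in Python: optimize a linear map from R^d to R^k (with orthonormalized columns) so that the kernel density estimates of the projected classes are as mutually divergent as possible. The Cauchy-Schwarz divergence objective is borrowed from the MELC model that MELM generalizes; the bandwidth h, the generic optimizer, the binary 0/1 labels, and the function names (cs_divergence, melm_like_embedding) are illustrative assumptions of this sketch, not the paper's exact procedure.

import numpy as np
from scipy.optimize import minimize

def project(W, X):
    # Orthonormalize the columns of W (reduced QR) and project X onto them.
    Q, _ = np.linalg.qr(W)
    return X @ Q

def cs_divergence(A, B, h=0.5):
    # Cauchy-Schwarz divergence between Gaussian KDEs of two samples:
    #   D_CS(p, q) = -log( (int p*q)^2 / (int p^2 * int q^2) ).
    # For Gaussian kernels each integral reduces to a kernel with doubled
    # variance, and the normalizing constants cancel in the ratio.
    def mean_kernel(P, Q):
        d2 = ((P[:, None, :] - Q[None, :, :]) ** 2).sum(-1)
        return np.exp(-d2 / (4.0 * h ** 2)).mean()
    p_aa, p_bb, p_ab = mean_kernel(A, A), mean_kernel(B, B), mean_kernel(A, B)
    return -np.log(p_ab ** 2 / (p_aa * p_bb))

def melm_like_embedding(X, y, k=2, seed=0):
    # Search for a d x k projection whose image separates the two classes
    # (assumed labels: y in {0, 1}).
    d = X.shape[1]
    rng = np.random.default_rng(seed)

    def neg_objective(w_flat):
        Z = project(w_flat.reshape(d, k), X)
        return -cs_divergence(Z[y == 0], Z[y == 1])

    res = minimize(neg_objective, rng.normal(size=d * k),
                   method="Nelder-Mead", options={"maxiter": 2000})
    W, _ = np.linalg.qr(res.x.reshape(d, k))
    return W  # orthonormal d x k embedding matrix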
1 Introduction
A correct representation of the data, consistent with both the problem and the classification method used, is crucial for the efficiency of machine learning models. In practice it is a very hard task to find a suitable embedding of many real-life objects in the R^d space assumed by most algorithms; in particular, for natural language processing [12], cheminformatics or even image recognition tasks it is still an open problem. As a result there is a growing interest in representation learning methods [8], suited for finding better embeddings of our data, which may be further used for classification, clustering or other analysis purposes.
Recent years have brought many success stories, such as dictionary learning [13] or deep learning [9]. Many of these methods look for a sparse [7], high-dimensional embedding which simplifies linear separation, at the cost of making visual analysis nontrivial. A dual approach is to look for a low-dimensional linear embedding, which has the advantage of easy visualization, interpretation and manipulation, at the cost of a much weaker (in terms of model complexity) space of transformations. In this work we focus on the scenario where we are given a labeled dataset in R^d and look for a low-dimensional linear embedding which allows us to easily distinguish each of the classes. In other words, we are looking for a highly discriminative low-dimensional representation of the data.
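As a point of reference, a toy usage of the sketch from the previous section might look as follows; the dataset parameters are arbitrary, and LDA is included only as a standard supervised baseline, not as part of the proposed method.

from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# Toy binary problem in R^10 (parameters chosen arbitrarily for illustration).
X, y = make_classification(n_samples=200, n_features=10, n_informative=4,
                           random_state=0)

W = melm_like_embedding(X, y, k=2)  # sketch defined above
Z_melm = X @ W                      # discriminative 2D embedding

# LDA baseline: limited to n_classes - 1 = 1 dimension for binary labels,
# whereas a MELM-style objective still yields a 2D projection for plotting.
Z_lda = LinearDiscriminantAnalysis(n_components=1).fit_transform(X, y)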