Sign Constrained Rectifier Networks with Applications to Pattern Decompositions



1 School of Computer Science and Software Engineering, The University of Western Australia, Crawley, WA 6009, Australia
{senjian.an,mohammed.bennamoun,ferdous.sohel}@uwa.edu.au, [email protected]
2 School of Electrical, Electronic and Computer Engineering, The University of Western Australia, Crawley, WA 6009, Australia
[email protected]

Abstract. In this paper we introduce sign constrained rectifier networks (SCRN), demonstrate their universal classification power and illustrate their applications to pattern decompositions. We prove that the proposed two-hidden-layer SCRN, with sign constraints on the weights of the output layer and on those of the top hidden layer, are capable of separating any two disjoint pattern sets. Furthermore, a two-hidden-layer SCRN of a pair of disjoint pattern sets can be used to decompose one of the pattern sets into several subsets so that each subset is convexly separable from the entire other pattern set; and a single-hidden-layer SCRN of a pair of convexly separable pattern sets can be used to decompose one of the pattern sets into several subsets so that each subset is linearly separable from the entire other pattern set. SCRN can thus be used to learn the pattern structures from the decomposed subsets of patterns, and to analyse the discriminant factors of different patterns from the linear classifiers of the linearly separable subsets in the decompositions. With such pattern decompositions exhibiting convex separability or linear separability, users can also analyse the complexity of the classification problem, and remove the outliers and the non-crucial points so as to improve the training of traditional unconstrained rectifier networks in terms of both performance and efficiency.

Keywords: Rectifier neural network · Pattern decomposition
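The architecture described above, a two-hidden-layer rectifier network whose top-hidden-layer and output weights carry sign constraints, can be illustrated with a minimal NumPy forward pass. The particular choice of signs in this sketch (elementwise nonnegative top-hidden-layer weights, nonpositive output weights, enforced by projection) is an illustrative assumption, not necessarily the exact construction used in the paper.

```python
import numpy as np

def relu(z):
    """Rectifier activation max(0, z), applied elementwise."""
    return np.maximum(0.0, z)

def scrn_forward(x, W1, b1, W2, b2, w3, b3):
    """Forward pass of a two-hidden-layer rectifier network with
    sign constraints on the top hidden layer and the output layer.

    The constraints are enforced here by projection (clipping):
    W2 is kept elementwise nonnegative and w3 elementwise
    nonpositive.  This sign pattern is an assumption made for
    illustration only.
    """
    W2 = np.maximum(W2, 0.0)   # sign constraint on top-hidden-layer weights
    w3 = np.minimum(w3, 0.0)   # sign constraint on output-layer weights
    h1 = relu(W1 @ x + b1)     # first hidden layer (unconstrained weights)
    h2 = relu(W2 @ h1 + b2)    # second (top) hidden layer
    return w3 @ h2 + b3        # scalar decision score for binary classification

# Tiny example: 2-D input, hidden widths 4 and 3.
rng = np.random.default_rng(0)
x = rng.normal(size=2)
W1, b1 = rng.normal(size=(4, 2)), rng.normal(size=4)
W2, b2 = rng.normal(size=(3, 4)), rng.normal(size=3)
w3, b3 = rng.normal(size=3), 0.5
score = scrn_forward(x, W1, b1, W2, b2, w3, b3)
```

A pattern would then be assigned to one class or the other according to the sign of `score`; the dimensions and initialisation here are arbitrary placeholders.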

1 Introduction

Deep rectifier networks have achieved great success in object recognition [4,8,10,18], face verification [14,15], speech recognition [3,6,12] and handwritten digit recognition [2]. However, the lack of understanding of the roles of the hidden layers makes deep networks difficult to interpret for tasks of discriminant factor analysis and pattern structure analysis. Towards a clearer understanding of the success of deep rectifier networks, a recent work [1] provides a constructive proof of the universal classification power of two-hidden-layer rectifier networks. For binary classification, the proof uses the first hidden layer to make the pattern sets convexly separable. The second hidden layer is then used to achieve linear separability, and finally a linear classifier is used to separate the patterns. Although this strategy can be used in constructive proofs, it cannot be used to analyse a learnt rectifier network, since it might not be verified in the empirical learning from data. Fortunately,

© Springer International Publishing Switzerland 2015. A. Appice et al. (Eds.): ECML PKDD 2015, Part I, LNAI 9284, pp. 546–559, 2015. DOI: 10.1007/978-3-319-23528-8_34