Co-training with Credal Models
Abstract. So-called credal classifiers offer an interesting approach when the reliability or robustness of predictions has to be guaranteed. Through the use of convex probability sets, they can select multiple classes as a prediction when information is insufficient and predict a unique class only when the available information is rich enough. The goal of this paper is to explore whether this particular feature can be used advantageously in the setting of co-training, in which one classifier strengthens another by feeding it new labeled data. We propose several co-training strategies to exploit the potential indeterminacy of credal classifiers and test them on several UCI datasets. We then compare the best strategy to the standard co-training process to check its efficiency.

Keywords: Co-training · Imprecise probabilities · Semi-supervised learning · Ensemble models

1 Introduction
There are many application fields (gesture recognition, human activity recognition, finance, ...) where extracting numerous unlabeled data is easy, but where labeling them reliably requires costly human effort or an expertise that may be rare and expensive. In such cases, obtaining a large labeled dataset is not possible, making it difficult to train an efficient classifier from labeled data alone. The general goal of semi-supervised learning techniques [1,7,28] is to solve this issue by exploiting the information contained in unlabeled data. It includes different approaches such as the adaptation of training criteria [13,14,16], active learning methods [18] and co-training-like approaches [6,19,22]. In this paper, we focus on the co-training framework. This approach trains two classifiers in parallel, each model then attempting to strengthen the other by labeling a selection of unlabeled data. We will call the classifier providing new labeled instances the trainer, and the classifier using them as new training data the learner. In the standard co-training approach [6,22], the trainer provides the learner with the data whose labels it predicts most confidently. However, although those labels are predicted with high confidence by the trainer, it is not guaranteed that the new labeled instances will be informative for the learner, in the sense that they may not help it improve its accuracy.
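As a point of reference, the following is a minimal sketch of this standard co-training loop, assuming scikit-learn-style classifiers with fit/predict_proba and two feature views of the same data; the function and parameter names (co_training, rounds, k) are illustrative, not taken from the paper.

```python
import numpy as np

def co_training(clf_a, clf_b, Xa, Xb, y, Ua, Ub, rounds=10, k=5):
    """Standard co-training: each round, every trainer labels the k
    unlabeled instances it is most confident about and adds them to
    the shared labeled pool. Xa/Xb (and Ua/Ub) hold the two views of
    the labeled (resp. unlabeled) data."""
    Xa, Xb, y = list(Xa), list(Xb), list(y)
    Ua, Ub = list(Ua), list(Ub)
    for _ in range(rounds):
        if not Ua:
            break
        clf_a.fit(np.array(Xa), np.array(y))
        clf_b.fit(np.array(Xb), np.array(y))
        picked = {}  # index -> predicted label (if both pick it, the last wins)
        for trainer, U in ((clf_a, Ua), (clf_b, Ub)):
            proba = trainer.predict_proba(np.array(U))
            for i in np.argsort(proba.max(axis=1))[-k:]:  # most confident
                picked[int(i)] = trainer.classes_[proba[i].argmax()]
        # Move the picked instances, with their predicted labels, to the labeled set.
        for i, lab in picked.items():
            Xa.append(Ua[i]); Xb.append(Ub[i]); y.append(lab)
        keep = [i for i in range(len(Ua)) if i not in picked]
        Ua, Ub = [Ua[i] for i in keep], [Ub[i] for i in keep]
    return clf_a, clf_b
```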
To solve this issue, we propose a new co-training approach using credal classifiers. Such classifiers, through the use of convex sets of probabilities, can predict a set of labels when training data are insufficiently conclusive. This means they will produce a single label as prediction only when the available information is sufficient (i.e., when the probability set is small enough). The basic idea of our approach is to select, as potential new training data for the learner, those instances for which the (credal) trainer has predicted a single label and the learner multiple ones.
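To make this concrete, the sketch below implements one common credal decision rule, interval dominance, together with the selection criterion just described. The lower/upper class-probability bounds (here obtained from an imprecise Dirichlet model) and all function names are illustrative assumptions, not the paper's exact classifier.

```python
import numpy as np

def credal_predict(lower, upper):
    """Interval dominance: class j is rejected if some class i has
    lower[i] > upper[j]; the prediction is the set of surviving classes."""
    K = len(lower)
    return {j for j in range(K)
            if not any(lower[i] > upper[j] for i in range(K) if i != j)}

def select_for_learner(trainer_bounds, learner_bounds):
    """Keep the unlabeled instances where the trainer is determinate
    (a single class) while the learner is indeterminate (several),
    returning (index, trainer's label) pairs."""
    out = []
    for idx, ((lo_t, up_t), (lo_l, up_l)) in enumerate(
            zip(trainer_bounds, learner_bounds)):
        pred_t = credal_predict(lo_t, up_t)
        pred_l = credal_predict(lo_l, up_l)
        if len(pred_t) == 1 and len(pred_l) > 1:
            out.append((idx, next(iter(pred_t))))
    return out

# Example bounds from an imprecise Dirichlet model with class counts n
# and hyperparameter s: lower = n_k / (N + s), upper = (n_k + s) / (N + s).
n, s = np.array([8.0, 1.0, 1.0]), 2.0
lower, upper = n / (n.sum() + s), (n + s) / (n.sum() + s)
print(credal_predict(lower, upper))  # {0}: a determinate prediction
```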
2 Co-training Framework
We assume that sam