A virtual mart for knowledge discovery in databases

  • PDF / 1,298,239 Bytes
  • 17 Pages / 595.224 x 790.955 pts Page_size
  • 102 Downloads / 201 Views

DOWNLOAD

REPORT


A virtual mart for knowledge discovery in databases Claudia Diamantini · Domenico Potena · Emanuele Storti

© Springer Science+Business Media New York 2013

Abstract The Web has profoundly reshaped our vision of information management and processing, enlightening the power of a collaborative model of information production and consumption. This new vision influences the Knowledge Discovery in Databases domain as well. In this paper we propose a service-oriented, semantic-supported approach to the development of a platform for sharing and reuse of resources (data processing and mining techniques), enabling the management of different implementations of the same technique and characterized by a communitycentered attitude, with functionalities for both resource production and consumption, facilitating end-users with different skills as well as resource providers with different technical and domain specific capabilities. We first describe the semantic framework underlying the approach, then we demonstrate how this framework is exploited to give different functionalities to users through the presentation of the platform functionalities.

1 Introduction The present paper proposes Knowledge Discovery in Databases Virtual Mart (KDDVM), a framework and a platform for distributed Knowledge Discovery in Databases (KDD) experiments. As the name suggests, the aim is to provide

C. Diamantini () · D. Potena · E. Storti DII, Universit´a Politecnica delle Marche, Ancona, Italy e-mail: [email protected] D. Potena e-mail: [email protected] E. Storti e-mail: [email protected]

a virtual environment where distributed and heterogeneous KDD resources (i.e. data processing and mining techniques) can be easily introduced, acquired and exploited. In order to achieve the goal, in a five-years long work on the subject, we have individuated the major issues and requirements of a distributed experimental environment, from which the technological aspects and basic components of a supporting architecture have been derived. We have also developed suitable technologies for the representation of knowledge about resources and developed suitable services for its management and exploitation. All these achievements will be systematically described in the paper. The work is in the mainstream of the recent web-based revolution. The Web has profoundly reshaped our vision of information management and processing, software architectures and design, delivery models. All of this can be at least in part explained by the more than linear growth of value in a network of elements, synthesized by the Metcalfe’s Law. The social aspect emerging from the success of so-called Web 2.0 show the strategic value of a collaborative model of information production and consumption (Pedrinaci and Domingue 2010). As a matter of fact, more and more organizations share their data in a way that allows people to process and exploit them. The Semantic Web, and its recent evolution in the Web of Data, through the use of machinereadable languages for the representation of kno