Machine Learning and Data Mining

Machine learning and data mining are research areas of computer science whose quick development is due to the advances in data analysis research, growth in thedatabase industry and the resultingmarket needs for methods that are capable of extracting valua

  • PDF / 565,817 Bytes
  • 17 Pages / 439.36 x 666.15 pts Page_size
  • 49 Downloads / 252 Views

DOWNLOAD

REPORT


Machine Learning and Data Mining

Machine learning and data mining are research areas of computer science whose quick development is due to the advances in data analysis research, growth in the database industry and the resulting market needs for methods that are capable of extracting valuable knowledge from large data stores. This chapter gives an informal introduction to machine learning and data mining, and describes selected machine learning and data mining methods illustrated by examples. After a brief general introduction, Sect. 1.2 briefly sketches the historical background of the research area, followed by an outline of the knowledge discovery process and the emerging standards in Sect. 1.3. Section 1.4 establishes the basic terminology and provides a categorization of different learning tasks. Predictive and descriptive data mining techniques are illustrated by means of simplified examples of data mining tasks in Sects. 1.5 and 1.6, respectively. In Sect. 1.7, we highlight the importance of relational data mining techniques. The chapter concludes with some speculations about future developments in data mining.

1.1 Introduction Machine learning (Mitchell, 1997) is a mature and well-recognized research area of computer science, mainly concerned with the discovery of models, patterns, and other regularities in data. Machine learning approaches can be roughly categorized into two different groups: Symbolic approaches. Inductive learning of symbolic descriptions, such as rules (Clark & Niblett, 1989; Michalski, Mozetiˇc, Hong, & Lavraˇc, 1986) decision



This chapter is partly based on Lavraˇc & Grobelnik (2003).

J. F¨urnkranz et al., Foundations of Rule Learning, Cognitive Technologies, DOI 10.1007/978-3-540-75197-7 1, © Springer-Verlag Berlin Heidelberg 2012

1

2

1 Machine Learning and Data Mining

trees (Quinlan, 1986) or logical representations (De Raedt, 2008; Lavraˇc & Dˇzeroski, 1994a; Muggleton, 1992). Textbooks that focus on this line of research include (Langley, 1996; Mitchell, 1997; Witten & Frank, 2005). Statistical approaches. Statistical or pattern-recognition methods, including k-nearest neighbor or instance-based learning (Aha, Kibler, & Albert, 1991; Dasarathy, 1991), Bayesian classifiers (Pearl, 1988), neural network learning (Rumelhart & McClelland, 1986), and support vector machines (Sch¨olkopf & Smola, 2001; Vapnik, 1995). Textbooks in this area include (Bishop, 1995; Duda, Hart, & Stork, 2000; Hastie, Tibshirani, & Friedman, 2001; Ripley, 1996). Although the approaches taken in these fields are often quite different, their effectiveness in learning is often comparable (Michie, Spiegelhalter, & Taylor, 1994). Also, there are many approaches that cross the boundaries between the two approaches. For example, there are decision tree (Breiman, Friedman, Olshen, & Stone, 1984) and rule learning (Friedman & Fisher, 1999) algorithms that are firmly based in statistics. Similarly, ensemble techniques such as boosting (Freund & Schapire, 1997), bagging (Breiman, 1996) or random forests (Brei