A comprehensive survey of data mining
- PDF / 383,739 Bytes
- 15 Pages / 595.276 x 790.866 pts Page_size
- 95 Downloads / 265 Views
ORIGINAL RESEARCH
A comprehensive survey of data mining Manoj Kumar Gupta1
•
Pravin Chandra1
Received: 29 June 2019 / Accepted: 20 January 2020 Bharati Vidyapeeth’s Institute of Computer Applications and Management 2020
Abstract Data mining plays an important role in various human activities because it extracts the unknown useful patterns (or knowledge). Due to its capabilities, data mining become an essential task in large number of application domains such as banking, retail, medical, insurance, bioinformatics, etc. To take a holistic view of the research trends in the area of data mining, a comprehensive survey is presented in this paper. This paper presents a systematic and comprehensive survey of various data mining tasks and techniques. Further, various real-life applications of data mining are presented in this paper. The challenges and issues in area of data mining research are also presented in this paper. Keywords Data mining techniques Data mining tasks Data mining applications Clustering Classification Survey
1 Introduction Data mining, an essential and important step in knowledge discovery in databases, is used to discover useful unknown patterns from large repository of data [1–4]. Data mining consists of various functionalities, techniques and algorithms that are being used to discover and extract interesting patterns from the large repository of data [1, 2, 4]. Due to the importance in decision making, in the last two
& Manoj Kumar Gupta [email protected]; [email protected] 1
University School of Information, Communication and Technology, Guru Gobind Singh Indraprastha University, Sector-16C, Dwarka, Delhi 110078, India
decades, data mining got a wide focus and has become an essential tool in performing variety of operations of the organizations [5]. Data mining is a step in the knowledge discovery in databases process consisting of applying data analysis and discovery algorithms that, under acceptable computational efficiency limitations, produce a particular enumeration of patterns over the data….. [1]. Han et al. [6] stated data mining as ‘‘data mining is a process of discovering or extracting interesting patterns, associations, changes, anomalies and significant structures from large amounts of data which is stored in multiple data sources such as file systems, databases, data warehouses or other information repositories.’’ Many techniques from other domains [6–8] such as statistics, database/data warehouse systems, machine learning, algorithms, pattern recognition, visualization, information retrieval, high-performance computing, etc. incorporated in data mining. First three techniques are the primary contributors of data mining [7].
2 Trends in data mining research Through a survey of literature, it is identified that the data mining research can be broadly categorized into following types [9–12]. 2.1 Data mining functions Data mining functions or tasks can be used to specify the types of patterns or knowledge to be discovered during the data mining process. Some
Data Loading...