Web Data Mining Based on Cloud Computing
Liangfei Xue, Dongfeng Yuan, and Mingyan Jiang
Abstract It is increasingly important to get accurate information from the web. In this paper, the authors use virtualization technology, which is key to cloud computing, to build a web data mining cloud model. The model, shown in Fig. 309.1, consists of a Storage Cloud and a Calculation Cloud. Finally, the paper describes a specific instance of Web Data Mining combined with Cloud Computing. This instance compares the proposed method with the traditional method in Figs. 309.4 and 309.5. In addition, Table 309.2 shows that the new model, with an appropriate number of nodes, can clearly reduce the time consumed.
Keywords Web data mining • Cloud computing • Cloud model • Storage cloud • Calculation cloud
309.1 Introduction
The wide adoption of the Internet has fundamentally altered the ways in which we communicate, gather information, conduct business and make purchases. How to extract useful information accurately from such a wide variety of data affects the development of society and is one of the most important problems today. This makes Web Data Mining interesting and challenging. Cloud Computing is a new concept that emerged from the development of parallel computing. It allows many customers to mine massive data at low cost, which is of significant scientific and commercial value. The potential of cloud computing has attracted attention from Google, IBM and other foreign firms, as well as domestic companies such as Inspur and Baidu [1, 2].
L. Xue (*) • D. Yuan • M. Jiang, School of Information Science and Engineering, Shandong University, Jinan 250100, China
S. Zhong (ed.), Proceedings of the 2012 International Conference on Cybernetics and Informatics, Lecture Notes in Electrical Engineering 163, DOI 10.1007/978-1-4614-3872-4_309, © Springer Science+Business Media New York 2014
[Figure: the components are the Client, the Cloud server, the Storage cloud and the Calculation cloud. Labels in the figure: request, pretreat, command, data sets, original data, computing result, return the result to the client. The data sets come from web mass information. The storage cloud stores the data sets and the output produced by the calculation cloud. The calculation cloud receives the command from the cloud server.]
Fig. 309.1 Web data mining cloud model
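The flow in Fig. 309.1 can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the class names (`StorageCloud`, `CalculationCloud`, `CloudServer`), the request format, and the choice of term counting as the mining task are all assumptions made for the example, and local threads stand in for the calculation nodes.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def count_terms(records):
    """Hypothetical mining task: count lowercase terms in a chunk of text records."""
    counts = Counter()
    for record in records:
        counts.update(record.lower().split())
    return counts


class StorageCloud:
    """Stores the data sets and the output produced by the calculation cloud."""

    def __init__(self):
        self.data_sets = {}
        self.results = {}

    def put_data(self, name, records):
        self.data_sets[name] = list(records)

    def get_data(self, name):
        return self.data_sets[name]

    def put_result(self, name, result):
        self.results[name] = result


class CalculationCloud:
    """Receives a command from the cloud server and computes over stored data."""

    def __init__(self, storage, nodes=2):
        self.storage = storage
        self.nodes = nodes  # threads stand in for calculation nodes (assumption)

    def run(self, command):
        records = self.storage.get_data(command["data_set"])
        # Split the original data into one chunk per node.
        chunks = [records[i::self.nodes] for i in range(self.nodes)]
        with ThreadPoolExecutor(max_workers=self.nodes) as pool:
            partials = list(pool.map(count_terms, chunks))
        total = Counter()
        for partial in partials:
            total.update(partial)
        # Sort by descending count, then alphabetically, so ties are deterministic.
        ranked = sorted(total.items(), key=lambda kv: (-kv[1], kv[0]))
        result = ranked[: command["top_k"]]
        self.storage.put_result(command["data_set"], result)
        return result


class CloudServer:
    """Pretreats the client request, issues the command, returns the result."""

    def __init__(self, calculation):
        self.calculation = calculation

    def handle(self, request):
        command = {
            "data_set": request["data_set"].strip(),
            "top_k": int(request.get("top_k", 3)),
        }
        return self.calculation.run(command)


if __name__ == "__main__":
    storage = StorageCloud()
    storage.put_data("pages", ["cloud data mining", "web data mining"])
    server = CloudServer(CalculationCloud(storage, nodes=2))
    print(server.handle({"data_set": " pages ", "top_k": 2}))
```

In this sketch the client talks only to the cloud server, while the calculation cloud reads from and writes to the storage cloud, which mirrors the labels in the figure. Changing `nodes` changes how many chunks are processed concurrently.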
Recent work on distributed data mining has focused on grid computing and has achieved some results. Reference [3] proposed a distributed data mining model based on the OGSI.net framework and gave a software deployment scheme for it, but the model was not actually applied to experimental projects. Reference [4] analyzes a data mining system for a grid environment made up of personal computers, covering the data mining processes and the work each process should complete. Studies abroad have shown that data mining based on cloud computing has low power consumption. This paper is to combine the Web data minin