Advanced Techniques in Web Intelligence - I

The present book aims to introduce a selection of research applications in the area of Web Intelligence. We have selected a number of researchers around the world, all of whom are experts in their respective research areas. Each chapter focuses on a spec



Web Pattern Extraction and Storage

Víctor L. Rebolledo, Gastón L’Huillier, and Juan D. Velásquez

Abstract. Web data provides information and knowledge that can be used to improve a web site's content and structure. Indeed, it contains knowledge that suggests changes which make a web site more efficient and effective at attracting and retaining visitors. Using a Data Webhouse or a web analytics solution, it is possible to store statistical information about the behaviour of users on a web site. Likewise, by applying web mining algorithms, interesting patterns can be discovered, interpreted and transformed into useful knowledge. On the other hand, web data includes large quantities of irrelevant data, so complex preprocessing must be applied in order to model and understand visitor browsing behaviour. Moreover, there are many ways to preprocess web data and model browsing behaviour, hence different patterns can be obtained depending on which model is used. In this sense, a knowledge representation is necessary to store and manipulate web patterns. Generally, different patterns are discovered by applying distinct web mining techniques to web data that has been treated in dissimilar ways. Consequently, pattern meta-data is needed to manipulate the discovered knowledge. In this chapter, topics such as feature selection, web mining techniques, model characterisation and pattern management are covered in order to build a repository that stores patterns’ meta-data: specifically, a Pattern Webhouse that facilitates knowledge management in the web environment.

Víctor L. Rebolledo · Gastón L’Huillier · Juan D. Velásquez
Department of Industrial Engineering, University of Chile, República 701, Santiago, Chile

J.D. Velásquez and L.C. Jain (Eds.): Advanced Techniques in Web Intelligence – 1, SCI 311, pp. 49–77. springerlink.com © Springer-Verlag Berlin Heidelberg 2010

3.1 Introduction

According to the Web Intelligence Consortium (WIC)¹, “Web Intelligence (WI) has been recognised as a new direction for scientific research and development to explore the fundamental roles as well as practical impacts of Artificial Intelligence (AI)² and advanced Information Technology (IT)³ on the next generation of Web-empowered products, systems, services, and activities”. In other words, WI seeks ways to evolve from a Web of data to a Web of knowledge and wisdom. As Figure 3.1 shows, web data sources must be understood first, particularly the interactions between web servers and browsers. New ways of preprocessing web data are therefore needed in order to extract information, which must then be stored in repositories or Data Webhouses [29]. Nevertheless, information alone is not enough to make intelligent decisions; knowledge is also required [54].
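To make the preprocessing step more concrete, the following is a minimal sketch of how raw web server log entries might be cleaned and grouped into visitor sessions. It is an illustration only, not the method of this chapter: it assumes logs in Common Log Format, treats requests for static files and non-200 responses as the "irrelevant" data, identifies visitors by host address alone, and uses a 30-minute inactivity timeout. The names `parse`, `sessionize`, `STATIC` and `TIMEOUT` are hypothetical.

```python
import re
from datetime import datetime, timedelta

# Common Log Format: host ident user [time] "method path protocol" status size
CLF = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+'
)

# Assumptions for this sketch: which hits count as irrelevant, and the timeout.
STATIC = (".css", ".js", ".png", ".jpg", ".gif", ".ico")
TIMEOUT = timedelta(minutes=30)


def parse(line):
    """Return (host, timestamp, path, status), or None if the line is malformed."""
    m = CLF.match(line)
    if m is None:
        return None
    ts = datetime.strptime(m["time"], "%d/%b/%Y:%H:%M:%S %z")
    return m["host"], ts, m["path"], int(m["status"])


def sessionize(lines):
    """Group page requests into sessions; lines are assumed to be in time order."""
    sessions = []  # list of (host, [paths])
    last = {}      # host -> (timestamp of last kept hit, index into sessions)
    for line in lines:
        rec = parse(line)
        if rec is None:
            continue
        host, ts, path, status = rec
        # Drop failed requests and static resources.
        if status != 200 or path.lower().endswith(STATIC):
            continue
        prev = last.get(host)
        if prev is None or ts - prev[0] > TIMEOUT:
            # First hit from this host, or a long pause: start a new session.
            sessions.append((host, []))
            idx = len(sessions) - 1
        else:
            idx = prev[1]
        sessions[idx][1].append(path)
        last[host] = (ts, idx)
    return sessions
```

Different choices here (another timeout, identifying visitors by cookie instead of host, keeping query strings) would produce different sessions from the same log, which is one reason the preprocessing applied to the data is worth recording alongside any pattern mined from it.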

Fig. 3.1 Web Intelligence Overview

¹ It is an international, non-profit organization dedicated to advance world-wide scientific research and industrial developm