Sentiment-Based Data Mining Approach for Classification and Analysis

Traditionally, individuals gather feedback from their friends or relatives before purchasing an item but today the trend is to identify the opinions of a variety of individuals around the globe using micro blogging data and twitter is such one famous micr

  • PDF / 590,797 Bytes
  • 15 Pages / 439.37 x 666.142 pts Page_size
  • 51 Downloads / 203 Views

DOWNLOAD

REPORT


Abstract Traditionally, individuals gather feedback from their friends or relatives before purchasing an item but today the trend is to identify the opinions of a variety of individuals around the globe using micro blogging data and twitter is such one famous micro blogger where user expresses their view in form of tweets. In this project, we have provided a data mining approach based on how nowadays user provides their view in tweets. This approach basically is around sentiment expressed in tweets to classify them into set of feature and also to analyze tweet to compare service or product-based organization. Keywords Data mining

 Classification  Sentiment analysis  Twitter analysis

1 Introduction 1.1

Data Mining

Data Mining or knowledge extraction is a method well known for pattern extraction and to make data more understandable. Data mining is a root or basic to many sub category in it like business mining, medical mining, music mining, text mining, etc., and every year something new is add as an improvement over previous methods or as an innovation. With these new techniques Data mining has become important element in many fields and people are using it more with all their historic data to get that extra knowledge.

V. Vashi (&)  L.D. Dhinesh Babu SITE, VIT University, Vellore, India e-mail: [email protected] © Springer Science+Business Media Singapore 2016 S.C. Satapathy et al. (eds.), Proceedings of International Conference on ICT for Sustainable Development, Advances in Intelligent Systems and Computing 408, DOI 10.1007/978-981-10-0129-1_61

581

582

1.2

V. Vashi and L.D. Dhinesh Babu

Text Mining

Text mining is one of data mining variation where as input we take unstructured text and try to find some information from it which is more useful then text. There are many research fields in text mining as well like Academic applications, Online media applications, Opinion mining, Sentiment analysis, etc., our focus here will be on Sentiment analysis (Figs. 1 and 2).

1.3

Sentiment Analysis

Sentiment Analysis is a text mining method and it is a study of someone’s sentiments about something like good–bad or positive–negative, etc. People always put sentiment analysis and opinion mining as same but there is difference between and Data Collection Twitter

Data Preprocessing

Data Classification Preprocessed Tweets

Classification Algorithm

Training Data

Classification Result

Scoring Scoring Parameter

Scoring Algorithm

Result

Visualization Bar Chart

Pie Chart

Fig. 1 Sentiment analysis process for tweets as data

Data Driven Document

Sentiment-Based Data Mining Approach …

583

Data Collection Twitter

Data Preprocessing

Data Classification Preprocessed Tweets

Sentimental Classification

Sentimental Words Feature Words and

Classification Result

Scoring Parameter Features Service=3 Time=2 Price=1 Sentiments Positive=1 Negative=-1

Bar Chart

Weights

Scoring

Result

Scoring Algorithm

Visualization Pie Chart

Data Driven Document

Fig. 2 Proposed sentiment-based classification and analysis flow