A deceptive detection model based on topic, sentiment, and sentence structure information

PDF / 1,390,883 Bytes
14 Pages / 595.224 x 790.955 pts Page_size
40 Downloads / 246 Views

A deceptive detection model based on topic, sentiment, and sentence structure information Xiaodong Du1 · Ruiqi Zhu2 · Fuqiang Zhao1 · Fangzhou Zhao1 · Ping Han3 · Zhengyu Zhu1

© Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract Deceptive reviews on Web are a common phenomenon and how to detect them has a very important impact on products, services, and even business policies. In order to filter out deceptive reviews more accurately, a new model called Sentence Joint Topic Sentiment Model (SJTSM) is presented in this paper, which incorporates the sentence structure of reviews and the sentiment label information of words based on Latent Dirichlet Allocation (LDA) model to extract the review features. The proposed model employs Gibbs algorithm to estimate the maximum likelihood parameters and takes the vector of topicsentiment distribution as the review features. Then a voting system of multiple-classifier, which takes the extracted review feature vector as its input is designed to realize the classification of deceptive review detection. The comparative experiments on different public datasets with other existing methods based on LDA model show that the new classifying system based on SJTSM model can achieve more satisfying classification results on deceptive review detection. Keywords Deceptive detection · Topic model · Multiple-classifier · Feature extraction

1 Introduction In recent years, with the rapid development of Internet techniques, people are willing to express and share their opinions on online platforms. Online reviews about a product or service will often determine whether customers would like to buy it or not, and impact the business in the long term either positively or negatively. Therefore, some businesses attempt to fabricate fake or deceptive reviews to attract customers, which will seriously damage the rights of customers and market balance. Meanwhile, the negative effects of deceptive reviews are fatal to enterprises. Accordingly, how to automatically detect deceptive reviews is becoming Ping Han

[email protected] Zhengyu Zhu

zhu [email protected] 1

College of Computer Science, Chongqing University, Chongqing, 400044, China

2

School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC 3000, Australia

3

School of Foreign Languages and Cultures, Chongqing University, Chongqing, 400044, China

a challenge in the research field. Many researchers have proposed different models to filter deceptive reviews. The behavior of deceptive reviews can be detected by distinguishing their characteristics [1, 2]. Horne et al. [3] found that the overall title structure and the use of proper nouns in the title are very significant in differentiating fake from real. Choi et al. [4] proposed a deceptive detection method that combined Term Frequency-Inverse Document Frequency (TF-IDF) technique and symmetrical conditional probability. Similarly, Ahmed et al. [5] proposed a deceptive detection model that used TF-IDF as feature extraction technique

Data Loading...

A deceptive detection model based on topic, sentiment, and sentence structure information

Recommend Documents

Blog Topic Diffusion Prediction Model Based on Link Information Flow

Topic Information Collection Based on the Hidden Markov Model

A User Group Classification Model Based on Sentiment Analysis Under Microblog Hot Topic

A NLP framework based on meaningful latent-topic detection and sentiment analysis via fuzzy lattice reasoning on youtube

A Topic Evolution Model Based on Microblog Network

Financial Topic Detection Algorithm Based on Multi-feature Fusion

Topic Detection and Tracking

Topic Detection in Group Chat Based on Implicit Reply

Punctuation Prediction for Chinese Spoken Sentence Based on Model Combination

Research on Hot Topic Discovery Technology of Micro-blog Based on Biterm Topic Model

A Hot Topic Detection Approach on Chinese Microblogging

Topic representation model based on microblogging behavior analysis