A Context-Based Disambiguation Model for Sentiment Concepts Using a Bag-of-Concepts Approach

  • PDF / 802,725 Bytes
  • 14 Pages / 595.276 x 790.866 pts Page_size
  • 74 Downloads / 187 Views

DOWNLOAD

REPORT


A Context-Based Disambiguation Model for Sentiment Concepts Using a Bag-of-Concepts Approach Zeinab Rajabi 1

&

Mohammad Reza Valavi 1 & Maryam Hourali 1

Received: 4 October 2019 / Accepted: 14 May 2020 # Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract With the widespread dissemination of user-generated content on different web sites, social networks, and online consumer systems such as Amazon, the quantity of opinionated information available on the Internet has been increased. Sentiment analysis of user-generated content is one of the main cognitive computing branches; hence, it has attracted the attention of many scholars in recent years. One of the main tasks of the sentiment analysis is to detect polarity within a text. The existing polarity detection methods mainly focus on keywords and their naïve frequency counts; however, they less regard the meanings and implicit dimensions of the natural concepts. Although background knowledge plays a critical role in determining the polarity of concepts, it has been disregarded in polarity detection methods. This study presents a context-based model to solve ambiguous polarity concepts using commonsense knowledge. First, a model is presented to generate a source of ambiguous sentiment concepts based on SenticNet by computing the probability distribution. Then, the model uses a bag-of-concepts approach to remove ambiguities and semantic augmentation with the ConceptNet handling to overcome lost knowledge. ConceptNet is a large-scale semantic network with a large number of commonsense concepts. In this paper, the point mutual information (PMI) measure is used to select the contextual concepts having strong relationships with ambiguous concepts. The polarity of the ambiguous concepts is precisely detected using positive/negative contextual concepts and the relationship of the concepts in the semantic knowledge base. The text representation scheme is semantically enriched using Numberbatch, which is a word embedding model based on the concepts from the ConceptNet semantic network. In this regard, the cosine similarity metric is used to measure similarity and select a concept from the ConceptNet network for semantic augmentation. Pre-trained concepts vectors facilitate the more effective computation of semantic similarity among the concerned concepts. The proposed model is evaluated by applying a corpus of product reviews, called Semeval. The experimental results revealed an accuracy rate of 82.07%, representing the effectiveness of the proposed model. Keywords Sentiment analysis . Sentiment concept . Disambiguation . Commonsense knowledge . Bag-of-concepts . ConceptNet . Numberbatch . Word embedding

Introduction * Mohammad Reza Valavi [email protected] Zeinab Rajabi [email protected] Maryam Hourali [email protected] 1

Department of Electronic and Computer, Malek-Ashtar University of Technology, Tehran, Iran

In recent years, social networks, blogs, forums, and customer review websites have provided individuals with an opportunity to express