When Siri Knows How You Feel: Study of Machine Learning in Automatic Sentiment Recognition from Human Speech

Opinions and sentiments are essential to human activities and have a wide variety of applications. As many decision makers turn to social media due to large volume of opinion data available, efficient and accurate sentiment analysis is necessary to extrac

  • PDF / 4,290,281 Bytes
  • 12 Pages / 439.37 x 666.142 pts Page_size
  • 30 Downloads / 140 Views

DOWNLOAD

REPORT


Abstract. Opinions and sentiments are essential to human activities and have a wide variety of applications. As many decision makers turn to social media due to large volume of opinion data available, efficient and accurate sentiment analysis is necessary to extract those data. Hence, text sentiment analysis has recently become a popular field and has attracted many researchers. However, extracting sentiments from audio speech remains a challenge. This project explored the possibility of applying supervised Machine Learning in recognizing sentiments in English utterances on a sentence level. In addition, the project also aimed to examine the effect of combining acoustic and linguistic features on classification accuracy. Six audio tracks were randomly selected to be training data from 40 YouTube videos (monologue) with strong presence of sentiments. Speakers expressed sentiments towards products, films, or political events. These sentiments were manually labelled as negative and positive based on independent judgment of three experimenters. A wide range of acoustic and linguistic features were then analyzed and extracted using sound editing and text mining tools, respectively. A novel approach was proposed, which used a simplified sentiment score to integrate linguistic features and estimate sentiment valence. This approach improved negation analysis and hence increased overall accuracy. Results showed that when both linguistic and acoustic features were used, accuracy of sentiment recognition improved significantly, and that excellent prediction was achieved when the four classifiers were trained, respectively, namely, kNN, SVM, Neural Network, and Naïve Bayes. Possible sources of error and inherent challenges of audio sentiment analysis were discussed to provide potential directions for future research. Keywords: Sentiment analysis  Natural language processing Machine learning  Affective computing  Data analytics  Speech processing Computational linguistic

L. Zhang—Yale-NUS College, Singapore © Springer Nature Switzerland AG 2019 K. Arai et al. (Eds.): FICC 2018, AISC 887, pp. 591–602, 2019. https://doi.org/10.1007/978-3-030-03405-4_41

592

L. Zhang and E. Y. K. Ng

1 Introduction Sentiment analysis is the field of study that analyses opinions, sentiments, appraisals, attitudes, and emotions toward entities and their attributes [1]. Opinions and sentiments are essential to human activities and have a wide variety of applications. As many decision makers turn to social media due to large volume of opinion data available, efficient and accurate sentiment analysis is necessary to extract those data. Business organizations in different sectors use social media to find out consumer opinions to improve their products and services. Political party leaders need to know the current public sentiment to come up with campaign strategies. Government agencies also monitor citizens’ opinions on social media. Police agencies, for example, detect criminal intents and cyber threats by analyzing sentiment valence in social media posts