Optimal feature selection for speech emotion recognition using enhanced cat swarm optimization algorithm

PDF / 1,018,374 Bytes
9 Pages / 595.276 x 790.866 pts Page_size
45 Downloads / 285 Views

Optimal feature selection for speech emotion recognition using enhanced cat swarm optimization algorithm M. Gomathy1 Received: 8 November 2019 / Accepted: 7 November 2020 © Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract Human interactions involve emotional cues that can be used to interpret the emotion expressed by the speaker. As the vocal emotions vary from one speaker to another, there is a chance of misinterpretation. To determine the emotion expressed by the speaker, a speech emotion recognizer can be utilized. It is known that speech expresses the emotional states of humans along with the syntax and semantic content of linguistic sentences. Therefore, human emotion recognition using speech signaling is possible. Speech emotion recognition is a crucial and challenging task in which the feature extraction plays a prominent role in its performance. Determining emotional states in speech signals is a very challenging area for many reasons. The first issue of all speech emotion systems is the selection of the best features, which is powerful enough to distinguish various emotions. The presence of different language, pronunciation, sentences, style, and speakers adds additional difficulty since these characteristics include pitch and energy that directly alters most of the features extracted. Redundant features and high computational cost make emotion recognition an undesirable task. Instead of focusing on the words, the vocal changes and communicative pressure on the words should be taken as the primary consideration. The Enhanced Cat Swarm Optimization (ECSO) algorithm for feature extraction has been proposed to address these issues and it is not used in any existing speech emotion recognition approaches. The proposed approach achieves excellent performance in terms of accuracy, recognition rate, sensitivity, and specificity. Keywords Speech emotion recognition · Cat swarm optimization · Opposition based learning · Support vector neural network · Feature extraction

1 Introduction The speech signal consists of linguistic information and also paralinguistic one such as emotion. The modern automatic speech recognition systems have achieved high performance in neutral style speech recognition (Gharavian et al. 2012). The acoustic and prosodic features of speech are affected by emotions and speaking styles as well as speaker characteristics and linguistic features. Although the emotional state does not alter the linguistic content, it is an important factor in human communication and improving the voicebased man–machine interactions (El Ayadi et al. 2011). Man–machine interaction is one of the key goals in developing automatic emotion recognition (AER) systems. The * M. Gomathy [email protected] 1

Department of Computer Science, Shrimati Indira Gandhi College, Tiruchirappalli 620002, India

AER system is a key component in many applications such as spoken tutoring systems, medical-emergency domain to detect stress and pain, interactions with robots, computer games, call centers, a

Data Loading...

Optimal feature selection for speech emotion recognition using enhanced cat swarm optimization algorithm

Recommend Documents

Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Fea

A hybrid grasshopper and new cat swarm optimization algorithm for feature selection and optimization of multi-layer perc

Speaker independent feature selection for speech emotion recognition: A multi-task approach

Deep Residual Local Feature Learning for Speech Emotion Recognition

Pattern recognition and features selection for speech emotion recognition model using deep learning

Feature Selection for Vocal Segmentation Using Social Emotional Optimization Algorithm

Performance of deer hunting optimization based deep learning algorithm for speech emotion recognition

Feature Selection Optimization Using a Hybrid Genetic Algorithm

Multi-features Integration for Speech Emotion Recognition

Deep learning based facial expression recognition using improved Cat Swarm Optimization

Offline music symbol recognition using Daisy feature and quantum Grey wolf optimization based feature selection

ESSAWOA: Enhanced Whale Optimization Algorithm integrated with Salp Swarm Algorithm for global optimization