Application of glottal flow descriptors for pathological voice diagnosis
- PDF / 1,001,642 Bytes
- 18 Pages / 595.276 x 790.866 pts Page_size
- 20 Downloads / 141 Views
Application of glottal flow descriptors for pathological voice diagnosis Girish Gidaye1 · Jagannath Nirmal2 · Kadria Ezzine3 · Avinash Shrivas4 · Mondher Frikha3 Received: 7 May 2019 / Accepted: 19 January 2020 © Springer Science+Business Media, LLC, part of Springer Nature 2020
Abstract Acoustic analysis of speech signal enables automatic detection and classification of voice disorders along with its severity. This automatic assessment provides help to the clinician in initial diagnosis of pathological larynx in non-intrusive way. Voice pathologies damage the vocal cords and consequently alter the dynamics (fluctuation speed) of vocal cords. In this article, we have estimated glottal volume velocity waveform (GVVW) from the speech pressure waveforms of healthy and pathological subjects using quasi closed phase (QCP) glottal inverse filtering algorithm to capture altered dynamics of vocal cords. Closed-phase methods revealed notable stability in diverse voice qualities and sub-glottal pressures. The GVVW is the source of significant acoustical clues rooted in speech. The estimated GVVW is then parameterized by various time based, frequency based and Liljencrants–Fant (LF) model based glottal descriptors. Glottal descriptor’s vectors have been passed on to stochastic gradient descent (SGD) classifier for voice disorder evaluation. The normal pitch utterance of sustained vowel /a/ quarried from German, English, Arabic and Spanish voice databases is used. Information gain (IG) feature scoring technique is employed to select optimal descriptors and to rank them. Several intra and cross-database experiments were performed to explore the usefulness of glottal descriptors for voice disorder detection, severity detection and classification. Student’s t-tests were performed to validate the obtained results. Keywords Acoustic analysis · Glottal volume velocity waveform · Inverse filtering · Vocal fold · Voice disorder detection and classification · Stochastic gradient descent
1 Introduction
* Girish Gidaye [email protected] Jagannath Nirmal [email protected] Kadria Ezzine [email protected] Avinash Shrivas [email protected] Mondher Frikha [email protected] 1
Research Scholar: K. J. Somaiya College of Engineering, Vidyalankar Institute of Technology, Mumbai, India
2
K. J. Somaiya College of Engineering, Vidyavihar (E), Mumbai, India
3
ATISP, ENET’COM, Sfax University, Sfax, Tunisia
4
Vidyalankar Institute of Technology, Mumbai, India
In human beings speech is the natural mode of communication and its production is a common but a complex phenomenon. The speech provides the vital information about the sex, age, emotions, feelings, physical and mental health or cultural background of a person. When the voice differs from persons of your age or sex, it may be symptomatic of a voice disorder (Rose and Robertson 2002). The physical, medicinal, or neurological alterations in the voice production system are the common reasons of a voice dysfunction. Sometime voice disorder may occ
Data Loading...