Application of Fuzzy-Integration-Based Multiple-Information Aggregation in Automatic Speech Recognition

Many real-world problems can be cast into a multiple-information aggregation framework where preliminary evaluations of separate information sources are combined to produce more accurate and reliable evaluation than would otherwise be the case. In this pa

PDF / 1,823,807 Bytes
15 Pages / 439.37 x 666.142 pts Page_size
73 Downloads / 205 Views

DOWNLOAD

REPORT

Abstract Many real-world problems can be cast into a multiple-information aggregation framework where preliminary evaluations of separate information sources are combined to produce more accurate and reliable evaluation than would otherwise be the case. In this paper we describe a syllable-proximity evaluation problem in automatic speech recognition that fits well into this aggregation framework. A fuzzy-integration-based approach is adopted as the aggregation operator and a gradient-based algorithm is described for learning parameters automatically from training data. Experiments using spontaneous speech material demonstrate that the fuzzy-integration-based aggregation approach has many advantages over other techniques in terms of both performance and interpretability of the system.

1 Introduction Many real-world problems such as pattern recognition and decision-making, involve input information from several different sources; the evidence from each source alone can only provide a partial account of all available information. A decision based on information from a single source may be sub-optimal or incorrect, and it is often the convergence of evidence from various sources that provides an accurate and reliable result. Thus, the appropriate aggregation of information from different sources contributes crucially to the success of a solution to such problems. For certain applications it may be possible to create a comprehensive model involving all sources of information with respect to the overall classification or decision. However, for many other tasks a comprehensive model can be difficult to develop; it is often more practical to first perform an evaluation based on each information stream and then combine the results into a single overall result. This latter approach is often referred to as multiple-information aggregation. D. Ruan et al. (eds.), Intelligent Sensory Evaluation © Springer-Verlag Berlin Heidelberg 2004

386

S. Chang and S. Greenberg

Besides requiring less complex models, the multiple-information aggregation approach has several additional advantages over the single, comprehensive-mode approach in practical applications. Performing evaluation individually on each information source minimizes the potentially harmful impact of background noise derived from various sources. Moreover, separating the initial evaluation from the aggregation process provides for flexibility in using a different computing strategy for each information source; certain methods, such as pooling of subjective evaluations, possess an inherent structure similar to the multiple-information aggregation framework and thus may benefit from such an approach. Because of requirements associated with computation and interpretation, as well as the inherent uncertainty, the general class of multi-criteria decision-making and multi-attribute optimization problems lends itself well to soft-computing based techniques. Besides the fuzzy-integration-based aggregation described in this paper, many other soft computing approaches have been

Data Loading...

Application of Fuzzy-Integration-Based Multiple-Information Aggregation in Automatic Speech Recognition

Recommend Documents

Automatic Speech Recognition of Galo

Automatic speech recognition: a survey

Isolated Word Automatic Speech Recognition System

Robust Adaptation to Non-Native Accents in Automatic Speech Recognition

Experiments on Automatic Recognition of Nonnative Arabic Speech

Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition

Automatic Speech Recognition of Arabic Phonemes with Neural Networks

Federated Acoustic Model Optimization for Automatic Speech Recognition

Toward Lexicon-Free Bangla Automatic Speech Recognition System

Automatic Speech Recognition on Mobile Devices and over Communication Networks

An Automatic Meter Recognition Method for In-Orbit Application

Speech Recognition