Inverse Filtering for Speech Dereverberation Less Sensitive to Noise and Room Transfer Function Fluctuations

PDF / 8,030,171 Bytes
12 Pages / 600.03 x 792 pts Page_size
32 Downloads / 283 Views

Research Article Inverse Filtering for Speech Dereverberation Less Sensitive to Noise and Room Transfer Function Fluctuations Takafumi Hikichi, Marc Delcroix, and Masato Miyoshi Media Information Laboratory, NTT Communication Science Laboratories, NTT Corporation, 2-4 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0237, Japan Received 16 November 2006; Accepted 2 February 2007 Recommended by Liang-Gee Chen Inverse filtering of room transfer functions (RTFs) is considered an attractive approach for speech dereverberation given that the time invariance assumption of the used RTFs holds. However, in a realistic environment, this assumption is not necessarily guaranteed, and the performance is degraded because the RTFs fluctuate over time and the inverse filter fails to remove the eﬀect of the RTFs. The inverse filter may amplify a small fluctuation in the RTFs and may cause large distortions in the filter’s output. Moreover, when interference noise is present at the microphones, the filter may also amplify the noise. This paper proposes a design strategy for the inverse filter that is less sensitive to such disturbances. We consider that reducing the filter energy is the key to making the filter less sensitive to the disturbances. Using this idea as a basis, we focus on the influence of three design parameters on the filter energy and the performance, namely, the regularization parameter, modeling delay, and filter length. By adjusting these three design parameters, we confirm that the performance can be improved in the presence of RTF fluctuations and interference noise. Copyright © 2007 Takafumi Hikichi et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1.

INTRODUCTION

Inverse filtering of room acoustics is useful in various applications such as sound reproduction, sound-field equalization, and speech dereverberation. Usually, room transfer functions (RTFs) are modeled as finite impulse response (FIR) filters, and inverse filters are designed to remove the eﬀect of the RTFs. When the RTFs are known a priori or are capable of being accurately estimated, this approach has been shown to achieve high inverse filtering performance [1– 4]. However, in actual acoustic environments, there are disturbances that aﬀect the inverse filtering performance. One cause of these disturbances is the fluctuation in the RTFs resulting from changes in such factors as source position and temperature [5–9]. As a result, an inverse filter correctly designed for one condition may not work well for another condition, and compensation or adaptation processing may become necessary. The sensitivity issue with inverse filtering in relation to the movement of a sound source or microphone has been addressed in several papers. In [8, 9], the sensitivity of inverse filters is quantified in terms of the mean-squared error (MSE), defined as the power of the deviation of the equal-

iz

Data Loading...

Inverse Filtering for Speech Dereverberation Less Sensitive to Noise and Room Transfer Function Fluctuations

Recommend Documents

Comparative Evaluation of Radial Basis Function Network Transfer Function for Filtering Speckle Noise in Synthetic Apert

Subspace Methods for Multimicrophone Speech Dereverberation

Noise Theory and Application to Physics From Fluctuations to Inf

Advanced Comb Filtering for Robust Speech Recognition

Dereverberation by Using Time-Variant Nature of Speech Production System

Noise and acoustic modeling with waveform generator in text-to-speech and neutral speech conversion

Noise Reduction in Speech Processing

A Review of Signal Subspace Speech Enhancement and Its Application to Noise Robust Speech Recognition

Locally Adaptive DCT Filtering for Signal-Dependent Noise Removal

Using Intermicrophone Correlation to Detect Speech in Spatially Separated Noise

Transfer function

Filtering Theory for a Weakly Coloured Noise Process