Session: MULT-P2
Time: 9:30 - 11:30, Thursday, May 10, 2001
Location: Exhibit Hall Area 2
Title: Blind Separation of Convolutive Mixtures
Chair: Zhi Ding

9:30, MULT-P2.1
A COMBINED APPROACH OF ARRAY PROCESSING AND INDEPENDENT COMPONENT ANALYSIS FOR BLIND SEPARATION OF ACOUSTIC SIGNALS
F. ASANO, S. EKEDA, M. OGAWA, H. ASOH, N. KITAWAKI
In this paper, two array signal processing techniques are combined with independent component analysis to enhance the performance of blind separation of acoustic signals in a reflective environment such as rooms. The first technique is the subspace method which reduces the effect of room reflection. The second technique is a method of solving permutation, in which coherency of the mixing matrix in adjacent frequencies is utilized.

9:30, MULT-P2.2
BLIND SOURCE SEPARATION COMBINING FREQUENCY-DOMAIN ICA AND BEAMFORMING
H. SARUWATARI, S. KURITA, K. TAKEDA
In this paper, we describe a new method of blind source separation (BSS) on a microphone array combining subband independent component analysis (ICA) and beamforming. The proposed array system consists of the following three sections: (1) subband-ICA-based BSS section with direction-of-arrival (DOA) estimation, (2) null beamforming section based on the estimated DOA information, and (3) integration of (1) and (2) based on the algorithm diversity. Using this technique, we can resolve the low-convergence problem through optimization in ICA. The results of the signal separation experiments reveal that the noise reduction rate (NRR) of about 18dB is obtained under the nonreverberant condition, and NRRs of 8dB and 6dB are obtained in the case that the reverberation times are 150msec and 300msec. These performances are superior to those of both simple ICA-based BSS and simple beamforming method.

9:30, MULT-P2.3
FUNDAMENTAL LIMITATION OF FREQUENCY DOMAIN BLIND SOURCE SEPARATION FOR CONVOLUTIVE MIXTURE OF SPEECH
S. ARAKI, S. MAKINO, T. NISHIKAWA, H. SARUWATARI
Despite several recent proposals to achieve Blind Source Separation (BSS) for realistic acoustic signal, separation performance is still not enough. In particular, when the length of impulse response is long, performance is highly limited. In this paper, we show it is useless to be constrained by the condition, P << T, where T is the frame size of FFT and P is the length of room impulse response. From our experiments, a frame size of 256 and 512 (32 and 64 ms at a sampling frequency of 8 kHz) is best even for the long room reverberation of T_R=150 and 300 ms. We also clarified the reason for poor performance of BSS in long reverberant environment, finding that separation is achieved chiefly for a direct jammer because BSS cannot calculate the inverse of the room transfer function both for the target and jammer signals.

9:30, MULT-P2.4
A SECOND-ORDER DIFFERENTIAL APPROACH FOR UNDERDETERMINED CONVOLUTIVE SOURCE SEPARATION
Y. DEVILLE, S. SAVOLDELLI
This paper concerns the underdetermined case of the convolutive source separation problem, i.e. the situation when the number of observed convolutively mixed signals is lower than the number of sources. We propose a criterion and associated algorithm which, unlike classical approaches, make it possible to perform the separation of a subset of these sources by exploiting their assumed non-stationarity properties. This approach uses the second-order statistics of the signals and adapts the filters of a direct separating system so as to cancel the "differential cross-correlation" of signals derived by this system. This new method is related to the general differential source separation concept that we proposed. Its effectiveness is shown by means of numerical tests.

9:30, MULT-P2.5
BLIND SOURCE SEPARATION OF CONVOLVED SOURCES BY JOINT APPROXIMATE DIAGONALIZATION OF CROSS-SPECTRAL DENSITY MATRICES
K. RAHBAR, J. REILLY
In this paper we present a new method for separating non-stationary sources from their convolutive mixtures based on approximate joint diagonalizing of the observed signals' cross-spectral density matrices. Several blind source separation (BSS) algorithms have been proposed which use approximate joint diagonalization of a set of scalar matrices to estimate the instantaneous mixing matrix. We extend the concept of approximate joint diagonalization to estimate MIMO FIR channels. Based on this estimate we then design a separating network which will recover the original sources up to only a permutation and scaling ambiguity for minimum phase channels. We eliminate the commonly experienced problem of arbitrary scaling and permutation at each frequency bin, by optimizing the cost function directly with respect to the time-domain channel variables. We demonstrate the performance of the algorithm by computer simulations using real speech data. Speech samples are available at: http://sparky.mcmaster.ca/SSP/telephony_kamran.htm.

9:30, MULT-P2.6
BLIND MIMO EQUALIZATION AND JOINT-DIAGONALIZATION CRITERIA
P. COMON, E. MOREAU
We consider the problem of convolutive blind signal separation through the optimization of contrast functions. In this work, we show that some links between contrasts and joint diagonalization criteria can be exhibited in the convolutive case. This allows to devise a constructive algorithm performing MIMO blind equalization, with the help of a joint approximate diagonalization of a set of matrices built from the observations. This analytical algorithm can be run block-wise, which is appropriate in the context of short burst communications.

9:30, MULT-P2.7
ADAPTIVE PARAUNITARY FILTER BANKS FOR CONTRAST-BASED MULTICHANNEL BLIND DECONVOLUTION
X. SUN, S. DOUGLAS
In this paper, we present novel algorithms for multichannel blind deconvolution under output whitening constraints. The algorithms are inspired by recently-developed procedures for gradient adaptive paraunitary filter banks. Several algorithms are developed, including one algorithm that successfully deconvolves mixtures of arbitrary non-zero kurtosis source signals. We provide detailed local stability analyses of the proposed methods to verify their capabilities. Simulations show that the methods successfully deconvolve spatio-temporal mixtures of statistically-independent source signals.

9:30, MULT-P2.8
A MULTIRESOLUTION APPROACH TO BLIND SEPARATION OF SPEECH SIGNALS IN A REVERBERANT ENVIRONMENT
M. IKRAM, D. MORGAN
The performance of existing blind speech separation methods is limited in a realistic reverberant environment, where a need for long un-mixing filters is imperative. We first show how these methods suffer while trying to balance the competing objectives of frequency-domain permutation alignment and spectral resolution. We then propose a multistage multiresolution algorithm, which aligns the un-mixing filter permutations over the whole frequency band without sacrificing spectral resolution. We perform experiments in both real and simulated reverberant environments, and obtain improved separation results that are comparable to the ideal benchmark obtained by aligning the permutations using prior knowledge of the mixing filters.

9:30, MULT-P2.9
FREQUENCY DOMAIN MULTI-CHANNEL SPEECH SEPARATION AND ITS APPLICATIONS
M. HANDA, T. NAGAI, A. KUREMATSU
In this paper, a multi-channel speech separation method for real environments is proposed. The proposed method is based on frequency assignment, namely the magnitude of each channel at the same frequency bin is compared with each other and it is assigned to the channel, to which it originally belongs. This method is a direct consequence of frequency domain interpretation of the eigendecomposition method proposed by Cao et al.. Furthermore, our proposed method does not require eigendecomposition, which consumes the costs of computation. We also present two example applications of the proposed method, that is, voice controlled computers in a multi-user environment and a noise removal in cellular phone using two microphones.

9:30, MULT-P2.10
FREQUENCY-DOMAIN CONTRAST FUNCTIONS FOR SEPARATION OF CONVOLUTIVE MIXTURES
J. PESQUET, B. CHEN, A. PETROPULU
This paper addresses the problem of blind separation of convolutive mixtures via contrast maximization. New frequency domain contrast functions are constructed based on second and higher-order spectra of the observations. They allow to separate mixtures of sources which are spatially independent, and temporally possibly non i.i.d. or non-linear processes. The proposed criteria provide a framework for extending to the convolutive case contrasts that have been proposed in the context of instantaneous mixtures.