Imperial College London

DrAlastairMoore

Faculty of EngineeringDepartment of Electrical and Electronic Engineering

Research Fellow in Acoustic Signal Processing
 
 
 
//

Contact

 

alastair.h.moore

 
 
//

Location

 

809Electrical EngineeringSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@article{Xue:2020:10.1109/TASLP.2020.3040850,
author = {Xue, W and Moore, A and Brookes, D and Naylor, P},
doi = {10.1109/TASLP.2020.3040850},
journal = {IEEE Transactions on Audio, Speech and Language Processing},
pages = {393--405},
title = {Speech enhancement based on modulation-domain parametric multichannel Kalman filtering},
url = {http://dx.doi.org/10.1109/TASLP.2020.3040850},
volume = {29},
year = {2020}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - Recently we presented a modulation-domain multichannel Kalman filtering (MKF) algorithm for speech enhancement, which jointly exploits the inter-frame modulation-domain temporal evolution of speech and the inter-channel spatial correlation to estimate the clean speech signal. The goal of speech enhancement is to suppress noise while keeping the speech undistorted, and a key problem is to achieve the best trade-off between speech distortion and noise reduction. In this paper, we extend the MKF by presenting a modulation-domain parametric MKF (PMKF) which includes a parameter that enables flexible control of the speech enhancement behaviour in each time-frequency (TF) bin. Based on the decomposition of the MKF cost function, a new cost function for PMKF is proposed, which uses the controlling parameter to weight the noise reduction and speech distortion terms. An optimal PMKF gain is derived using a minimum mean squared error (MMSE) criterion. We analyse the performance of the proposed MKF, and show its relationship to the speech distortion weighted multichannel Wiener filter (SDW-MWF). To evaluate the impact of the controlling parameter on speech enhancement performance, we further propose PMKF speech enhancement systems in which the controlling parameter is adaptively chosen in each TF bin. Experiments on a publicly available head-related impulse response (HRIR) database in different noisy and reverberant conditions demonstrate the effectiveness of the proposed method.
AU - Xue,W
AU - Moore,A
AU - Brookes,D
AU - Naylor,P
DO - 10.1109/TASLP.2020.3040850
EP - 405
PY - 2020///
SN - 1558-7916
SP - 393
TI - Speech enhancement based on modulation-domain parametric multichannel Kalman filtering
T2 - IEEE Transactions on Audio, Speech and Language Processing
UR - http://dx.doi.org/10.1109/TASLP.2020.3040850
UR - https://ieeexplore.ieee.org/document/9272832
UR - http://hdl.handle.net/10044/1/85030
VL - 29
ER -