Imperial College London

Mr Mike Brookes

Faculty of EngineeringDepartment of Electrical and Electronic Engineering

Reader in Signal Processing
 
 
 
//

Contact

 

+44 (0)20 7594 6165mike.brookes Website

 
 
//

Assistant

 

Miss Charlotte Grady +44 (0)20 7594 6267

 
//

Location

 

814Electrical EngineeringSouth Kensington Campus

//

Summary

 

Publications

Publication Type
Year
to

106 results found

Dionelis N, Brookes, 2017, Speech Enhancement Using Modulation-Domain Kalman Filtering with Active Speech Level Normalized Log-Spectrum Global Priors, 25th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, ISSN: 2076-1465

We describe a single-channel speech enhancement algorithm that is based on modulation-domain Kalman filtering that tracks the inter-frame time evolution of the speech logpower spectrum in combination with the long-term average speech log-spectrum. We use offline-trained log-power spectrum global priors incorporated in the Kalman filter prediction and update steps for enhancing noise suppression. In particular, we train and utilize Gaussian mixture model priors for speech in the log-spectral domain that are normalized with respect to the active speech level. The Kalman filter update step uses the log-power spectrum global priors together with the local priors obtained from the Kalman filter prediction step. The logspectrum Kalman filtering algorithm, which uses the theoretical phase factor distribution and improves the modeling of the modulation features, is evaluated in terms of speech quality. Different algorithm configurations, dependent on whether global priors and/or Kalman filter noise tracking are used, are compared in various noise types.

CONFERENCE PAPER

Dionelis N, Brookes M, 2017, MODULATION-DOMAIN SPEECH ENHANCEMENT USING A KALMAN FILTER WITH A BAYESIAN UPDATE OF SPEECH AND NOISE IN THE LOG-SPECTRAL DOMAIN, Conference on Hands-Free Speech Communications and Microphone Arrays (HSCMA), Publisher: IEEE, Pages: 111-115

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, 2017, Robust and efficient Bayesian adaptive psychometric function estimation, JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, Vol: 141, Pages: 2501-2512, ISSN: 0001-4966

JOURNAL ARTICLE

Doire CSJ, Brookes M, Naylor PA, Hicks CM, Betts D, Dmour MA, Jensen SHet al., 2017, Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 25, Pages: 572-587, ISSN: 2329-9290

JOURNAL ARTICLE

Koulouri A, Brookes M, Rimpilaeinen V, 2017, Vector tomography for reconstructing electric fields with non-zero divergence in bounded domains, JOURNAL OF COMPUTATIONAL PHYSICS, Vol: 329, Pages: 73-90, ISSN: 0021-9991

JOURNAL ARTICLE

Lightburn L, De Sena E, Moore A, Naylor PA, Brookes Met al., 2017, Improving the perceptual quality of ideal binary masked speech, Pages: 661-665, ISSN: 1520-6149

© 2017 IEEE. It is known that applying a time-frequency binary mask to very noisy speech can improve its intelligibility but results in poor perceptual quality. In this paper we propose a new approach to applying a binary mask that combines the intelligibility gains of conventional binary masking with the perceptual quality gains of a classical speech enhancer. The binary mask is not applied directly as a time-frequency gain as in most previous studies. Instead, the mask is used to supply prior information to a classical speech enhancer about the probability of speech presence in different time-frequency regions. Using an oracle ideal binary mask, we show that the proposed method results in a higher predicted quality than other methods of applying a binary mask whilst preserving the improvements in predicted intelligibility.

CONFERENCE PAPER

Moore AH, Brookes M, Naylor PA, 2017, Robust spherical harmonic domain interpolation of spatially sampled array manifolds, Pages: 521-525, ISSN: 1520-6149

© 2017 IEEE. Accurate interpolation of the array manifold is an important first step for the acoustic simulation of rapidly moving microphone arrays. Spherical harmonic domain interpolation has been proposed and well studied in the context of head-related transfer functions but has focussed on perceptual, rather than numerical, accuracy. In this paper we analyze the effect of measurement noise on spatial aliasing. Based on this analysis we propose a method for selecting the truncation orders for the forward and reverse spherical Fourier transforms given only the noisy samples in such a way that the interpolation error is minimized. The proposed method achieves up to 1.7 dB improvement over the baseline approach.

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2017, Frequency-domain under-modelled blind system identification based on cross power spectrum and sparsity regularization, Pages: 591-595, ISSN: 1520-6149

© 2017 IEEE. In room acoustics, under-modelled multichannel blind system identification (BSI) aims to estimate the early part of the room impulse responses (RIRs), and it can be widely used in applications such as speaker localization, room geometry identification and beamforming based speech dereverberation. In this paper we extend our recent study on under-modelled BSI from the time domain to the frequency domain, such that the RIRs can be updated frame-wise and the efficiency of Fast Fourier Transform (FFT) is exploited to reduce the computational complexity. Analogous to the cross-correlation based criterion in the time domain, a frequency-domain cross power spectrum based criterion is proposed. As the early RIRs are usually sparse, the RIRs are estimated by jointly maximizing the cross power spectrum based criterion in the frequency domain and minimizing the l 1 -norm sparsity measure in the time domain. A two-stage LMS updating algorithm is derived to achieve joint optimization of these two targets. The experimental results in different under-modelled scenarios demonstrate the effectiveness of the proposed method.

CONFERENCE PAPER

Dionelis N, Brookes M, 2016, ACTIVE SPEECH LEVEL ESTIMATION IN NOISY SIGNALS WITH QUADRATURE NOISE SUPPRESSION, 24th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1193-1197, ISSN: 2076-1465

CONFERENCE PAPER

Koulouri A, Rimpilaeinen V, Brookes M, Kaipio JPet al., 2016, Compensation of domain modelling errors in the inverse source problem of the Poisson equation: Application in electroencephalographic imaging, APPLIED NUMERICAL MATHEMATICS, Vol: 106, Pages: 24-36, ISSN: 0168-9274

JOURNAL ARTICLE

Lawson M, Brookes M, Dragotti PL, 2016, Capturing the plenoptic function in a swipe, Conference on Applications of Digital Image Processing XXXIX, Publisher: SPIE-INT SOC OPTICAL ENGINEERING, ISSN: 0277-786X

CONFERENCE PAPER

Lightbum L, Brookes M, 2016, A WEIGHTED STOI INTELLIGIBILITY METRIC BASED ON MUTUAL INFORMATION, IEEE International Conference on Acoustics, Speech, and Signal Processing, Publisher: IEEE, Pages: 5365-5369, ISSN: 1520-6149

CONFERENCE PAPER

Sharma D, Wang Y, Naylor PA, Brookes Met al., 2016, A data-driven non-intrusive measure of speech quality and intelligibility, SPEECH COMMUNICATION, Vol: 80, Pages: 84-94, ISSN: 0167-6393

JOURNAL ARTICLE

Wang Y, Brookes M, 2016, SPEECH ENHANCEMENT USING AN MMSE SPECTRAL AMPLITUDE ESTIMATOR BASED ON A MODULATION DOMAIN KALMAN FILTER WITH A GAMMA PRIOR, IEEE International Conference on Acoustics, Speech, and Signal Processing, Publisher: IEEE, Pages: 5225-5229, ISSN: 1520-6149

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2016, UNDER-MODELLED BLIND SYSTEM IDENTIFICATION FOR TIME DELAY ESTIMATION IN REVERBERANT ENVIRONMENTS, 15th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2016, Cross-Correlation Based Under-Modelled Multichannel Blind Acoustic System Identification with Sparsity Regularization, 24th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 718-722, ISSN: 2076-1465

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, Betts D, Hicks CM, Dmour MA, Jensen SHet al., 2015, SINGLE-CHANNEL BLIND ESTIMATION OF REVERBERATION PARAMETERS, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 31-35, ISSN: 1520-6149

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, Jensen SHet al., 2015, DATA-DRIVEN STATISTICAL MODELLING OF ROOM IMPULSE RESPONSES IN THE POWER DOMAIN, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2466-2470, ISSN: 2076-1465

CONFERENCE PAPER

Hu M, Doclo S, Sharma D, Brookes M, Naylor PAet al., 2015, NOISE ROBUST BLIND SYSTEM IDENTIFICATION ALGORITHMS BASED ON A RAYLEIGH QUOTIENT COST FUNCTION, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2476-2480, ISSN: 2076-1465

CONFERENCE PAPER

Hu M, Parada PP, Sharma D, Doclo S, van Waterschoot T, Brookes M, Naylor PAet al., 2015, SINGLE-CHANNEL SPEAKER DIARIZATION BASED ON SPATIAL FEATURES, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168

CONFERENCE PAPER

Hu M, Sharma D, Doclo S, Brookes M, Naylor PAet al., 2015, SPEAKER CHANGE DETECTION AND SPEAKER DIARIZATION USING SPATIAL INFORMATION, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5743-5747, ISSN: 1520-6149

CONFERENCE PAPER

Lightburn L, Brookes M, 2015, SOBM - A BINARY MASK FOR NOISY SPEECH THAT OPTIMISES AN OBJECTIVE INTELLIGIBILITY METRIC, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5078-5082, ISSN: 1520-6149

CONFERENCE PAPER

Gilliam C, Dragotti P-L, Brookes M, 2014, On the Spectrum of the Plenoptic Function, IEEE TRANSACTIONS ON IMAGE PROCESSING, Vol: 23, Pages: 502-516, ISSN: 1057-7149

JOURNAL ARTICLE

Gonzalez S, Brookes M, 2014, PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 22, Pages: 518-530, ISSN: 2329-9290

JOURNAL ARTICLE

Gonzalez S, Brookes M, 2014, MASK-BASED ENHANCEMENT FOR VERY LOW QUALITY SPEECH, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, ISSN: 1520-6149

CONFERENCE PAPER

Hilkhuysen G, Gaubitch N, Brookes M, Huckvale Met al., 2014, Effects of noise suppression on intelligibility. II: An attempt to validate physical metrics, JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, Vol: 135, Pages: 439-450, ISSN: 0001-4966

JOURNAL ARTICLE

Jones Z, Brookes M, Dragotti PL, Benton Det al., 2014, WIDE-BASELINE IMAGE CHANGE DETECTION, IEEE International Conference on Image Processing (ICIP), Publisher: IEEE, Pages: 1589-1593, ISSN: 1522-4880

CONFERENCE PAPER

Pearson J, Visentini-Scarzanella M, Brookes M, Dragotti PLet al., 2014, TILTED LAYER-BASED MODELING FOR ENHANCED LIGHT-FIELD PROCESSING AND IMAGE BASED RENDERING, IEEE International Conference on Image Processing (ICIP), Publisher: IEEE, Pages: 1917-1921, ISSN: 1522-4880

CONFERENCE PAPER

Stanton R, Brookes M, 2014, PATH UNCERTAINTY ROBUST BEAMFORMING, 22nd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1925-1929, ISSN: 2076-1465

CONFERENCE PAPER

Wang Y, Brookes M, 2014, SPEECH ENHANCEMENT USING A MODULATION DOMAIN KALMAN FILTER POST-PROCESSOR WITH A GAUSSIAN MIXTURE NOISE MODEL, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, ISSN: 1520-6149

CONFERENCE PAPER

This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.

Request URL: http://wlsprd.imperial.ac.uk:80/respub/WEB-INF/jsp/search-html.jsp Request URI: /respub/WEB-INF/jsp/search-html.jsp Query String: respub-action=search.html&id=00000744&limit=30&person=true