Imperial College London

Mr Mike Brookes

Faculty of Engineering, Department of Electrical and Electronic Engineering

Reader in Signal Processing
 
 
 

Contact

 

+44 (0)20 7594 6165
mike.brookes

 
 

Assistant

 

Miss Charlotte Grady +44 (0)20 7594 6267

 

Location

 

814, Electrical Engineering, South Kensington Campus



 

Publications


106 results found

Dionelis N, Brookes M, Speech Enhancement Using Modulation-Domain Kalman Filtering with Active Speech Level Normalized Log-Spectrum Global Priors, 25th European Signal Processing Conference (EUSIPCO), Publisher: IEEE

CONFERENCE PAPER

Doire CSJ, Brookes DM, Naylor PA, De Sena E, van Waterschoot T, Jensen SHJ et al., Acoustic Environment Control: Implementation of a Reverberation Enhancement System, AES 60th International Conference: DREAMS (Dereverberation and Reverberation of Audio, Music, and Speech)

Reverberation enhancement systems allow the active control of the acoustic environment. They are subject to instability issues due to acoustic feedback, and are often installed permanently in large halls, sometimes at great cost. In this paper, we explore the possibility of implementing a cost-effective reverberation enhancement system to control the acoustics of typical rooms using a combination of spatial filtering, automatic calibration, adaptive notch filters, howling detection and manual adjustments. The effectiveness of the system is then tested inside a small soundproof booth.

CONFERENCE PAPER

Stanton R, Gaubitch N, Naylor P, Brookes DM et al., A Differentiable Approximation to Speech Intelligibility Index with Applications to Listening Enhancement, AES Intl Conf on Audio Forensics

The Speech Intelligibility Index is a standardised objective measure for estimating the intelligibility of speech in noise. It is, however, difficult to use in the iterative optimisation of speech enhancement algorithms because it is a discontinuous function of its input parameters. In this paper, we derive an approximation for the Speech Intelligibility Index that is both continuous and differentiable, which allows for more efficient optimisation procedures. The use of the approximation is demonstrated in an application to near-end speech enhancement.

CONFERENCE PAPER

Dionelis N, Brookes M, 2017, MODULATION-DOMAIN SPEECH ENHANCEMENT USING A KALMAN FILTER WITH A BAYESIAN UPDATE OF SPEECH AND NOISE IN THE LOG-SPECTRAL DOMAIN, Conference on Hands-Free Speech Communications and Microphone Arrays (HSCMA), Publisher: IEEE, Pages: 111-115

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, 2017, Robust and efficient Bayesian adaptive psychometric function estimation, JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, Vol: 141, Pages: 2501-2512, ISSN: 0001-4966

JOURNAL ARTICLE

Doire CSJ, Brookes M, Naylor PA, Hicks CM, Betts D, Dmour MA, Jensen SH et al., 2017, Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 25, Pages: 572-587, ISSN: 2329-9290

JOURNAL ARTICLE

Koulouri A, Brookes M, Rimpilaeinen V, 2017, Vector tomography for reconstructing electric fields with non-zero divergence in bounded domains, JOURNAL OF COMPUTATIONAL PHYSICS, Vol: 329, Pages: 73-90, ISSN: 0021-9991

JOURNAL ARTICLE

Lightburn L, De Sena E, Moore A, Naylor PA, Brookes M et al., 2017, Improving the perceptual quality of ideal binary masked speech, Pages: 661-665, ISSN: 1520-6149

© 2017 IEEE. It is known that applying a time-frequency binary mask to very noisy speech can improve its intelligibility but results in poor perceptual quality. In this paper we propose a new approach to applying a binary mask that combines the intelligibility gains of conventional binary masking with the perceptual quality gains of a classical speech enhancer. The binary mask is not applied directly as a time-frequency gain as in most previous studies. Instead, the mask is used to supply prior information to a classical speech enhancer about the probability of speech presence in different time-frequency regions. Using an oracle ideal binary mask, we show that the proposed method results in a higher predicted quality than other methods of applying a binary mask whilst preserving the improvements in predicted intelligibility.

CONFERENCE PAPER

Moore AH, Brookes M, Naylor PA, 2017, Robust spherical harmonic domain interpolation of spatially sampled array manifolds, Pages: 521-525, ISSN: 1520-6149

© 2017 IEEE. Accurate interpolation of the array manifold is an important first step for the acoustic simulation of rapidly moving microphone arrays. Spherical harmonic domain interpolation has been proposed and well studied in the context of head-related transfer functions but has focussed on perceptual, rather than numerical, accuracy. In this paper we analyze the effect of measurement noise on spatial aliasing. Based on this analysis we propose a method for selecting the truncation orders for the forward and reverse spherical Fourier transforms given only the noisy samples in such a way that the interpolation error is minimized. The proposed method achieves up to 1.7 dB improvement over the baseline approach.

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2017, Frequency-domain under-modelled blind system identification based on cross power spectrum and sparsity regularization, Pages: 591-595, ISSN: 1520-6149

© 2017 IEEE. In room acoustics, under-modelled multichannel blind system identification (BSI) aims to estimate the early part of the room impulse responses (RIRs), and it can be widely used in applications such as speaker localization, room geometry identification and beamforming-based speech dereverberation. In this paper we extend our recent study on under-modelled BSI from the time domain to the frequency domain, such that the RIRs can be updated frame-wise and the efficiency of the Fast Fourier Transform (FFT) is exploited to reduce the computational complexity. Analogous to the cross-correlation based criterion in the time domain, a frequency-domain cross power spectrum based criterion is proposed. As the early RIRs are usually sparse, the RIRs are estimated by jointly maximizing the cross power spectrum based criterion in the frequency domain and minimizing the ℓ1-norm sparsity measure in the time domain. A two-stage LMS updating algorithm is derived to achieve joint optimization of these two targets. The experimental results in different under-modelled scenarios demonstrate the effectiveness of the proposed method.

CONFERENCE PAPER

Dionelis N, Brookes M, 2016, ACTIVE SPEECH LEVEL ESTIMATION IN NOISY SIGNALS WITH QUADRATURE NOISE SUPPRESSION, 24th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1193-1197, ISSN: 2076-1465

CONFERENCE PAPER

Koulouri A, Rimpilaeinen V, Brookes M, Kaipio JP et al., 2016, Compensation of domain modelling errors in the inverse source problem of the Poisson equation: Application in electroencephalographic imaging, APPLIED NUMERICAL MATHEMATICS, Vol: 106, Pages: 24-36, ISSN: 0168-9274

JOURNAL ARTICLE

Lawson M, Brookes M, Dragotti PL, 2016, Capturing the plenoptic function in a swipe, Conference on Applications of Digital Image Processing XXXIX, Publisher: SPIE-INT SOC OPTICAL ENGINEERING, ISSN: 0277-786X

CONFERENCE PAPER

Lightburn L, Brookes M, 2016, A WEIGHTED STOI INTELLIGIBILITY METRIC BASED ON MUTUAL INFORMATION, IEEE International Conference on Acoustics, Speech, and Signal Processing, Publisher: IEEE, Pages: 5365-5369, ISSN: 1520-6149

CONFERENCE PAPER

Sharma D, Wang Y, Naylor PA, Brookes M et al., 2016, A data-driven non-intrusive measure of speech quality and intelligibility, SPEECH COMMUNICATION, Vol: 80, Pages: 84-94, ISSN: 0167-6393

JOURNAL ARTICLE

Wang Y, Brookes M, 2016, SPEECH ENHANCEMENT USING AN MMSE SPECTRAL AMPLITUDE ESTIMATOR BASED ON A MODULATION DOMAIN KALMAN FILTER WITH A GAMMA PRIOR, IEEE International Conference on Acoustics, Speech, and Signal Processing, Publisher: IEEE, Pages: 5225-5229, ISSN: 1520-6149

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2016, UNDER-MODELLED BLIND SYSTEM IDENTIFICATION FOR TIME DELAY ESTIMATION IN REVERBERANT ENVIRONMENTS, 15th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2016, Cross-Correlation Based Under-Modelled Multichannel Blind Acoustic System Identification with Sparsity Regularization, 24th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 718-722, ISSN: 2076-1465

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, Betts D, Hicks CM, Dmour MA, Jensen SH et al., 2015, SINGLE-CHANNEL BLIND ESTIMATION OF REVERBERATION PARAMETERS, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 31-35, ISSN: 1520-6149

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, Jensen SH et al., 2015, DATA-DRIVEN STATISTICAL MODELLING OF ROOM IMPULSE RESPONSES IN THE POWER DOMAIN, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2466-2470, ISSN: 2076-1465

CONFERENCE PAPER

Hu M, Doclo S, Sharma D, Brookes M, Naylor PA et al., 2015, NOISE ROBUST BLIND SYSTEM IDENTIFICATION ALGORITHMS BASED ON A RAYLEIGH QUOTIENT COST FUNCTION, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2476-2480, ISSN: 2076-1465

CONFERENCE PAPER

Hu M, Parada PP, Sharma D, Doclo S, van Waterschoot T, Brookes M, Naylor PA et al., 2015, SINGLE-CHANNEL SPEAKER DIARIZATION BASED ON SPATIAL FEATURES, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168

CONFERENCE PAPER

Hu M, Sharma D, Doclo S, Brookes M, Naylor PA et al., 2015, SPEAKER CHANGE DETECTION AND SPEAKER DIARIZATION USING SPATIAL INFORMATION, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5743-5747, ISSN: 1520-6149

CONFERENCE PAPER

Lightburn L, Brookes M, 2015, SOBM - A BINARY MASK FOR NOISY SPEECH THAT OPTIMISES AN OBJECTIVE INTELLIGIBILITY METRIC, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5078-5082, ISSN: 1520-6149

CONFERENCE PAPER

Gilliam C, Dragotti P-L, Brookes M, 2014, On the Spectrum of the Plenoptic Function, IEEE TRANSACTIONS ON IMAGE PROCESSING, Vol: 23, Pages: 502-516, ISSN: 1057-7149

JOURNAL ARTICLE

Gonzalez S, Brookes M, 2014, PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 22, Pages: 518-530, ISSN: 2329-9290

JOURNAL ARTICLE

Gonzalez S, Brookes M, 2014, MASK-BASED ENHANCEMENT FOR VERY LOW QUALITY SPEECH, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, ISSN: 1520-6149

CONFERENCE PAPER

Hilkhuysen G, Gaubitch N, Brookes M, Huckvale M et al., 2014, Effects of noise suppression on intelligibility. II: An attempt to validate physical metrics, JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, Vol: 135, Pages: 439-450, ISSN: 0001-4966

JOURNAL ARTICLE

Jones Z, Brookes M, Dragotti PL, Benton D et al., 2014, WIDE-BASELINE IMAGE CHANGE DETECTION, IEEE International Conference on Image Processing (ICIP), Publisher: IEEE, Pages: 1589-1593, ISSN: 1522-4880

CONFERENCE PAPER

Pearson J, Visentini-Scarzanella M, Brookes M, Dragotti PL et al., 2014, TILTED LAYER-BASED MODELING FOR ENHANCED LIGHT-FIELD PROCESSING AND IMAGE BASED RENDERING, IEEE International Conference on Image Processing (ICIP), Publisher: IEEE, Pages: 1917-1921, ISSN: 1522-4880

CONFERENCE PAPER

This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.
