Imperial College London

Mr Mike Brookes

Faculty of EngineeringDepartment of Electrical and Electronic Engineering

Reader in Signal Processing
 
 
 
//

Contact

 

+44 (0)20 7594 6165mike.brookes Website

 
 
//

Assistant

 

Ms Melanie Albright +44 (0)20 7594 6267

 
//

Location

 

814Electrical EngineeringSouth Kensington Campus

//

Summary

 

Publications

Publication Type
Year
to

116 results found

Dionelis N, Brookes M, 2018, Phase-Aware Single-Channel Speech Enhancement With Modulation-Domain Kalman Filtering, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 26, Pages: 937-950, ISSN: 2329-9290

JOURNAL ARTICLE

Wang Y, Brookes M, 2018, Model-Based Speech Enhancement in the Modulation Domain, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 26, Pages: 580-594, ISSN: 2329-9290

JOURNAL ARTICLE

Xue W, Moore AH, Brookes M, Naylor PAet al., 2018, Modulation-domain multichannel kalman filtering for speech enhancement, IEEE/ACM Transactions on Audio Speech and Language Processing, Vol: 26, Pages: 1833-1847, ISSN: 2329-9290

© 2014 IEEE. Compared with single-channel speech enhancement methods, multichannel methods can utilize spatial information to design optimal filters. Although some filters adaptively consider second-order signal statistics, the temporal evolution of the speech spectrum is usually neglected. By using linear prediction (LP) to model the inter-frame temporal evolution of speech, single-channel Kalman filtering (KF) based methods have been developed for speech enhancement. In this paper, we derive a multichannel KF (MKF) that jointly uses both interchannel spatial correlation and interframe temporal correlation for speech enhancement. We perform LP in the modulation domain, and by incorporating the spatial information, derive an optimal MKF gain in the short-time Fourier transform domain. We show that the proposed MKF reduces to the conventional multichannel Wiener filter if the LP information is discarded. Furthermore, we show that, under an appropriate assumption, the MKF is equivalent to a concatenation of the minimum variance distortion response beamformer and a single-channel modulation-domain KF and therefore present an alternative implementation of the MKF. Experiments conducted on a public head-related impulse response database demonstrate the effectiveness of the proposed method.

JOURNAL ARTICLE

De Sena E, Brookes M, Naylor PA, van Waterschoot Tet al., 2017, Localization Experiments with Reporting by Head Orientation: Statistical Framework and Case Study, JOURNAL OF THE AUDIO ENGINEERING SOCIETY, Vol: 65, Pages: 982-996, ISSN: 1549-4950

JOURNAL ARTICLE

Dionelis N, Brookes M, 2017, MODULATION-DOMAIN SPEECH ENHANCEMENT USING A KALMAN FILTER WITH A BAYESIAN UPDATE OF SPEECH AND NOISE IN THE LOG-SPECTRAL DOMAIN, Conference on Hands-Free Speech Communications and Microphone Arrays (HSCMA), Publisher: IEEE, Pages: 111-115

CONFERENCE PAPER

Dionelis N, Brookes M, 2017, Speech Enhancement Using Modulation-Domain Kalman Filtering with Active Speech Level Normalized Log-Spectrum Global Priors, 25th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2309-2313, ISSN: 2076-1465

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, 2017, Robust and efficient Bayesian adaptive psychometric function estimation, JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, Vol: 141, Pages: 2501-2512, ISSN: 0001-4966

JOURNAL ARTICLE

Doire CSJ, Brookes M, Naylor PA, Hicks CM, Betts D, Dmour MA, Jensen SHet al., 2017, Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 25, Pages: 572-587, ISSN: 2329-9290

JOURNAL ARTICLE

Koulouri A, Brookes M, Rimpilaeinen V, 2017, Vector tomography for reconstructing electric fields with non-zero divergence in bounded domains, JOURNAL OF COMPUTATIONAL PHYSICS, Vol: 329, Pages: 73-90, ISSN: 0021-9991

JOURNAL ARTICLE

Lawson M, Brookes M, Dragotti PL, 2017, IDENTIFYING A MULTIPLE PLANE PLENOPTIC FUNCTION FROM A SWIPED IMAGE, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 1423-1427, ISSN: 1520-6149

CONFERENCE PAPER

Lightburn L, De Sena E, Moore A, Naylo PA, Brookes Met al., 2017, IMPROVING THE PERCEPTUAL QUALITY OF IDEAL BINARY MASKED SPEECH, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 661-665, ISSN: 1520-6149

CONFERENCE PAPER

Moore AH, Brookes M, Naylor PA, 2017, ROBUST SPHERICAL HARMONIC DOMAIN INTERPOLATION OF SPATIALLY SAMPLED ARRAY MANIFOLDS, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 521-525, ISSN: 1520-6149

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2017, Frequency-domain under-modelled blind system identification based on cross power spectrum and sparsity regularization, IEEE International Conference on Acoustics, Speech and Signal Processing, Pages: 591-595, ISSN: 1520-6149

© 2017 IEEE. In room acoustics, under-modelled multichannel blind system identification (BSI) aims to estimate the early part of the room impulse responses (RIRs), and it can be widely used in applications such as speaker localization, room geometry identification and beamforming based speech dereverberation. In this paper we extend our recent study on under-modelled BSI from the time domain to the frequency domain, such that the RIRs can be updated frame-wise and the efficiency of Fast Fourier Transform (FFT) is exploited to reduce the computational complexity. Analogous to the cross-correlation based criterion in the time domain, a frequency-domain cross power spectrum based criterion is proposed. As the early RIRs are usually sparse, the RIRs are estimated by jointly maximizing the cross power spectrum based criterion in the frequency domain and minimizing the l 1 -norm sparsity measure in the time domain. A two-stage LMS updating algorithm is derived to achieve joint optimization of these two targets. The experimental results in different under-modelled scenarios demonstrate the effectiveness of the proposed method.

CONFERENCE PAPER

Dionelis N, Brookes M, 2016, ACTIVE SPEECH LEVEL ESTIMATION IN NOISY SIGNALS WITH QUADRATURE NOISE SUPPRESSION, 24th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1193-1197, ISSN: 2076-1465

CONFERENCE PAPER

Koulouri A, Rimpilaeinen V, Brookes M, Kaipio JPet al., 2016, Compensation of domain modelling errors in the inverse source problem of the Poisson equation: Application in electroencephalographic imaging, APPLIED NUMERICAL MATHEMATICS, Vol: 106, Pages: 24-36, ISSN: 0168-9274

JOURNAL ARTICLE

Lawson M, Brookes M, Dragotti PL, 2016, Capturing the plenoptic function in a swipe, Conference on Applications of Digital Image Processing XXXIX, Publisher: SPIE-INT SOC OPTICAL ENGINEERING, ISSN: 0277-786X

CONFERENCE PAPER

Lightbum L, Brookes M, 2016, A WEIGHTED STOI INTELLIGIBILITY METRIC BASED ON MUTUAL INFORMATION, IEEE International Conference on Acoustics, Speech, and Signal Processing, Publisher: IEEE, Pages: 5365-5369, ISSN: 1520-6149

CONFERENCE PAPER

Sharma D, Wang Y, Naylor PA, Brookes Met al., 2016, A data-driven non-intrusive measure of speech quality and intelligibility, SPEECH COMMUNICATION, Vol: 80, Pages: 84-94, ISSN: 0167-6393

JOURNAL ARTICLE

Wang Y, Brookes M, 2016, SPEECH ENHANCEMENT USING AN MMSE SPECTRAL AMPLITUDE ESTIMATOR BASED ON A MODULATION DOMAIN KALMAN FILTER WITH A GAMMA PRIOR, IEEE International Conference on Acoustics, Speech, and Signal Processing, Publisher: IEEE, Pages: 5225-5229, ISSN: 1520-6149

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2016, UNDER-MODELLED BLIND SYSTEM IDENTIFICATION FOR TIME DELAY ESTIMATION IN REVERBERANT ENVIRONMENTS, 15th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE

CONFERENCE PAPER

Xue W, Brookes M, Naylor PA, 2016, Cross-Correlation Based Under-Modelled Multichannel Blind Acoustic System Identification with Sparsity Regularization, 24th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 718-722, ISSN: 2076-1465

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, Betts D, Hicks CM, Dmour MA, Jensen SHet al., 2015, SINGLE-CHANNEL BLIND ESTIMATION OF REVERBERATION PARAMETERS, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 31-35, ISSN: 1520-6149

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, Betts D, Hicks CM, Dmour MA, Jensen SHet al., 2015, SINGLE-CHANNEL BLIND ESTIMATION OF REVERBERATION PARAMETERS, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 31-35, ISSN: 1520-6149

CONFERENCE PAPER

Doire CSJ, Brookes M, Naylor PA, Jensen SHet al., 2015, DATA-DRIVEN STATISTICAL MODELLING OF ROOM IMPULSE RESPONSES IN THE POWER DOMAIN, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2466-2470, ISSN: 2076-1465

CONFERENCE PAPER

Hu M, Doclo S, Sharma D, Brookes M, Naylor PAet al., 2015, NOISE ROBUST BLIND SYSTEM IDENTIFICATION ALGORITHMS BASED ON A RAYLEIGH QUOTIENT COST FUNCTION, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2476-2480, ISSN: 2076-1465

CONFERENCE PAPER

Hu M, Parada PP, Sharma D, Doclo S, van Waterschoot T, Brookes M, Naylor PAet al., 2015, SINGLE-CHANNEL SPEAKER DIARIZATION BASED ON SPATIAL FEATURES, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168

CONFERENCE PAPER

Hu M, Sharma D, Doclo S, Brookes M, Naylor PAet al., 2015, SPEAKER CHANGE DETECTION AND SPEAKER DIARIZATION USING SPATIAL INFORMATION, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5743-5747, ISSN: 1520-6149

CONFERENCE PAPER

Hu M, Sharma D, Doclo S, Brookes M, Naylor PAet al., 2015, SPEAKER CHANGE DETECTION AND SPEAKER DIARIZATION USING SPATIAL INFORMATION, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5743-5747, ISSN: 1520-6149

CONFERENCE PAPER

Lightburn L, Brookes M, 2015, SOBM - A BINARY MASK FOR NOISY SPEECH THAT OPTIMISES AN OBJECTIVE INTELLIGIBILITY METRIC, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5078-5082, ISSN: 1520-6149

CONFERENCE PAPER

Lightburn L, Brookes M, 2015, SOBM - A BINARY MASK FOR NOISY SPEECH THAT OPTIMISES AN OBJECTIVE INTELLIGIBILITY METRIC, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5078-5082, ISSN: 1520-6149

CONFERENCE PAPER

This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.

Request URL: http://wlsprd.imperial.ac.uk:80/respub/WEB-INF/jsp/search-html.jsp Request URI: /respub/WEB-INF/jsp/search-html.jsp Query String: respub-action=search.html&id=00000744&limit=30&person=true