Imperial College London

Dr Patrick A. Naylor

Faculty of EngineeringDepartment of Electrical and Electronic Engineering

Professor of Speech & Acoustic Signal Processing
 
 
 
//

Contact

 

+44 (0)20 7594 6235p.naylor Website

 
 
//

Location

 

803Electrical EngineeringSouth Kensington Campus

//

Summary

 

Publications

Publication Type
Year
to

328 results found

Zahedi A, Ostergaard J, Jensen SH, Bech S, Naylor Pet al., 2015, Audio coding in wireless acoustic sensor networks, SIGNAL PROCESSING, Vol: 107, Pages: 141-152, ISSN: 0165-1684

Journal article

Hu M, sharma D, Doclo S, Brookes D, naylor Pet al., Speaker change detection and speaker diarization using spatial information, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Conference paper

Cauchi B, Naylor PA, Gerkmann T, Doclo S, Goetze Set al., 2015, LATE REVERBERANT SPECTRAL VARIANCE ESTIMATION USING ACOUSTIC CHANNEL EQUALIZATION, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2481-2485, ISSN: 2076-1465

Conference paper

Javed HA, Naylor PA, 2015, AN EXTENDED REVERBERATION DECAY TAIL METRIC AS A MEASURE OF PERCEIVED LATE REVERBERATION, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1063-1067, ISSN: 2076-1465

Conference paper

Nelke CM, Naylor PA, Vary P, 2015, CORPUS BASED RECONSTRUCTION OF SPEECH DEGRADED BY WIND NOISE, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 864-868, ISSN: 2076-1465

Conference paper

Hafezi S, Moore AH, Naylor PA, 2015, MODELING SOURCE DIRECTIVITY IN ROOM IMPULSE RESPONSE SIMULATION FOR SPHERICAL MICROPHONE ARRAYS, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 574-578, ISSN: 2076-1465

Conference paper

Doire CSJ, Brookes M, Naylor PA, Betts D, Hicks CM, Dmour MA, Jensen SHet al., 2015, SINGLE-CHANNEL BLIND ESTIMATION OF REVERBERATION PARAMETERS, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 31-35, ISSN: 1520-6149

Conference paper

Zahedi A, Ostergaard J, Jensen SH, Naylor P, Bech Set al., 2015, Coding and Enhancement in Wireless Acoustic Sensor Networks, Data Compression Conference (DCC), Publisher: IEEE, Pages: 293-302, ISSN: 1068-0314

Conference paper

Hu M, Sharma D, Doclo S, Brookes M, Naylor PAet al., 2015, SPEAKER CHANGE DETECTION AND SPEAKER DIARIZATION USING SPATIAL INFORMATION, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5743-5747, ISSN: 1520-6149

Conference paper

Eaton J, Moore AH, Naylor PA, Skoglund Jet al., 2015, DIRECT-TO-REVERBERANT RATIO ESTIMATION USING A NULL-STEERED BEAMFORMER, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 46-50, ISSN: 1520-6149

Conference paper

Sharma D, Poddar A, Manna S, Naylor PAet al., 2015, THE SAS PROJECT: SPEECH SIGNAL PROCESSING IN HIGH SCHOOL EDUCATION, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1781-1785, ISSN: 2076-1465

Conference paper

Doire CSJ, Brookes M, Naylor PA, Betts D, Hicks CM, Dmour MA, Jensen SHet al., 2015, SINGLE-CHANNEL BLIND ESTIMATION OF REVERBERATION PARAMETERS, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 31-35, ISSN: 1520-6149

Conference paper

Lim F, Naylor PA, Thomas MRP, Tashev IJet al., 2015, ACOUSTIC BLUR KERNEL WITH SLIDING WINDOW FOR BLIND ESTIMATION OF REVERBERATION TIME, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168

Conference paper

Hu M, Parada PP, Sharma D, Doclo S, van Waterschoot T, Brookes M, Naylor PAet al., 2015, SINGLE-CHANNEL SPEAKER DIARIZATION BASED ON SPATIAL FEATURES, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168

Conference paper

Lim F, Zhang W, Habets EAP, Naylor PAet al., 2014, Robust Multichannel Dereverberation using Relaxed Multichannel Least Squares, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 22, Pages: 1379-1390, ISSN: 2329-9290

Journal article

Evers C, Moore AH, Naylor PA, 2014, Multiple source localisation in the spherical harmonic domain

Conference paper

Moore AH, Naylor PA, Skoglund J, An Analysis of the Effect of Larynx-Synchronous Averaging on Dereverberation of Voiced Speech, European Signal Processing Conference, ISSN: 2219-5491

Conference paper

Eaton J, Naylor PA, 2014, Detection of clipping in coded speech signals, 21st European Signal Processing Conference (EUSIPCO), Publisher: IEEE

In order to exploit the full dynamic range of communicationsand recording equipment, and to minimise the effects of noiseand interference, input gain to a recording device is typicallyset as high as possible. This often leads to the signal exceedingthe input limit of the equipment resulting in clipping. Com-munications devices typically rely on codecs such as GSM06.10to compress voice signals into lower bitrates. Althoughdetecting clipping in a hard-clipped speech signal is straight-forward due to the characteristic flattening of the peaks of thewaveform, this is not the case for speech that has subsequentlypassed through a codec. We describe a novel clipping detec-tion algorithm based on amplitude histogram analysis and leastsquares residuals which can estimate the clipped samples andthe original signal level in speech even after the clipped speechhas been perceptually coded.

Conference paper

Jarrett DP, Taseska M, Habets EAP, Naylor PAet al., 2014, Noise Reduction in the Spherical Harmonic Domain Using a Tradeoff Beamformer and Narrowband DOA Estimates, IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol: 22, Pages: 965-976

Journal article

Eaton J, Naylor PA, 2014, Noise-robust detection of peak-clipping in decoded speech, Pages: 7019-7023

Clipping is a commonplace problem in voice telecommunications and detection of clipping is useful in a range of speech processing applications. We analyse and evaluate the performance of three previously presented algorithms for clipping detection in decoded speech in high levels of ambient noise. We identify a baseline method which is well known for clipping detection, determine experimentally the optimized operation parameter for the baseline approach, and use this in our experiments. Our results indicate that the new algorithms outperform the baseline except at extreme levels of clipping and negative signal-to-noise ratios.

Conference paper

Stanton R, Gaubitch N, Naylor P, Brookes DMet al., A Differentiable Approximation to Speech Intelligibility Index with Applications to Listening Enhancement, AES Intl Conf on Audio Forensics

The Speech Intelligibility Index is a standardised objective measure for estimating the intelligibility of speech in noise. It is, however difficult to use it in the iterative optimisation of speech enhancement algorithms because it is a discontinuous function of its input parameters. In this paper, we derive an approximation for the Speech Intelligibility Index that is both continuous and differentiable, which allows for more efficient optimisation procedures. The use of the approximation is demonstrated in an application to near-end speech enhancement.

Conference paper

Antonello N, van Waterschool T, Moonen M, Naylor PAet al., 2014, SOURCE LOCALIZATION AND SIGNAL RECONSTRUCTION IN A REVERBERANT FIELD USING THE FDTD METHOD, 22nd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 301-305, ISSN: 2076-1465

Conference paper

Costa MH, Naylor PA, 2014, ILD PRESERVATION IN THE MULTICHANNEL WIENER FILTER FOR BINAURAL HEARING AID APPLICATIONS, 22nd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 636-640, ISSN: 2076-1465

Conference paper

Parada PP, Sharma D, Naylor PA, 2014, Non-intrusive estimation of the level of reverberation in speech, Pages: 4718-4722, ISSN: 1520-6149

We show corroborating evidence that, among a set of common acoustic parameters, the clarity index C50 provides a measure of reverberation that is well correlated with speech recognition accuracy. We also present a data driven method for non-intrusive C50 parameter estimation from a single channel speech signal. The method extracts a number of features from the speech signal and uses a binary regression tree, trained on appropriate training data, to estimate the C50. Evaluation is carried out using speech utterances convolved with real and simulated room impulse responses, and additive babble noise. The new method outperforms a baseline approach in our evaluation. © 2014 IEEE.

Conference paper

Zahedi A, Østergaard J, Jensen SH, Naylor P, Bech Set al., 2014, Distributed remote vector gaussian source coding for wireless acoustic sensor networks, Pages: 263-272, ISSN: 1068-0314

In this paper, we consider the problem of remote vector Gaussian source coding for a wireless acoustic sensor network. Each node receives messages from multiple nodes in the network and decodes these messages using its own measurement of the sound field as side information. The node's measurement and the estimates of the source resulting from decoding the received messages are then jointly encoded and transmitted to a neighbouring node in the network. We show that for this distributed source coding scenario, one can encode a so-called conditional sufficient statistic of the sources instead of jointly encoding multiple sources. We focus on the case where node measurements are in form of noisy linearly mixed combinations of the sources and the acoustic channel mixing matrices are invertible. For this problem, we derive the rate-distortion function for vector Gaussian sources and under covariance distortion constraints. © 2014 IEEE.

Conference paper

Sharma D, Meredith L, Lainez J, Barreda D, Naylor PAet al., 2014, A Non-Intrusive PESQ Measure, IEEE Global Conference on Signal and Information Processing (GlobalSIP), Publisher: IEEE, Pages: 975-978

Conference paper

Parada PP, Sharma D, Naylor PA, van Waterschoot Tet al., 2014, REVERBERANT SPEECH RECOGNITION: A PHONEME ANALYSIS, IEEE Global Conference on Signal and Information Processing (GlobalSIP), Publisher: IEEE, Pages: 567-571

Conference paper

Zahedi A, Ostergaard J, Jensen SH, Naylor P, Bech Set al., 2014, Distributed Remote Vector Gaussian Source Coding with Covariance Distortion Constraints, IEEE International Symposium on Information Theory (ISIT), Publisher: IEEE, Pages: 586-590

Conference paper

Thomas MRP, Tashev IJ, Lim F, Naylor PAet al., 2014, OPTIMAL BEAMFORMING AS A TIME DOMAIN EQUALIZATION PROBLEM WITH APPLICATION TO ROOM ACOUSTICS, 14th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 75-79

Conference paper

Lim F, Naylor PA, 2014, STATISTICAL MODELLING OF MULTICHANNEL BLIND SYSTEM IDENTIFICATION ERRORS, 14th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 119-123

Conference paper

This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.

Request URL: http://wlsprd.imperial.ac.uk:80/respub/WEB-INF/jsp/search-html.jsp Request URI: /respub/WEB-INF/jsp/search-html.jsp Query String: id=00004259&limit=30&person=true&page=4&respub-action=search.html