Imperial College London

Dr Patrick A. Naylor

Faculty of EngineeringDepartment of Electrical and Electronic Engineering

Reader in Speech & Audio Signal Processing
 
 
 
//

Contact

 

+44 (0)20 7594 6235p.naylor Website

 
 
//

Location

 

803Electrical EngineeringSouth Kensington Campus

//

Summary

 

Publications

Publication Type
Year
to

322 results found

Hogg A, Naylor P, Evers C, Speaker change detection using fundamental frequency with application to multi-talker segmentation, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE

This paper shows that time varying pitch properties can be used advantageously within the segmentation step of a multi-talker diarization system. First a study is conducted to verify that changes in pitch are strong indicators of changes in the speaker. It is then highlighted that an individual’s pitch is smoothly varying and, therefore, can be predicted by means of a Kalman filter. Subsequently it is shown that if the pitch is not predictable then this is most likely due to a change in the speaker. Finally, a novel system is proposed that uses this approach of pitch prediction for speaker change detection. This system is then evaluated against a commonly used MFCC segmentation system. The proposed system is shown to increase the speaker change detection rate from 43.3% to 70.5% on meetings in the AMI corpus. Therefore, there are two equally weighted contributions in this paper: 1. We address the question of whether a change in pitch is a reliable estimator of a speaker change in multi-talk meeting audio. 2. We develop a method to extract such speaker changes and test them on a widely available meeting corpus.

CONFERENCE PAPER

Neo Weisheng V, Naylor PA, Second order sequential best rotation algorithm with householder reduction for polynomial matrix eigenvalue decomposition, IEEE International Conference on Acoustics, Speech and Signal Processing, Publisher: IEEE, ISSN: 0736-7791

The Second-order Sequential Best Rotation (SBR2) algorithm, usedfor Eigenvalue Decomposition (EVD) on para-Hermitian polynomialmatrices typically encountered in wideband signal processingapplications like multichannel Wiener filtering and channel coding,involves a series of delay and rotation operations to achieve diagonalisation.In this paper, we proposed the use of Householder transformationsto reduce polynomial matrices to tridiagonal form beforezeroing the dominant element with rotation. Similar to performingHouseholder reduction on conventional matrices, our methodenables SBR2 to converge in fewer iterations with smaller orderof polynomial matrix factors because more off-diagonal Frobeniusnorm(F-norm) could be transferred to the main diagonal at everyiteration. A reduction in the number of iterations by 12.35% and0.1% improvement in reconstruction error is achievable.

CONFERENCE PAPER

Gannot S, Naylor PA, 2019, Highlights from the Audio and Acoustic Signal Processing Technical Committee [In the Spotlight], IEEE Signal Processing Magazine, Vol: 36, ISSN: 1053-5888

© 1991-2012 IEEE. The IEEE Audio and Acoustic Signal Processing Technical Committee (AASP TC) is one of 13 TCs in the IEEE Signal Processing Society. Its mission is to support, nourish, and lead scientific and technological development in all areas of AASP. These areas are currently seeing increased levels of interest and significant growth, providing a fertile ground for a broad range of specific and interdisciplinary research and development. Ranging from array processing for microphones and loudspeakers to music genre classification, from psychoacoustics to machine learning (ML), from consumer electronics devices to blue-sky research, this scope encompasses countless technical challenges and many hot topics. The TC has roughly 30 elected volunteer members drawn equally from leading academic and industrial organizations around the world, unified by the common aim of offering their expertise in the service of the scientific community.

JOURNAL ARTICLE

Moore AH, Xue W, Naylor PA, Brookes Met al., 2019, Noise Covariance Matrix Estimation for Rotating Microphone Arrays, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 27, Pages: 519-530, ISSN: 2329-9290

JOURNAL ARTICLE

Moore AH, Xue W, Naylor PA, Brookes Met al., 2019, Estimation of the Noise Covariance Matrix for Rotating Sensor Arrays, Pages: 1936-1941, ISSN: 1058-6393

© 2018 IEEE. Estimation of the noise covariance matrix is addressed for a sensor array that rotates during desired source activity. Applications include beamformer design for head-mounted microphone arrays in assistive hearing devices. We propose a parametric model which leads to an analytical expression for the sensor signal covariance as a function of the array orientation and array manifold. The model allows the estimated noise covariance matrix to be updated in response to array rotation even during desired source activity. Simulation results demonstrate the efficacy of the method compared to a conventional, recursive estimation approach in which the estimate is not updated during desired source activity. The proposed method yields 18 dB lower error in the estimated noise covariance matrix and the resulting beamformer achieves noise reduction which is within 0.2 dB of an oracle beamformer.

CONFERENCE PAPER

Moore A, de Haan JM, Pedersen MS, Naylor P, Brookes D, Jensen Jet al., Personalized {HRTF}s for hearing aids, ELOBES2019

CONFERENCE PAPER

Brookes D, Lightburn L, Moore A, Naylor P, Xue Wet al., 2019, Mask-assisted speech enhancement for binaural hearing aids, ELOBES2019

CONFERENCE PAPER

Xue W, Moore AH, Brookes M, Naylor PAet al., 2018, Modulation-Domain Multichannel Kalman Filtering for Speech Enhancement, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 26, Pages: 1833-1847, ISSN: 2329-9290

JOURNAL ARTICLE

Evers C, Naylor PA, 2018, Acoustic SLAM, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 26, Pages: 1484-1498, ISSN: 2329-9290

JOURNAL ARTICLE

Evers C, Habets EAP, Gannot S, Naylor PAet al., 2018, DoA Reliability for Distributed Acoustic Tracking, IEEE SIGNAL PROCESSING LETTERS, Vol: 25, Pages: 1320-1324, ISSN: 1070-9908

JOURNAL ARTICLE

Evers C, Naylor PA, 2018, Optimized Self-Localization for SLAM in Dynamic Scenes Using Probability Hypothesis Density Filters, IEEE TRANSACTIONS ON SIGNAL PROCESSING, Vol: 66, Pages: 863-878, ISSN: 1053-587X

JOURNAL ARTICLE

Evers C, Loellmann HW, Mellmann H, Schmidt A, Barfuss H, Naylor PA, Kellermann Wet al., 2018, LOCATA CHALLENGE - EVALUATION TASKS AND MEASURES, 16th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 565-569, ISSN: 2639-4316

CONFERENCE PAPER

Hafezi S, Moore AH, Naylor PA, 2018, ROBUST SOURCE COUNTING AND ACOUSTIC DOA ESTIMATION USING DENSITY-BASED CLUSTERING, 10th IEEE Sensor Array and Multichannel Signal Processing Workshop (SAM), Publisher: IEEE, Pages: 395-399, ISSN: 1551-2282

CONFERENCE PAPER

Antonello N, De Sena E, Moonen M, Naylor PA, van Waterschoot Met al., 2018, JOINT SOURCE LOCALIZATION AND DEREVERBERATION BY SOUND FIELD INTERPOLATION USING SPARSE REGULARIZATION, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, Pages: 6892-6896

CONFERENCE PAPER

Yiallourides C, Moore AH, Auvinet E, Van der Straeten C, Naylor PAet al., 2018, ACOUSTIC ANALYSIS AND ASSESSMENT OF THE KNEE IN OSTEOARTHRITIS DURING WALKING, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, Pages: 281-285

CONFERENCE PAPER

Dawson PJ, De Sena E, Naylor PA, 2018, An acoustic image-source characterisation of surface profiles, European Signal Processing Conference (EUSIPCO), Publisher: IEEE COMPUTER SOC, Pages: 2130-2134, ISSN: 2076-1465

CONFERENCE PAPER

Sharma D, Nour-Eldin A, Harding P, Karimian-Azari S, Naylor PAet al., 2018, ROBUST FEATURE EXTRACTION FROM AD-HOC MICROPHONES FOR MEETING DIARIZATION, 16th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 296-300, ISSN: 2639-4316

CONFERENCE PAPER

Moore AH, Lightburn L, Xue W, Naylor PA, Brookes Met al., 2018, BINAURAL MASK-INFORMED SPEECH ENHANCEMENT FOR HEARING AIDS WITH HEAD TRACKING, 16th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 461-465, ISSN: 2639-4316

CONFERENCE PAPER

Loellmann HW, Evers C, Schmidt A, Mellmann H, Barfuss H, Naylor PA, Kellermann Wet al., 2018, The LOCATA Challenge Data Corpus for Acoustic Source Localization and Tracking, 10th IEEE Sensor Array and Multichannel Signal Processing Workshop (SAM), Publisher: IEEE, Pages: 410-414, ISSN: 1551-2282

CONFERENCE PAPER

Moore AH, Naylor PA, Brookes M, 2018, ROOM IDENTIFICATION USING FREQUENCY DEPENDENCE OF SPECTRAL DECAY STATISTICS, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, Pages: 6902-6906

CONFERENCE PAPER

Xue W, Moore AH, Brookes M, Naylor PAet al., 2018, Modulation-Domain Parametric Multichannel Kalman Filtering for Speech Enhancement, European Signal Processing Conference (EUSIPCO), Publisher: IEEE COMPUTER SOC, Pages: 2509-2513, ISSN: 2076-1465

CONFERENCE PAPER

Xue W, Moore AH, Brookes M, Naylor PAet al., 2018, MULTICHANNEL KALMAN FILTERING FOR SPEECH EHNANCEMENT, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, Pages: 41-45

CONFERENCE PAPER

De Sena E, Brookes M, Naylor PA, van Waterschoot Tet al., 2017, Localization Experiments with Reporting by Head Orientation: Statistical Framework and Case Study, JOURNAL OF THE AUDIO ENGINEERING SOCIETY, Vol: 65, Pages: 982-996, ISSN: 1549-4950

JOURNAL ARTICLE

Moore AH, Parada PP, Naylor PA, 2017, Speech enhancement for robust automatic speech recognition: Evaluation using a baseline system and instrumental measures, COMPUTER SPEECH AND LANGUAGE, Vol: 46, Pages: 574-584, ISSN: 0885-2308

JOURNAL ARTICLE

Hafezi S, Moore AH, Naylor PA, 2017, Augmented Intensity Vectors for Direction of Arrival Estimation in the Spherical Harmonic Domain, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 25, Pages: 1956-1968, ISSN: 2329-9290

JOURNAL ARTICLE

Antonello N, De Sena E, Moonen M, Naylor PA, Van Waterschoot Tet al., 2017, Room Impulse Response Interpolation Using a Sparse Spatio-Temporal Representation of the Sound Field, IEEE/ACM Transactions on Audio Speech and Language Processing, Vol: 25, Pages: 1929-1941, ISSN: 2329-9290

© 2017 IEEE. Room Impulse Responses (RIRs) are typically measured using a set of microphones and a loudspeaker. When RIRs spanning a large volume are needed, many microphone measurements must be used to spatially sample the sound field. In order to reduce the number of microphone measurements, RIRs can be spatially interpolated. In the present study, RIR interpolation is formulated as an inverse problem. This inverse problem relies on a particular acoustic model capable of representing the measurements. Two different acoustic models are compared: the plane wave decomposition model and a novel time-domain model, which consists of a collection of equivalent sources creating spherical waves. These acoustic models can both approximate any reverberant sound field created by a far-field sound source. In order to produce an accurate RIR interpolation, sparsity regularization is employed when solving the inverse problem. In particular, by combining different acoustic models with different sparsity promoting regularizations, spatial sparsity, spatio-spectral sparsity, and spatio-temporal sparsity are compared. The inverse problem is solved using a matrix-free large-scale optimization algorithm. Simulations show that the best RIR interpolation is obtained when combining the novel time-domain acoustic model with the spatio-temporal sparsity regularization, outperforming the results of the plane wave decomposition model even when far fewer microphone measurements are available.

JOURNAL ARTICLE

Doire CSJ, Brookes M, Naylor PA, 2017, Robust and efficient Bayesian adaptive psychometric function estimation, JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, Vol: 141, Pages: 2501-2512, ISSN: 0001-4966

JOURNAL ARTICLE

Doire CSJ, Brookes M, Naylor PA, Hicks CM, Betts D, Dmour MA, Jensen SHet al., 2017, Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 25, Pages: 572-587, ISSN: 2329-9290

JOURNAL ARTICLE

Moore AH, Brookes M, Naylor PA, 2017, ROBUST SPHERICAL HARMONIC DOMAIN INTERPOLATION OF SPATIALLY SAMPLED ARRAY MANIFOLDS, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 521-525, ISSN: 1520-6149

CONFERENCE PAPER

Hafezi S, Moore AH, Naylor PA, 2017, MULTIPLE SOURCE LOCALIZATION USING ESTIMATION CONSISTENCY IN THE TIME-FREQUENCY DOMAIN, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 516-520, ISSN: 1520-6149

CONFERENCE PAPER

This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.

Request URL: http://wlsprd.imperial.ac.uk:80/respub/WEB-INF/jsp/search-html.jsp Request URI: /respub/WEB-INF/jsp/search-html.jsp Query String: respub-action=search.html&id=00004259&limit=30&person=true