Publications

Conference paper
Lim F, Naylor PA, 2012,
Relaxed multichannel least squares with constrained initial taps for multichannel dereverberation

This paper presents a novel algorithm for robust multichannel dereverberation in the presence of system identification errors with the specific aim of avoiding colouration of the equalized signal. Our proposed algorithm is based upon the technique of channel shortening, which targets only the late taps of the room impulse response. Within the framework of the relaxed multichannel least squares (RMCLS) algorithm, we employ partial relaxation of the early taps of the equalized impulse response (EIR) to increase robustness to channel estimation errors, while constraining the initial taps to avoid undesirable colouration of the equalized signal. It is shown through quantitative experimental results that the resultant equalized signal has an overall improved speech quality perception when compared to alternative algorithms.
- Citations: 8
Conference paper
Gaubitch ND, Löllmann HW, Jeub M, Falk TH, Naylor PA, Vary P, Brookes M et al., 2012,
Performance comparison of algorithms for blind reverberation time estimation from speech

The reverberation time, T60, is one of the key parameters used to quantify room acoustics. It can provide information about the quality and intelligibility of speech recorded in a reverberant environment, and it can be used to increase robustness to reverberation of speech processing algorithms. T60 can be determined directly from a measurement of the acoustic impulse response, but in situations where this is unavailable it must be estimated blindly from reverberant speech. In this contribution, we provide a study of three state-of-the-art methods for blind T60 estimation. Experimental results with a large number of talkers, simulated and measured acoustic impulse responses, and various levels of additive white Gaussian noise are presented. The relative merits of the three methods in terms of computational time, estimation accuracy, noise sensitivity and inter-talker variance are discussed. In general, all three methods are able to estimate the reverberation time to within 0.2 s for T60 ≤ 0.8 s and SNR ≥ 30 dB, while increasing the noise level causes overestimation. The relative computational speed of the three methods is also assessed.
- Citations: 48
Conference paper
Naylor PA, Gaubitch ND, 2012,
Acoustic signal processing in noise: It's not getting any quieter

Acoustic signal processing research has been addressing the issues associated with additive noise and other degradations in speech for many years and several significant technical advances are now embedded in the state-of-the-art. Nevertheless, the problems are not solved and may actually be worsening. The philosophy advocated in this paper is that further improvements in acoustic signal processing for noise reduction and robustness are, of course, important but are unlikely to be sufficient on their own. Alongside the signal processing, successful systems are likely going to need to include two further factors: an element of matching to the human perception system and also an element of sensing and adaptation to the local environment, giving systems acoustic awareness. Examples of current research on human perception and acoustic signal processing are discussed. These include some aspects of auditory cognition and signal processing methods for building acoustic awareness. A new initiative for benchmarking is also highlighted.
- Citations: 11
Conference paper
Canclini A, Antonacci F, Filos J, Sarti A, Naylor P et al., 2012,
Exact localization of planar acoustic reflectors in three-dimensional geometries

In this paper we propose a methodology for localizing acoustic planar reflectors in a 3D geometry, using acoustic measurements acquired by a set of microphones. An acoustic source emitting a known signal is placed close to the wall to be identified, and is used for estimating the source-to-microphone impulse responses. In a preliminary step, such estimates are employed for localizing the source. After that, the Times Of Arrival (TOAs) associated with the first order reflective paths are extracted from the impulse responses and converted into quadratic constraints (ellipsoids) acting on the reflective plane. The constraints are then collected into a cost function, whose exact minimization leads to the searched plane. A theoretical analysis is performed for predicting the impact of measurement errors on the estimation. Moreover, experimental results in a real meeting room prove the effectiveness of the method.
- Citations: 6
Conference paper
Sharma D, Hilkhuysen G, Naylor PA, Gaubitch ND, Huckvale M, Brookes M et al., 2012,
Descriptive Vocabulary Development for Degraded Speech
, 13th Annual Conference of the International-Speech-Communication-Association, Publisher: ISCA-INT SPEECH COMMUNICATION ASSOC, Pages: 1494-1497
Journal article
Antonacci F, Filos J, Thomas M, Habets EAP, Sarti A, Naylor PA et al., 2012,
Inference of room geometry from acoustic impulse responses
, IEEE Trans. Audio Speech Language Proc., Vol: 20, Pages: 2683-2695
Journal article
Annibale P, Filos J, Naylor PA, Rabenstein R et al., 2012,
TDOA-based speed of sound estimation for air temperature and room geometry inference
, IEEE Trans. Audio, Speech, Lang. Process.
Journal article
Jarrett D, Habets EAP, Thomas M, Naylor PA et al., 2012,
Rigid sphere room impulse response simulation: algorithm and applications
, J. Acoust. Soc. America, Vol: 132
Conference paper
Thomas MRP, Gaubitch ND, Habets EAP, Naylor PA et al., 2012,
AN INSIGHT INTO COMMON FILTERING IN NOISY SIMO BLIND SYSTEM IDENTIFICATION
, IEEE International Conference on Acoustics, Speech and Signal Processing, Publisher: IEEE, Pages: 521-524, ISSN: 1520-6149
- Citations: 2
Conference paper
Sharma D, Naylor PA, Gaubitch ND, Brookes M et al., 2012,
NON INTRUSIVE CODEC IDENTIFICATION ALGORITHM
, IEEE International Conference on Acoustics, Speech and Signal Processing, Publisher: IEEE, Pages: 4477-4480, ISSN: 1520-6149
- Citations: 7
Conference paper
Filos J, Canclini A, Antonacci F, Sarti A, Naylor PA et al., 2012,
LOCALIZATION OF PLANAR ACOUSTIC REFLECTORS FROM THE COMBINATION OF LINEAR ESTIMATES
, 20th European Signal Processing Conference (EUSIPCO), Publisher: IEEE COMPUTER SOC, Pages: 1019-1023, ISSN: 2076-1465
- Citations: 14
Conference paper
Annibale P, Antonacci F, Bestagini P, Brutti A, Canclini A, Cristoforetti L, Habets EAP, Filos J, Kellermann W, Kowalczyk K, Lombard A, Mabande E, Markovic D, Naylor PA, Omologo M, Rabenstein R, Sarti A, Svaizer P, Thomas MRP et al., 2011,
The SCENIC Project: Space-Time Audio Processing for Environment-Aware Acoustic Sensing and Rendering
Journal article
Slaney M, Naylor PA, 2011,
Audio and Acoustic Signal Processing
, IEEE SIGNAL PROCESSING MAGAZINE, Vol: 28, Pages: 160-U26, ISSN: 1053-5888
Conference paper
Loganathan P, Habets EAP, Naylor PA, 2011,
A Proportionate Adaptive Algorithm with Variable Partitioned Block Length for Acoustic Echo Cancellation
Conference paper
Jarrett DP, Thomas MR, Habets EAP, Naylor PA et al., 2011,
Simulating Room Impulse Responses for Spherical Microphone Arrays
Conference paper
Sharma D, Hilkhuysen G, Gaubitch ND, Brookes M, Naylor PA et al., 2011,
C-Qual - A validation of PESQ using degradations encountered in forensic and law enforcement audio
, Pages: 177-181
Assessment of speech quality of law-enforcement audio recordings is important as degradations introduced by non-ideal recording conditions can reduce the intelligence value of such recordings. Furthermore a model that predicts speech quality could be beneficial for assessing the performance of audio collection and enhancement systems. The Perceptual Evaluation of Speech Quality (PESQ) algorithm (ITU-T P.862) has been validated for degradations common in telecommunications. In this paper we apply PESQ to degradations typically encountered in law-enforcement. Also we present a subjectively labeled database (C-Qual) containing distortions encountered in law enforcement scenarios. Comparing the prediction by PESQ and the observed opinions provided by the listeners shows that PESQ is less suitable for estimating the speech quality in this context.
- Citations: 1
Conference paper
Gaubitch ND, Brookes M, Naylor PA, Sharma D et al., 2011,
Bayesian Adaptive method for estimating Speech Intelligibility in noise
, Pages: 169-174
We present the Bayesian Adaptive Speech Intelligibility Estimation (BASIE) method - a tool for rapid estimation of a given speech reception threshold (SRT) and the slope at that threshold of multiple psychometric functions for speech intelligibility in noise. The core of this tool is an adaptive Bayesian procedure, which adjusts the signal-to-noise ratio at each subsequent stimulus such that the expected variance of the threshold and slope estimates is minimised. Simulation results show that the algorithm is able to achieve SRT estimates accurate to within ±1 dB in under 30 iterations. Furthermore, we discuss strategies for using BASIE to evaluate the effects of speech processing algorithms on intelligibility and we give two illustrative examples for different noise reduction methods with supporting listening experiments.
- Citations: 1
Conference paper
Gaubitch ND, Brookes M, Naylor PA, Sharma D et al., 2011,
Single-Microphone Blind Channel Identification in Speech Using Spectrum Classification
Journal article
Gudnason J, Thomas MRP, Ellis DPW, Naylor PA et al., 2011,
Data-Driven Voice Source Waveform Analysis and Synthesis
, Speech Communication, Vol: to appear
Conference paper
Habets EAP, Benesty J, Naylor PA, 2011,
A Cross-Relation Based Affine Projection Algorithm for Blind SIMO System Identification

This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.

Contact us

Address

Speech and Audio Processing Lab
CSP Group, EEE Department
Imperial College London

Exhibition Road, London, SW7 2AZ, United Kingdom

Email

p.naylor@imperial.ac.uk
