Costa MH, Naylor PA, 2014, ILD PRESERVATION IN THE MULTICHANNEL WIENER FILTER FOR BINAURAL HEARING AID APPLICATIONS, 22nd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 636-640, ISSN: 2076-1465
Parada PP, Sharma D, Naylor PA, 2014, Non-intrusive estimation of the level of reverberation in speech, Pages: 4718-4722, ISSN: 1520-6149
We show corroborating evidence that, among a set of common acoustic parameters, the clarity index C50 provides a measure of reverberation that is well correlated with speech recognition accuracy. We also present a data-driven method for non-intrusive C50 parameter estimation from a single-channel speech signal. The method extracts a number of features from the speech signal and uses a binary regression tree, trained on appropriate training data, to estimate the C50. Evaluation is carried out using speech utterances convolved with real and simulated room impulse responses, and additive babble noise. The new method outperforms a baseline approach in our evaluation. © 2014 IEEE.
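For orientation, C50 is conventionally defined as the ratio, in dB, of the energy in the first 50 ms of a room impulse response to the energy that arrives afterwards. The sketch below computes that reference value directly from a known impulse response. It is not the non-intrusive regression-tree estimator described in the abstract. The function name `clarity_index_c50` and the choice of the largest-magnitude sample as the direct-path arrival are assumptions made for illustration.

```python
import numpy as np


def clarity_index_c50(rir, fs, early_ms=50.0):
    """Return C50 in dB for a room impulse response `rir` sampled at `fs` Hz.

    Hypothetical helper: computes the intrusive (reference) C50, which
    requires the impulse response itself.
    """
    rir = np.asarray(rir, dtype=float)
    # Assumption: the direct path is the largest-magnitude sample, and the
    # 50 ms boundary is measured from there.
    start = int(np.argmax(np.abs(rir)))
    split = start + int(round(early_ms * 1e-3 * fs))
    early_energy = np.sum(rir[start:split] ** 2)
    late_energy = np.sum(rir[split:] ** 2)
    return 10.0 * np.log10(early_energy / late_energy)


if __name__ == "__main__":
    # Synthetic exponentially decaying response, for demonstration only.
    fs = 16000
    n = np.arange(fs)  # one second of samples
    rir = np.exp(-3.0 * np.log(10.0) * n / (0.5 * fs))  # 0.5 s decay time
    print(f"C50 = {clarity_index_c50(rir, fs):.2f} dB")
```

A larger C50 means more of the energy arrives early, which is generally associated with clearer speech.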
Zahedi A, Østergaard J, Jensen SH, et al., 2014, Distributed remote vector Gaussian source coding for wireless acoustic sensor networks, Pages: 263-272, ISSN: 1068-0314
In this paper, we consider the problem of remote vector Gaussian source coding for a wireless acoustic sensor network. Each node receives messages from multiple nodes in the network and decodes these messages using its own measurement of the sound field as side information. The node's measurement and the estimates of the source resulting from decoding the received messages are then jointly encoded and transmitted to a neighbouring node in the network. We show that for this distributed source coding scenario, one can encode a so-called conditional sufficient statistic of the sources instead of jointly encoding multiple sources. We focus on the case where node measurements are in the form of noisy linearly mixed combinations of the sources and the acoustic channel mixing matrices are invertible. For this problem, we derive the rate-distortion function for vector Gaussian sources under covariance distortion constraints. © 2014 IEEE.
Sharma D, Meredith L, Lainez J, et al., 2014, A Non-Intrusive PESQ Measure, IEEE Global Conference on Signal and Information Processing (GlobalSIP), Publisher: IEEE, Pages: 975-978
Parada PP, Sharma D, Naylor PA, et al., 2014, REVERBERANT SPEECH RECOGNITION: A PHONEME ANALYSIS, IEEE Global Conference on Signal and Information Processing (GlobalSIP), Publisher: IEEE, Pages: 567-571
Zahedi A, Østergaard J, Jensen SH, et al., 2014, Distributed Remote Vector Gaussian Source Coding with Covariance Distortion Constraints, IEEE International Symposium on Information Theory (ISIT), Publisher: IEEE, Pages: 586-590
Thomas MRP, Tashev IJ, Lim F, et al., 2014, OPTIMAL BEAMFORMING AS A TIME DOMAIN EQUALIZATION PROBLEM WITH APPLICATION TO ROOM ACOUSTICS, 14th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 75-79
Lim F, Naylor PA, 2014, STATISTICAL MODELLING OF MULTICHANNEL BLIND SYSTEM IDENTIFICATION ERRORS, 14th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 119-123
Parada PP, Sharma D, Lainez J, et al., 2014, A QUANTITATIVE COMPARISON OF BLIND C50 ESTIMATORS, 14th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 298-302
Antonello N, van Waterschoot T, Moonen M, et al., 2014, IDENTIFICATION OF SURFACE ACOUSTIC IMPEDANCES IN A REVERBERANT ROOM USING THE FDTD METHOD, 14th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 114-118
Borges RC, Costa MH, Naylor PA, et al., 2014, Impact of the vent size in the feedback-path and occlusion-effect in hearing aids, IEEE Biomedical Circuits and Systems Conference (BioCAS), Publisher: IEEE, Pages: 25-28, ISSN: 2163-4025
Moore AH, Brookes M, Naylor PA, 2013, Roomprints for forensic audio applications, Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE
A roomprint is a quantifiable description of an acoustic environment which can be measured under controlled conditions and estimated from a monophonic recording made in that space. We here identify the properties required of a roomprint in forensic audio applications and review the observable characteristics of a room that, when extracted from recordings, could form the basis of a roomprint. Frequency-dependent reverberation time is investigated as a promising characteristic and used in a room identification experiment giving correct identification in 96% of trials.
Gaubitch N, Brookes M, Naylor P, 2013, Blind Channel Magnitude Response Estimation in Speech using Spectrum Classification, IEEE Transactions on Audio, Speech, and Language Processing, Vol: 21, Pages: 2162-2171, ISSN: 1558-7916
Moore AH, Brookes M, Naylor PA, 2013, Room geometry estimation from a single channel acoustic impulse response, Proc. European Signal Processing Conference (EUSIPCO)
Eaton D, Brookes DM, Naylor PA, 2013, A Comparison of Non-Intrusive SNR Estimation Algorithms and the Use of Mapping Functions, EUSIPCO, Publisher: EURASIP, Pages: 1-5
We present a comparative evaluation of six methods for non-intrusive Signal-to-Noise Ratio (SNR) estimation for narrowband speech in noise. We demonstrate that the performance of all methods can be improved by applying a non-linear mapping function to their estimates of SNR. We have employed phrases built from the TIMIT speech corpus and noises from a broad range of sources including ITU-T P.501, NOISEX-92, and Soundjay. We compare the accuracy of the methods in estimating the SNR of both stationary and non-stationary noise and we conclude that with the mapping function, the best current methods can estimate the SNR to within approximately 3.5 dB for SNRs from -5 dB to 35 dB.
Kowalczyk K, Habets EAP, Kellermann W, et al., 2013, Blind System Identification Using Sparse Learning for TDOA Estimation of Room Reflections, IEEE SIGNAL PROCESSING LETTERS, Vol: 20, Pages: 653-656, ISSN: 1070-9908
Jarrett DP, Habets EAP, Naylor PA, 2013, Spherical harmonic domain noise reduction using an MVDR beamformer and DOA-based second-order statistics estimation, Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013)
Eaton J, Gaubitch ND, Naylor PA, 2013, Noise-robust reverberation time estimation using spectral decay distributions with reduced computational cost, Pages: 161-165, ISSN: 1520-6149
Reverberation Time (T60) is an important measure of the acoustic properties of a room. It can provide information about the acoustic environment and about the intelligibility and quality of speech recorded in the room, and it can help improve the performance of speech processing algorithms with reverberant speech. Where the acoustic impulse response of the room is not available, the T60 must be estimated non-intrusively from reverberant speech. State-of-the-art non-intrusive T60 estimators have been shown to be strongly biased in the presence of noise. We describe a novel T60 estimation algorithm based on spectral decay distributions that provides robustness to additive noise for a range of realistic noise types, for signal-to-noise ratios in the range 0 to 35 dB and T60s between 200 and 950 ms. The proposed method also has much reduced computational cost.
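As background to the distinction the abstract draws, when the impulse response is available T60 is commonly measured with Schroeder backward integration followed by a line fit to the decay curve. The sketch below illustrates that intrusive reference measurement only. It is not the spectral decay distribution algorithm proposed in the paper. The function name `t60_from_rir` and the -5 dB to -25 dB fitting range are assumptions chosen for illustration.

```python
import numpy as np


def t60_from_rir(rir, fs, fit_db=(-5.0, -25.0)):
    """Estimate T60 in seconds from an impulse response (hypothetical helper).

    Uses Schroeder backward integration, fits a straight line to the energy
    decay curve between the two levels in `fit_db`, and extrapolates the
    slope to a 60 dB decay.
    """
    rir = np.asarray(rir, dtype=float)
    # Energy remaining in the response from each sample onwards.
    edc = np.cumsum(rir[::-1] ** 2)[::-1]
    edc_db = 10.0 * np.log10(edc / edc[0])
    upper, lower = fit_db
    idx = np.where((edc_db <= upper) & (edc_db >= lower))[0]
    # Slope of the decay in dB per second (negative for a decaying response).
    slope, _ = np.polyfit(idx / fs, edc_db[idx], 1)
    return -60.0 / slope


if __name__ == "__main__":
    # Synthetic response with a known decay time, for demonstration only.
    fs = 16000
    true_t60 = 0.5
    n = np.arange(2 * fs)
    rir = np.exp(-3.0 * np.log(10.0) * n / (true_t60 * fs))
    print(f"estimated T60 = {t60_from_rir(rir, fs):.3f} s")
```

On measured responses the noise floor flattens the tail of the decay curve, which is one reason the fit is restricted to an upper portion of the decay.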
Lim F, Thomas MRP, Naylor PA, 2013, MINTFORMER: A SPATIALLY AWARE CHANNEL EQUALIZER, 14th IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168
Lim F, Naylor PA, 2013, ROBUST SPEECH DEREVERBERATION USING SUBBAND MULTICHANNEL LEAST SQUARES WITH VARIABLE RELAXATION, 21st European Signal Processing Conference (EUSIPCO), Publisher: IEEE
Sharma D, Naylor PA, Brookes M, 2013, NON-INTRUSIVE SPEECH INTELLIGIBILITY ASSESSMENT, 21st European Signal Processing Conference (EUSIPCO), Publisher: IEEE
Lim F, Naylor PA, 2013, ROBUST LOW-COMPLEXITY MULTICHANNEL EQUALIZATION FOR DEREVERBERATION, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 689-693, ISSN: 1520-6149
Jarrett DP, Thiergart O, Habets EAP, et al., 2012, Coherence-based diffuseness estimation in the spherical harmonic domain, Proc. of the IEEE Convention of Electrical and Electronics Engineers in Israel (IEEEI)
Jarrett DP, Habets EAP, Benesty J, et al., 2012, A tradeoff beamformer for noise reduction in the spherical harmonic domain, Proc. of the International Workshop on Acoustic Signal Enhancement (IWAENC 2012)
Annibale P, Filos J, Naylor PA, et al., 2012, Geometric inference of the room geometry under temperature variations
Geometric inference is an approach for localizing reflectors in a closed acoustic space. It is based on a simple observation that turns time difference of arrival (TDOA) or time of arrival (TOA) measurements from the signals of a microphone array into a geometric constraint. The reflector localization methodology relies on accurate TDOA measurements, whose conversion to distances depends directly on the speed of sound. Estimating the actual speed of sound at the ambient temperature therefore greatly improves the accuracy of reflector localization in uncontrolled environments. This manuscript shows how to use geometric inference jointly with speed of sound estimation for more accurate reflector localization. Simulations and experiments show the validity of the proposed approach. © 2012 IEEE.
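The dependence on temperature can be made concrete with a small numerical sketch. It uses the standard ideal-gas approximation for the speed of sound in dry air and converts a TDOA into a path-length difference at two temperatures. The function names and the example values are assumptions for illustration and are not taken from the paper, which estimates the speed of sound jointly with the reflector positions.

```python
import math


def speed_of_sound(temp_c):
    """Approximate speed of sound in dry air (m/s) at `temp_c` degrees Celsius.

    Ideal-gas approximation; humidity and pressure effects are ignored.
    """
    return 331.3 * math.sqrt(1.0 + temp_c / 273.15)


def path_difference(tdoa_s, temp_c):
    """Convert a time difference of arrival (s) into a path difference (m)."""
    return speed_of_sound(temp_c) * tdoa_s


if __name__ == "__main__":
    tdoa = 5e-3  # hypothetical 5 ms TDOA
    for temp in (10.0, 30.0):
        print(f"{temp:4.1f} degC -> {path_difference(tdoa, temp):.4f} m")
```

With these example values the same TDOA maps to path differences that differ by a few centimetres between the two temperatures, which is the kind of error that assuming a fixed speed of sound would introduce into the geometric constraint.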
Drugman T, Thomas MRP, Gudnason J, et al., 2012, Detection of Glottal Closure Instants from Speech Signals: a Quantitative Review, IEEE Trans. Audio Speech Language Proc., Vol: 20, Pages: 994-1006
Lin XS, Khong AWH, Naylor PA, 2012, A Forced Spectral Diversity Algorithm For Speech Dereverberation In The Presence Of Near-common Zeros, IEEE Trans. Audio Speech Language Proc., Vol: 20, Pages: 888-899
Habets EAP, Benesty J, Naylor PA, 2012, Speech Distortion and Interference Rejection Constraint Beamformer, IEEE Trans. Audio Speech Language Proc., Vol: 20, Pages: 854-867
Filos J, Canclini A, Antonacci F, et al., 2012, LOCALIZATION OF PLANAR ACOUSTIC REFLECTORS FROM THE COMBINATION OF LINEAR ESTIMATES, 20th European Signal Processing Conference (EUSIPCO), Publisher: IEEE COMPUTER SOC, Pages: 1019-1023, ISSN: 2076-1465
Sharma D, Naylor PA, Gaubitch ND, et al., 2012, NON INTRUSIVE CODEC IDENTIFICATION ALGORITHM, IEEE International Conference on Acoustics, Speech and Signal Processing, Publisher: IEEE, Pages: 4477-4480, ISSN: 1520-6149