106 results found
Dionelis N, Brookes, 2017, Speech Enhancement Using Modulation-Domain Kalman Filtering with Active Speech Level Normalized Log-Spectrum Global Priors, 25th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, ISSN: 2076-1465
We describe a single-channel speech enhancement algorithm that is based on modulation-domain Kalman filtering that tracks the inter-frame time evolution of the speech logpower spectrum in combination with the long-term average speech log-spectrum. We use offline-trained log-power spectrum global priors incorporated in the Kalman filter prediction and update steps for enhancing noise suppression. In particular, we train and utilize Gaussian mixture model priors for speech in the log-spectral domain that are normalized with respect to the active speech level. The Kalman filter update step uses the log-power spectrum global priors together with the local priors obtained from the Kalman filter prediction step. The logspectrum Kalman filtering algorithm, which uses the theoretical phase factor distribution and improves the modeling of the modulation features, is evaluated in terms of speech quality. Different algorithm configurations, dependent on whether global priors and/or Kalman filter noise tracking are used, are compared in various noise types.
Dionelis N, Brookes M, 2017, MODULATION-DOMAIN SPEECH ENHANCEMENT USING A KALMAN FILTER WITH A BAYESIAN UPDATE OF SPEECH AND NOISE IN THE LOG-SPECTRAL DOMAIN, Conference on Hands-Free Speech Communications and Microphone Arrays (HSCMA), Publisher: IEEE, Pages: 111-115
Doire CSJ, Brookes M, Naylor PA, 2017, Robust and efficient Bayesian adaptive psychometric function estimation, JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, Vol: 141, Pages: 2501-2512, ISSN: 0001-4966
Doire CSJ, Brookes M, Naylor PA, et al., 2017, Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 25, Pages: 572-587, ISSN: 2329-9290
Koulouri A, Brookes M, Rimpilaeinen V, 2017, Vector tomography for reconstructing electric fields with non-zero divergence in bounded domains, JOURNAL OF COMPUTATIONAL PHYSICS, Vol: 329, Pages: 73-90, ISSN: 0021-9991
Lightburn L, De Sena E, Moore A, et al., 2017, Improving the perceptual quality of ideal binary masked speech, Pages: 661-665, ISSN: 1520-6149
© 2017 IEEE. It is known that applying a time-frequency binary mask to very noisy speech can improve its intelligibility but results in poor perceptual quality. In this paper we propose a new approach to applying a binary mask that combines the intelligibility gains of conventional binary masking with the perceptual quality gains of a classical speech enhancer. The binary mask is not applied directly as a time-frequency gain as in most previous studies. Instead, the mask is used to supply prior information to a classical speech enhancer about the probability of speech presence in different time-frequency regions. Using an oracle ideal binary mask, we show that the proposed method results in a higher predicted quality than other methods of applying a binary mask whilst preserving the improvements in predicted intelligibility.
Moore AH, Brookes M, Naylor PA, 2017, Robust spherical harmonic domain interpolation of spatially sampled array manifolds, Pages: 521-525, ISSN: 1520-6149
© 2017 IEEE. Accurate interpolation of the array manifold is an important first step for the acoustic simulation of rapidly moving microphone arrays. Spherical harmonic domain interpolation has been proposed and well studied in the context of head-related transfer functions but has focussed on perceptual, rather than numerical, accuracy. In this paper we analyze the effect of measurement noise on spatial aliasing. Based on this analysis we propose a method for selecting the truncation orders for the forward and reverse spherical Fourier transforms given only the noisy samples in such a way that the interpolation error is minimized. The proposed method achieves up to 1.7 dB improvement over the baseline approach.
Xue W, Brookes M, Naylor PA, 2017, Frequency-domain under-modelled blind system identification based on cross power spectrum and sparsity regularization, Pages: 591-595, ISSN: 1520-6149
© 2017 IEEE. In room acoustics, under-modelled multichannel blind system identification (BSI) aims to estimate the early part of the room impulse responses (RIRs), and it can be widely used in applications such as speaker localization, room geometry identification and beamforming based speech dereverberation. In this paper we extend our recent study on under-modelled BSI from the time domain to the frequency domain, such that the RIRs can be updated frame-wise and the efficiency of Fast Fourier Transform (FFT) is exploited to reduce the computational complexity. Analogous to the cross-correlation based criterion in the time domain, a frequency-domain cross power spectrum based criterion is proposed. As the early RIRs are usually sparse, the RIRs are estimated by jointly maximizing the cross power spectrum based criterion in the frequency domain and minimizing the l 1 -norm sparsity measure in the time domain. A two-stage LMS updating algorithm is derived to achieve joint optimization of these two targets. The experimental results in different under-modelled scenarios demonstrate the effectiveness of the proposed method.
Dionelis N, Brookes M, 2016, ACTIVE SPEECH LEVEL ESTIMATION IN NOISY SIGNALS WITH QUADRATURE NOISE SUPPRESSION, 24th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1193-1197, ISSN: 2076-1465
Koulouri A, Rimpilaeinen V, Brookes M, et al., 2016, Compensation of domain modelling errors in the inverse source problem of the Poisson equation: Application in electroencephalographic imaging, APPLIED NUMERICAL MATHEMATICS, Vol: 106, Pages: 24-36, ISSN: 0168-9274
Lawson M, Brookes M, Dragotti PL, 2016, Capturing the plenoptic function in a swipe, Conference on Applications of Digital Image Processing XXXIX, Publisher: SPIE-INT SOC OPTICAL ENGINEERING, ISSN: 0277-786X
Lightbum L, Brookes M, 2016, A WEIGHTED STOI INTELLIGIBILITY METRIC BASED ON MUTUAL INFORMATION, IEEE International Conference on Acoustics, Speech, and Signal Processing, Publisher: IEEE, Pages: 5365-5369, ISSN: 1520-6149
Sharma D, Wang Y, Naylor PA, et al., 2016, A data-driven non-intrusive measure of speech quality and intelligibility, SPEECH COMMUNICATION, Vol: 80, Pages: 84-94, ISSN: 0167-6393
Wang Y, Brookes M, 2016, SPEECH ENHANCEMENT USING AN MMSE SPECTRAL AMPLITUDE ESTIMATOR BASED ON A MODULATION DOMAIN KALMAN FILTER WITH A GAMMA PRIOR, IEEE International Conference on Acoustics, Speech, and Signal Processing, Publisher: IEEE, Pages: 5225-5229, ISSN: 1520-6149
Xue W, Brookes M, Naylor PA, 2016, UNDER-MODELLED BLIND SYSTEM IDENTIFICATION FOR TIME DELAY ESTIMATION IN REVERBERANT ENVIRONMENTS, 15th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE
Xue W, Brookes M, Naylor PA, 2016, Cross-Correlation Based Under-Modelled Multichannel Blind Acoustic System Identification with Sparsity Regularization, 24th European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 718-722, ISSN: 2076-1465
Doire CSJ, Brookes M, Naylor PA, et al., 2015, SINGLE-CHANNEL BLIND ESTIMATION OF REVERBERATION PARAMETERS, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 31-35, ISSN: 1520-6149
Doire CSJ, Brookes M, Naylor PA, et al., 2015, DATA-DRIVEN STATISTICAL MODELLING OF ROOM IMPULSE RESPONSES IN THE POWER DOMAIN, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2466-2470, ISSN: 2076-1465
Hu M, Doclo S, Sharma D, et al., 2015, NOISE ROBUST BLIND SYSTEM IDENTIFICATION ALGORITHMS BASED ON A RAYLEIGH QUOTIENT COST FUNCTION, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2476-2480, ISSN: 2076-1465
Hu M, Parada PP, Sharma D, et al., 2015, SINGLE-CHANNEL SPEAKER DIARIZATION BASED ON SPATIAL FEATURES, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168
Hu M, Sharma D, Doclo S, et al., 2015, SPEAKER CHANGE DETECTION AND SPEAKER DIARIZATION USING SPATIAL INFORMATION, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5743-5747, ISSN: 1520-6149
Lightburn L, Brookes M, 2015, SOBM - A BINARY MASK FOR NOISY SPEECH THAT OPTIMISES AN OBJECTIVE INTELLIGIBILITY METRIC, 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Publisher: IEEE, Pages: 5078-5082, ISSN: 1520-6149
Gilliam C, Dragotti P-L, Brookes M, 2014, On the Spectrum of the Plenoptic Function, IEEE TRANSACTIONS ON IMAGE PROCESSING, Vol: 23, Pages: 502-516, ISSN: 1057-7149
Gonzalez S, Brookes M, 2014, PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 22, Pages: 518-530, ISSN: 2329-9290
Gonzalez S, Brookes M, 2014, MASK-BASED ENHANCEMENT FOR VERY LOW QUALITY SPEECH, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, ISSN: 1520-6149
Hilkhuysen G, Gaubitch N, Brookes M, et al., 2014, Effects of noise suppression on intelligibility. II: An attempt to validate physical metrics, JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, Vol: 135, Pages: 439-450, ISSN: 0001-4966
Jones Z, Brookes M, Dragotti PL, et al., 2014, WIDE-BASELINE IMAGE CHANGE DETECTION, IEEE International Conference on Image Processing (ICIP), Publisher: IEEE, Pages: 1589-1593, ISSN: 1522-4880
Pearson J, Visentini-Scarzanella M, Brookes M, et al., 2014, TILTED LAYER-BASED MODELING FOR ENHANCED LIGHT-FIELD PROCESSING AND IMAGE BASED RENDERING, IEEE International Conference on Image Processing (ICIP), Publisher: IEEE, Pages: 1917-1921, ISSN: 1522-4880
Stanton R, Brookes M, 2014, PATH UNCERTAINTY ROBUST BEAMFORMING, 22nd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1925-1929, ISSN: 2076-1465
Wang Y, Brookes M, 2014, SPEECH ENHANCEMENT USING A MODULATION DOMAIN KALMAN FILTER POST-PROCESSOR WITH A GAUSSIAN MIXTURE NOISE MODEL, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, ISSN: 1520-6149
This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.