Imperial College London

Patrick A. Naylor

Faculty of EngineeringDepartment of Electrical and Electronic Engineering

Professor of Speech & Acoustic Signal Processing
 
 
 
//

Contact

 

+44 (0)20 7594 6235p.naylor Website

 
 
//

Location

 

803Electrical EngineeringSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@inproceedings{Neo:2022:10.1109/SSPD54131.2022.9896222,
author = {Neo, VW and Weiss, S and Naylor, PA},
doi = {10.1109/SSPD54131.2022.9896222},
pages = {1--5},
publisher = {IEEE},
title = {A polynomial subspace projection approach for the detection of weak voice activity},
url = {http://dx.doi.org/10.1109/SSPD54131.2022.9896222},
year = {2022}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - A voice activity detection (VAD) algorithm identifies whether or not time frames contain speech. It is essential for many military and commercial speech processing applications, including speech enhancement, speech coding, speaker identification, and automatic speech recognition. In this work, we adopt earlier work on detecting weak transient signals and propose a polynomial subspace projection pre-processor to improve an existing VAD algorithm. The proposed multi-channel pre-processor projects the microphone signals onto a lower dimensional subspace which attempts to remove the interferer components and thus eases the detection of the speech target. Compared to applying the same VAD to the microphone signal, the proposed approach almost always improves the F1 and balanced accuracy scores even in adverse environments, e.g. -30 dB SIR, which may be typical of operations involving noisy machinery and signal jamming scenarios.
AU - Neo,VW
AU - Weiss,S
AU - Naylor,PA
DO - 10.1109/SSPD54131.2022.9896222
EP - 5
PB - IEEE
PY - 2022///
SP - 1
TI - A polynomial subspace projection approach for the detection of weak voice activity
UR - http://dx.doi.org/10.1109/SSPD54131.2022.9896222
UR - https://ieeexplore.ieee.org/document/9896222
UR - http://hdl.handle.net/10044/1/99145
ER -