Imperial College London

Dr Patrick A. Naylor

Faculty of Engineering, Department of Electrical and Electronic Engineering

Professor of Speech & Acoustic Signal Processing
 
 
 

Contact

 

+44 (0)20 7594 6235
p.naylor

 
 

Location

 

803, Electrical Engineering, South Kensington Campus



 

Publications

Citation

BibTeX format

@inproceedings{Hogg,
author = {Hogg, A and Evers, C and Naylor, P},
publisher = {IEEE},
title = {Multiple hypothesis tracking for overlapping speaker segmentation},
url = {http://hdl.handle.net/10044/1/72344},
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB  - Speaker segmentation is an essential part of any diarization system. Applications of diarization include speaker indexing, improving automatic speech recognition (ASR) performance, and making single-speaker algorithms usable in multi-speaker environments. This paper proposes a multiple hypothesis tracking (MHT) method that exploits the harmonic structure associated with the pitch in voiced speech to segment the onsets and end-points of speech from multiple, overlapping speakers. The proposed method is evaluated against a segmentation system from the literature that uses a spectral representation and is based on bidirectional long short-term memory (BLSTM) networks. The proposed method is shown to achieve comparable performance for segmenting overlapping speakers using only the pitch harmonic information in the MHT framework.
AU  - Hogg, A
AU  - Evers, C
AU  - Naylor, P
PB  - IEEE
TI  - Multiple hypothesis tracking for overlapping speaker segmentation
UR  - http://hdl.handle.net/10044/1/72344
ER  -