Imperial College London

ProfessorBjoernSchuller

Faculty of EngineeringDepartment of Computing

Professor of Artificial Intelligence
 
 
 
//

Contact

 

+44 (0)20 7594 8357bjoern.schuller Website

 
 
//

Location

 

574Huxley BuildingSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@inproceedings{Xu:2021:10.1145/3412841.3441938,
author = {Xu, X and Deng, J and Zhang, Z and Wu, C and Schuller, B},
doi = {10.1145/3412841.3441938},
pages = {580--585},
title = {Identifying surgical-mask speech using deep neural networks on low-level aggregation},
url = {http://dx.doi.org/10.1145/3412841.3441938},
year = {2021}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - The task of Mask-Speech Identification (MSI) aims at judging whether a chunk of speech is pronounced when the speaker is wearing a facial mask or not. Most of the existing related research focuses on investigating the influence of wearing a mask, which only adapts in some certain cases to speech analysis. Thus in order to generalise the research on MSI, we propose an MSI approach using deep networks on Low-Level Aggregation (LLA) for speech chunks. The proposed approach benefits from data augmentation on Low-Level Descriptors (LLDs), resulting in more adaptation to deep models through inputting much more samples in training without employing pre-trained knowledge. Experiments are performed on the dataset of Mask Augsburg Speech Corpus (MSC) used in the INTERSPEECH 2020 ComParE challenge, considering the influence from employing different strategies. The experimental results show effectiveness of the proposed approach compared with the ComParE challenge baselines.
AU - Xu,X
AU - Deng,J
AU - Zhang,Z
AU - Wu,C
AU - Schuller,B
DO - 10.1145/3412841.3441938
EP - 585
PY - 2021///
SP - 580
TI - Identifying surgical-mask speech using deep neural networks on low-level aggregation
UR - http://dx.doi.org/10.1145/3412841.3441938
ER -