Imperial College London

ProfessorBjoernSchuller

Faculty of EngineeringDepartment of Computing

Professor of Artificial Intelligence
 
 
 
//

Contact

 

+44 (0)20 7594 8357bjoern.schuller Website

 
 
//

Location

 

574Huxley BuildingSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@inbook{Yang:2020:10.1007/978-981-15-2756-2_11,
author = {Yang, Z and Qian, K and Ren, Z and Baird, A and Zhang, Z and Schuller, B},
doi = {10.1007/978-981-15-2756-2_11},
pages = {133--143},
title = {Learning multi-resolution representations for acoustic scene classification via neural networks},
url = {http://dx.doi.org/10.1007/978-981-15-2756-2_11},
year = {2020}
}

RIS format (EndNote, RefMan)

TY  - CHAP
AB - This study investigates the performance of wavelet as well as conventional temporal and spectral features for acoustic scene classification, testing the effectiveness of both feature sets when combined with neural networks on acoustic scene classification. The TUT Acoustic Scenes 2017 Database is used in the evaluation of the system. The model with wavelet energy feature achieved 74.8 % and 60.2 % on development and evaluation set respectively, which is better than the model using temporal and spectral feature set (72.9 % and 59.4 %). Additionally, to optimise the generalisation and robustness of the models, a decision fusion method based on the posterior probability of each audio scene is used. Comparing with the baseline system of the Detection and Classification Acoustic Scenes and Events 2017 (DCASE 2017) challenge, the best decision fusion model achieves 79.2 % and 63.8 % on the development and evaluation sets, respectively, where both results significantly exceed the baseline system result of 74.8 % and 61.0 % (confirmed by one tailed z-test p < 0.01 and p < 0.05 respectively.
AU - Yang,Z
AU - Qian,K
AU - Ren,Z
AU - Baird,A
AU - Zhang,Z
AU - Schuller,B
DO - 10.1007/978-981-15-2756-2_11
EP - 143
PY - 2020///
SN - 9789811527555
SP - 133
TI - Learning multi-resolution representations for acoustic scene classification via neural networks
UR - http://dx.doi.org/10.1007/978-981-15-2756-2_11
ER -