Publications

Ren Z, Chang Y, Bartl-Pokorny KD, Pokorny FB, Schuller BWet al., 2022, The Acoustic Dissection of Cough: Diving into Machine Listening-based COVID-19 Analysis and Detection

<jats:title>Abstract</jats:title><jats:sec><jats:title>Purpose</jats:title><jats:p>The coronavirus disease 2019 (COVID-19) has caused a crisis worldwide. Amounts of efforts have been made to prevent and control COVID-19’s transmission, from early screenings to vaccinations and treatments. Recently, due to the spring up of many automatic disease recognition applications based on machine listening techniques, it would be fast and cheap to detect COVID-19 from recordings of cough, a key symptom of COVID-19. To date, knowledge on the acoustic characteristics of COVID-19 cough sounds is limited, but would be essential for structuring effective and robust machine learning models. The present study aims to explore acoustic features for distinguishing COVID-19 positive individuals from COVID-19 negative ones based on their cough sounds.</jats:p></jats:sec><jats:sec><jats:title>Methods</jats:title><jats:p>With the theory of computational paralinguistics, we analyse the acoustic correlates of COVID-19 cough sounds based on the COMPARE feature set, i. e., a standardised set of 6,373 acoustic higher-level features. Furthermore, we train automatic COVID-19 detection models with machine learning methods and explore the latent features by evaluating the contribution of all features to the COVID-19 status predictions.</jats:p></jats:sec><jats:sec><jats:title>Results</jats:title><jats:p>The experimental results demonstrate that a set of acoustic parameters of cough sounds, e. g., statistical functionals of the root mean square energy and Mel-frequency cepstral coefficients, are relevant for the differentiation between COVID-19 positive and COVID-19 negative cough samples. Our automatic COVID-19 detection model performs significantly above chance level, i. e., at an unweighted average recall (UAR) of 0.632, on a data set consisting of 1,411 cough samples (COVID-19 positiv

Journal article

Lefter I, Baird A, Stappen L, Schuller BWet al., 2022, A Cross-Corpus Speech-Based Analysis of Escalating Negative Interactions, FRONTIERS IN COMPUTER SCIENCE, Vol: 4

Author Web Link
Cite
Citations: 1

Journal article

Milling M, Bartl-Pokorny KD, Schuller BW, 2022, Investigating Automatic Speech Emotion Recognition for Children with Autism Spectrum Disorder in interactive intervention sessions with the social robot Kaspar

<jats:title>ABSTRACT</jats:title><jats:p>In this contribution, we present the analyses of vocalisation data recorded in the first observation round of the European Commission’s Erasmus Plus project “EMBOA, Affective loop in Socially Assistive Robotics as an intervention tool for children with autism”. In total, the project partners recorded data in 112 robot-supported intervention sessions for children with autism spectrum disorder. Audio data were recorded using the internal and lapel microphone of the H4n Pro Recorder. To analyse the data, we first utilise a child voice activity detection (VAD) system in order to extract child vocalisations from the raw audio data. For each child, session, and microphone, we provide the total time child vocalisations were detected. Next, we compare the results of two different implementations for valence- and arousal-based speech emotion recognition, thereby processing (1) the child vocalisations detected by the VAD and (2) the total recorded audio material. We provide average valence and arousal values for each session and condition. Finally, we discuss challenges and limitations of child voice detection and audio-based emotion recognition in robot-supported intervention settings.</jats:p>

Journal article

Liu S, Han J, Puyal EL, Kontaxis S, Sun S, Locatelli P, Dineley J, Pokorny FB, Dalla Costa G, Leocani L, Guerrero AI, Nos C, Zabalza A, Sorensen PS, Buron M, Magyari M, Ranjan Y, Rashid Z, Conde P, Stewart C, Folarin AA, Dobson RJB, Bailon R, Vairavan S, Cummins N, Narayan VA, Hotopf M, Comi G, Schuller Bet al., 2022, Fitbeat: COVID-19 estimation based on wristband heart rate using a contrastive convolutional auto-encoder, PATTERN RECOGNITION, Vol: 123, ISSN: 0031-3203

Author Web Link
Cite
Citations: 11

Journal article

Parada-Cabaleiro E, Batliner A, Baird A, Schuller Bet al., 2022, The perception of emotional cues by children in artificial background noise (vol 23, pg 169, 2020), INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, Vol: 25, Pages: 289-289, ISSN: 1381-2416

Journal article

Milling M, Baird A, Bartl-Pokorny KD, Liu S, Alcorn AM, Shen J, Tavassoli T, Ainger E, Pellicano E, Pantic M, Cummins N, Schuller BWet al., 2022, Evaluating the Impact of Voice Activity Detection on Speech Emotion Recognition for Autistic Children, FRONTIERS IN COMPUTER SCIENCE, Vol: 4

Journal article

Deshpande G, Batliner A, Schuller BW, 2022, AI-Based human audio processing for COVID-19: A comprehensive overview, PATTERN RECOGNITION, Vol: 122, ISSN: 0031-3203

Author Web Link
Cite
Citations: 19

Journal article

Mohamed MM, Nessiem MA, Batliner A, Bergler C, Hantke S, Schmitt M, Baird A, Mallol-Ragolta A, Karas V, Amiriparian S, Schuller BWet al., 2022, Face mask recognition from audio: The MASC database and an overview on the mask challenge, PATTERN RECOGNITION, Vol: 122, ISSN: 0031-3203

Author Web Link
Cite
Citations: 12

Journal article

Schuller BW, Eldar Y, Pantic M, Narayanan S, Virtanen T, Tao Jet al., 2022, Editorial: Intelligent Signal Analysis for Contagious Virus Diseases, IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, Vol: 16, Pages: 159-163, ISSN: 1932-4553

Journal article

Chang Y, Jing X, Ren Z, Schuller BWet al., 2022, CovNet: A Transfer Learning Framework for Automatic COVID-19 Detection From Crowd-Sourced Cough Sounds, FRONTIERS IN DIGITAL HEALTH, Vol: 3

Author Web Link
Cite
Citations: 2

Journal article

Wen S, Huang T, Schuller BW, Azar ATet al., 2022, Guest Editorial Introduction to the Special Section on Efficient Network Design for Convergence of Deep Learning and Edge Computing, IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, Vol: 9, Pages: 109-110, ISSN: 2327-4697

Journal article

Nessiem MA, Coppock H, Mohamed MM, Schuller BWet al., 2022, Artificial intelligence in COVID-19, Omics Approaches and Technologies in COVID-19, Pages: 255-273, ISBN: 9780323986212

The COVID-19 pandemic has taken the world by storm, placing healthcare systems around the globe under immense pressure. The exceptional circumstance has made the scientific community turn to artificial intelligence (AI), with hopes that AI techniques can be used in all aspects of combating the pandemic, whether it is in using AI to uncover sequences in the genomic code of the severe acute respiratory syndrome coronavirus (SARS-CoV-2) virus for the purposes of developing therapeutics, such as antivirals, antibodies, or vaccines, or using AI to provide (near-) instantaneous clinical diagnosis techniques by way of analysis of chest X-ray (CXR) images, computed tomography (CT) scans or other useful modalities, or using AI for as a tool for mass population testing by analyzing patient audio recordings. In this chapter, we survey the AI research literature with respect to applications for COVID-19 and showcase and critique notable state of the art approaches.

Abstract
Cite
Citations: 1

Book chapter

Milling M, Aslan I, Berghofer M, Mallol-Ragolta A, Kunwar U, Schuller BWet al., 2022, Online Personalisation of Deep Mobile Activity Recognisers, 7th International Workshop on Sensor-Based Activity Recognition and Artificial Intelligence (IWOAR), Publisher: ASSOC COMPUTING MACHINERY

Conference paper

Zhao S, Huang Q, Tang Y, Yao X, Yang J, Ding G, Schuller BWet al., 2022, Computational Emotion Analysis From Images: Recent Advances and Future Directions, Human Perception of Visual Information, Publisher: Springer International Publishing, Pages: 85-113, ISBN: 9783030814649

Book chapter

Xu X, Deng J, Cummins N, Zhang Z, Zhao L, Schuller BWet al., 2022, Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes, IEEE TRANSACTIONS ON MULTIMEDIA, Vol: 24, Pages: 2752-2765, ISSN: 1520-9210

Author Web Link
Cite
Citations: 6

Journal article

Lu C, Zong Y, Zheng W, Li Y, Tang C, Schuller BWet al., 2022, Domain Invariant Feature Learning for Speaker-Independent Speech Emotion Recognition, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, Vol: 30, Pages: 2217-2230, ISSN: 2329-9290

Author Web Link
Cite
Citations: 10

Journal article

Yang Z, Jing X, Triantafyllopoulos A, Song M, Aslan I, Schuller BWet al., 2022, An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion, Interspeech Conference, Publisher: ISCA-INT SPEECH COMMUNICATION ASSOC, Pages: 4915-4919, ISSN: 2308-457X

Conference paper

Yan T, Meng H, Liu S, Parada-Cabaleiro E, Ren Z, Schuller BWet al., 2022, CONVOLUATIONAL TRANSFORMER WITH ADAPTIVE POSITION EMBEDDING FOR COVID-19 DETECTION FROM COUGH SOUNDS, 47th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, Pages: 9092-9096, ISSN: 1520-6149

Author Web Link
Cite
Citations: 2

Conference paper

Jing X, Liu S, Parada-Cabaleiro E, Triantafyllopoulos A, Song M, Yang Z, Schuller BWet al., 2022, A Temporal-oriented Broadcast ResNet for COVID-19 Detection, 4th IEEE-EMBS International Conference on Wearable and Implantable Body Sensor Networks (BSN) / 18th IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), Publisher: IEEE

Conference paper

Qian K, Schultz T, Schuller BW, 2022, AN OVERVIEW OF THE FIRST ICASSP SPECIAL SESSION ON COMPUTER AUDITION FOR HEALTHCARE, 47th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, Pages: 9002-9006, ISSN: 1520-6149

Author Web Link
Cite
Citations: 1

Conference paper

Yu S, Ding Y, Qian K, Hu B, Li W, Schuller BWet al., 2022, A GLANCE-AND-GAZE NETWORK FOR RESPIRATORY SOUND CLASSIFICATION, 47th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Publisher: IEEE, Pages: 9007-9011, ISSN: 1520-6149

Author Web Link
Cite
Citations: 1

Conference paper

Kim JY, Liu C, Calvo RA, McCabe K, Taylor SCR, Schuller BW, Wu Ket al., 2022, Comparison of Automatic Speech Recognition Systems, International Workshop on Spoken Dialog System Technology, Publisher: Springer Nature Singapore, Pages: 123-131, ISSN: 1876-1100

Cite

Conference paper

Hledikova A, Woszczyk D, Acman A, Demetriou S, Schuller Bet al., 2022, Data Augmentation for Dementia Detection in Spoken Language, Interspeech Conference, Publisher: ISCA-INT SPEECH COMMUNICATION ASSOC, Pages: 2858-2862, ISSN: 2308-457X

Author Web Link
Cite
Citations: 1

Conference paper

Mira R, Haliassos A, Petridis S, Schuller BW, Pantic Met al., 2022, SVTS: Scalable Video-to-Speech Synthesis, Interspeech Conference, Publisher: ISCA-INT SPEECH COMMUNICATION ASSOC, Pages: 1836-1840, ISSN: 2308-457X

Conference paper

Baird A, Triantafyllopoulos A, Zaenkert S, Ottl S, Christ L, Stappen L, Konzok J, Sturmbauer S, Messner E-M, Kudielka BM, Rohleder N, Baumeister H, Schuller BWet al., 2021, An Evaluation of Speech-Based Recognition of Emotional and Physiological Markers of Stress, FRONTIERS IN COMPUTER SCIENCE, Vol: 3

Author Web Link
Cite
Citations: 4

Journal article

Coppock H, Jones L, Kiskin I, Schuller Bet al., 2021, Bias and privacy in AI's cough-based COVID-19 recognition, LANCET DIGITAL HEALTH, Vol: 3, Pages: E761-E761

Author Web Link
Cite
Citations: 1

Journal article

Schaefer J, Milling M, Schuller BW, Bauer B, Brunner JO, Traidl-Hoffmann C, Damialis Aet al., 2021, Towards automatic airborne pollen monitoring: From commercial devices to operational by mitigating class-imbalance in a deep learning approach, SCIENCE OF THE TOTAL ENVIRONMENT, Vol: 796, ISSN: 0048-9697

Author Web Link
Cite
Citations: 19

Journal article

Qian K, Schmitt M, Zheng H, Koike T, Han J, Liu J, Ji W, Duan J, Song M, Yang Z, Ren Z, Liu S, Zhang Z, Yamamoto Y, Schuller BWet al., 2021, Computer Audition for Fighting the SARS-CoV-2 Corona Crisis-Introducing the Multitask Speech Corpus for COVID-19, IEEE INTERNET OF THINGS JOURNAL, Vol: 8, Pages: 16035-16046, ISSN: 2327-4662

Author Web Link
Cite
Citations: 7

Journal article

Han J, Zhang Z, Mascolo C, Andre E, Tao J, Zhao Z, Schuller BWet al., 2021, Deep Learning for Mobile Mental Health: Challenges and recent advances, IEEE SIGNAL PROCESSING MAGAZINE, Vol: 38, Pages: 96-105, ISSN: 1053-5888

Author Web Link
Cite
Citations: 5

Journal article

Schuller B, Baird A, Gebhard A, Amiriparian S, Keren G, Schmitt M, Cummins Net al., 2021, New Avenues in Audio Intelligence: Towards Holistic Real-life Audio Understanding, TRENDS IN HEARING, Vol: 25, ISSN: 2331-2165

Author Web Link
Cite
Citations: 1

Journal article

ProfessorBjoernSchuller

Contact

Location

Summary