Sciforum : Event management platform

You seem to have javascript disabled. Please note that many of the page functionalitities won't work as expected without javascript enabled.

Previous Article in event

Effect of Data Collection and Environment on Machine Learning Performance in Screening Dysphonia

Next Article in event

Human embryonic stem cells naïve pluripotency induction in a fully defined xenogeneic-free synthetic polymer dish coating (PMEDSAH).

Next Article in session

SYMMETRY-BASED EYE DETECTION IN FACIAL IMAGES USING HOUGH TRANSFORM FOR CIRCLES

How Do Room Acoustics Impact Machine Learning Accuracy in Voice Disorder Detection?

Ahmed M. Yousef

^*,

¹ Department of Communication Sciences and Disorders, University of Iowa, Iowa City, Iowa, USA

Academic Editor: Andrea Cataldo

Published: 11 October 2024 by MDPI in The 1st International Online Conference on Bioengineering session Biosignal Processing

Abstract:

Objectives/Introduction: In acoustic voice assessment, recordings are typically collected from diverse environments with varying levels of noise and reverberation. These room acoustics are known to affect the quality of recordings and acoustic analysis, but their impact on advanced tools like machine learning remains little understood. This paper investigates how different room acoustics, particularly reverberation, influence machine learning performance in assessing voice quality and dysphonia.

Methods: This retrospective study utilized voice recordings of sustained /a:/ samples from 193 subjects (145 with voice disorders and 48 without vocal problems). The recordings were modified to add on different levels of reverberation and noise using Audacity software, simulating various room acoustic environments. Using a MATLAB script and Praat software, we extracted different acoustic measurements (temporal- and spectral-based metrics) from the original and corrupted recordings. Various machine learning models were then trained on the generated acoustic features. The models were evaluated for accuracy, sensitivity, and specificity to compare the impact of the recordings, both before and after adding reverberation and noise effects, on machine learning performance in detecting voice disorders.

Results and Conclusions: The recordings were successfully mixed with varying levels of reverberation and noise, creating a diverse set of datasets. Machine learning models were trained and evaluated on these datasets to classify normal and pathological voices under different noise and reverberation conditions. A comparison of the models demonstrated that higher levels of reverberation and noise degrade classification performance. Identifying the acceptable room acoustic conditions where machine learning models produce reliable results helps in optimizing and standardizing environmental conditions for data collection, ensuring accurate voice assessment outcomes.

Keywords: Room Acoustics; Voice Disorders; Machine Learning; Voice Assessment

View Poster

0 Reads
0 Recommendations

,

© 1996-2025 MDPI (Basel, Switzerland) unless otherwise stated

Disclaimer Terms and Conditions Privacy Policy

Top