gms | German Medical Science

38. Wissenschaftliche Jahrestagung der Deutschen Gesellschaft für Phoniatrie und Pädaudiologie (DGPP)

Deutsche Gesellschaft für Phoniatrie und Pädaudiologie e. V.

29.09. - 02.10.2022, Leipzig

Overview of automatic voice onset measurements

Poster

38. Wissenschaftliche Jahrestagung der Deutschen Gesellschaft für Phoniatrie und Pädaudiologie (DGPP). Leipzig, 29.09.-02.10.2022. Düsseldorf: German Medical Science GMS Publishing House; 2022. DocP4

doi: 10.3205/22dgpp11, urn:nbn:de:0183-22dgpp112

Published: September 26, 2022

© 2022 Arias-Vergara et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Abstract

Background: The onset of phonation has been investigated as a biomarker for quantitative voice assessment and identification of phonation type, e.g., breathy, modal, and creaky. Different methods exist for the computation of the voice onset. Furthermore, the onset of phonation can be calculated considering different modalities: acoustic, electroglottographic (EGG), and airflow signals. We investigated existing automatic methods used to measure the voice onset and implemented them into a software suitable for quantitative analysis of recording containing the sustained phonation of vowels.

Material and methods: We investigated and implemented three state-of-the-art methods used to measure voice onset:

1.
The vocal attack time (VAT), defined as the time lag between the increase of oscillation amplitude of acoustic and EGG signals at voice initiation;
2.
the vocal rise time (VRT), defined as the time needed for the amplitude envelope of the acoustic signal to go from 20% to 80% of the maximum value;
3.
the voice onset coordination (VOC), defined as the time lag between the increase of oscillation amplitude of EGG and airflow signals at voice initiation.

The voice onset measurements were implemented in Python. In addition, a graphical user interface was implemented to measure voice onset.

Results: Studies on the voice onset measurements suggest the potential for objective and quantitative clinical assessment. Automatic voice onset measurement can be performed considering different signal processing techniques; however, the type of measure (VAT, VRT, or VOC) depends on the available modality, i.e., acoustic, EGG, or airflow.

Discussion: Voice onset measurements provide an objective analysis of abnormal behavior at the start of the phonation process. The main disadvantage of the VAT and VOC measurements is that these can only be computed when two modalities are available, i.e., acoustic and EGG signals for VAT or EGG and airflow signals for VOC. The VRT has the advantage that only the acoustic signal is necessary for computation; however, it depends on arbitrary measurement points to compute the onset.

Conclusion: Analysis of vowel production is usually focused on sustained oscillations of the vocal folds, and the transition from the rest state to a saturation value (voice onset) is often overlooked. However, studies in the literature have shown the potential applications of using voice onset measures for clinical evaluation; thus, irregularities in the vocal folds dynamics are yet to be explored.


Text

Introduction

The onset of phonation has been investigated as a biomarker for quantitative voice assessment and identification of phonation type, e.g., breathy, modal, and creaky. Different methods exist for the computation of the voice onset. Furthermore, the onset of phonation can be calculated considering different modalities: acoustic, electroglottographic (EGG), and airflow signals. We investigated existing automatic methods used to measure the voice onset and implemented them into a software suitable for quantitative analysis of recording containing the sustained phonation of vowels.

Methods

We investigated and implemented three state-of-the-art methods used to measure voice onset:

1.
The vocal attack time (VAT), defined as the time lag between the increase of oscillation amplitude of acoustic and EGG signals at voice initiation;
2.
the vocal rise time (VRT), defined as the time needed for the amplitude envelope of the acoustic signal to go from 20% to 80% of the maximum value;
3.
the voice onset coordination (VOC), defined as the time lag between the increase of oscillation amplitude of EGG and airflow signals at voice initiation.

The voice onset measurements were implemented in Python. In addition, a graphical user interface was implemented to measure voice onset (Figure 1 [Fig. 1]).

Results

Studies on the voice onset measurements suggest the potential for objective and quantitative clinical assessment. Automatic voice onset measurement can be performed considering different signal processing techniques; however, the type of measure (VAT, VRT, or VOC) depends on the available modality, i.e., acoustic, EGG, or airflow.

Discussion

Voice onset measurements provide an objective analysis of abnormal behavior at the start of the phonation process. The main disadvantage of the VAT and VOC measurements is that these can only be computed when two modalities are available, i.e., acoustic and EGG signals for VAT or EGG and airflow signals for VOC. The VRT has the advantage that only the acoustic signal is necessary for computation; however, it depends on arbitrary measurement points to compute the onset.

Conclusion

Analysis of vowel production is usually focused on sustained oscillations of the vocal folds, and the transition from the rest state to a saturation value (voice onset) is often overlooked. However, studies in the literature have shown the potential applications of using voice onset measures for clinical evaluation; thus, irregularities in the vocal folds dynamics are yet to be explored.