Methods Inf Med 2003; 42(03): 271-276
DOI: 10.1055/s-0038-1634360
Original article
Schattauer GmbH

Normal Voice Production: Computation of Driving Parameters from Endoscopic Digital High Speed Images

M. Döllinger
1   Department of Phoniatrics and Pediatric Audiology, University of Erlangen-Nuremberg, Erlangen, Germany
,
T. Braunschweig
2   Department of Phoniatrics and Pediatric Audiology, University of Jena, Jena, Germany
,
J. Lohscheller
1   Department of Phoniatrics and Pediatric Audiology, University of Erlangen-Nuremberg, Erlangen, Germany
,
U. Eysholdt
1   Department of Phoniatrics and Pediatric Audiology, University of Erlangen-Nuremberg, Erlangen, Germany
,
U. Hoppe
1   Department of Phoniatrics and Pediatric Audiology, University of Erlangen-Nuremberg, Erlangen, Germany
› Author Affiliations
Further Information

Publication History

Received 15 July 2002

Accepted 13 December 2002

Publication Date:
07 February 2018 (online)

Summary

Objectives: A central point for quantitative evaluation of pathological and healthy voices is the analysis of vocal fold oscillations. By means of digital High Speed Glottography (HGG), vocal fold oscillations can be recorded in real time. Recently, a numerical inversion procedure was developed that allows the extraction of physiological parameters from digital high speed videos and a classification of voice disorders. The aim of this work was to validate the inversion procedure and to investigate the applicability to normal voices.

Methods: High speed recordings were performed during phonation within a group of five female and five male persons with normal voices. By using knowledge based image processing algorithms, motion curves of the vocal folds were extracted at three different positions (dorsal, medial, ventral). These curves were used to obtain physiological voice parameters, and in particular the degree of symmetry of the vocal folds based upon a biomechanical model of the vocal folds.

Results: The highest degree of symmetry was observed for the medial motion curves. While the dor-sally and ventrally extracted motion curves exhibited similar results concerning the degree of symmetry the performance of the algorithm was less stable.

Conclusions: The inversion algorithm provides reasonable results for all subjects when applied to the medial motion curves. However, for dorsal and ventral motion curves, correct performance is reduced to 85 %.

 
  • References

  • 1 Isshiki N, Tanabe M, Ishizaka K, Broad D. Clinical significance of asymmetrical vocal cord tension. Ann. Otol. 1977; 86: 58-66.
  • 2 Mergell P, Titze IR, Herzel H. Irregular vocal-fold vibration – high speed observation and modeling. J. Acoust. Soc. Am. 2000; 108: 2996-3002.
  • 3 Ishizaka K, Flanagan JL. Synthesis of voiced sounds from a two-mass-model of the vocal cords. Bell Syst. Techn. J. 1972; 51: 1233-68.
  • 4 Döllinger M, Hoppe U, Hettlich F, Lohscheller J, Schuberth S, Eysholdt U. Vocal Fold Parameter Extraction Using The Two-Mass-Model. IEEE T Bio-Med Eng 2002; 49 (Suppl. 08) 773-81.
  • 5 Flanagan JL, Ishizaka K. Computer model to characterize the air volume displaced by the vibrating vocal cords. J. Acoust. Soc. Am. 1978; 63: 1559-65.
  • 6 Steinecke I. Untersuchungen an einem vereinfachten Stimmlippenmodell. Diplomarbeit, Institut für Theoretische Physik, Humboldt-Universität; Berlin: 1994
  • 7 Steinecke I, Herzel H. Bifurcations in an asymmetric vocal fold model. J. Acoust. Soc. Am. 1995; 97: 1874-84.
  • 8 Herzel H, Knudsen C. Bifurcations in a vocal fold model. Nonlinear Dynamics 1995; 7: 53-64.
  • 9 Lucero JC. Dynamics of the two-mass model of the vocal folds: Equilibria, bifurcations, and oscillation region. J. Acoust. Soc. Am. 1993; 94: 3104-11.
  • 10 Mergell P, Herzel H. Modelling biphonation – the role of the vocal tract. Speech Communication 1997; 22: 141-54.
  • 11 Herzel H, Berry DA, Titze IR, Steinecke I. Nonlinear dynamics of the voice: Signal analysis and biomechanical modelling. Chaos 1995; 5: 30-4.
  • 12 Berry DA, Herzel H, Titze IR, Krischer K. Interpretation of biomechanical simulations of normal and chaotic vocal fold oscillations with empirical eigenfunctions. J. Acoust. Soc. Am. 1994; 95: 3595-604.
  • 13 Hoppe U. Mechanisms of Hoarseness – Visualization and Interpretation by Means of Nonlinear Dynamics. Aachen, Germany: Shaker Verlag; 2001
  • 14 Eysholdt U, Tigges M, Wittenberg T, Pröschel U. Direct Evaluation of High-Dpeed Recordings of Vocal Fold Vibrations. Folia Phoniatrica et Logopaedica 1996; 48: 163-70.
  • 15 Niimi S, Miyai M. Vocal fold vibration and voice quality. Folia Phoniatrica et Logopaedica 2000; 52: 32-8.
  • 16 Wittenberg T, Moser M, Tigges M, Eysholdt U. Recording, processing and analysis of digital highspeed sequences in glottography. Machine Vision and Applications 1995; 12: 399-404.
  • 17 Tigges M. Die Hochgeschwindigkeitsaufnahmetechnik des Larynx. Aachen, Germany: Shaker Verlag; 1999
  • 18 Hollien H, Colton R. Four laminagraphic studies of vocal fold thickness. Folia Phoniatrica 1969; 21: 179-98.
  • 19 Mergell P. Nonlinear Dynamics of Phonation – High Speed Glottography and Biomechanical Modelling of Vocal Fold Oscillations. Kommunikationsstörungen – Berichte aus Phoniatrie und Pädaudiologie, Aachen, Germany: Shaker Verlag; 1998
  • 20 Jiang JJ, Zhang Y, Stern J. Modelling of chaotic vibrations in symmetric vocal folds. J. Acoust. Soc. Am. 2001; 110: 2120-8.
  • 21 Moré J. The Levenberg-Marquardt Algorithm: Implementation and theory. In Lecture Notes in Mathematics 630 (Watson, ed.),. Numerical Anal.; 1977
  • 22 Murray W. Numerical Methods for Unconstrained Optimization. London: Academic Press; 1972
  • 23 Sulter Am, Wit HP. Glottal volume velocity waveform characteristics in subjects with and without vocal training, related to gender, sound intensity, fundamental frequency, and age. J. Acoust. Soc. Am. 1996; 100: 3360-73.
  • 24 Biever DM, Bless DM. Vibratory Characteristics of the Vocal Folds in Young Adult and Geriatric Women. Journal of Voice 1989; 3: 120-131.
  • 25 Linville SE. Glottal Gap Configurations in Two Age Groups of Women. Journal of Speech and Hearing Research 1992; 35: 1209-15.