Use of Emotional and Neutral Speech in Evaluating Compression Speeds

Petri Korhonen; Francis Kuk; Christopher Slugocki; Neal Davis-Ruperto

doi:10.1055/s-0041-1722945

J Am Acad Audiol 2021; 32(04): 268-274
DOI: 10.1055/s-0041-1722945

Research Article

Authors

Petri Korhonen¹, Francis Kuk¹, Christopher Slugocki¹, Neal Davis-Ruperto¹

¹Office of Research in Clinical Amplification (ORCA-USA), WS Audiology, Lisle, Illinois

Abstract

Background Emotional speech differs from neutral speech in its envelope characteristics. Use of emotional speech materials may be more sensitive for evaluating signal processing algorithms that affect the temporal envelope.

Purpose Subjective listener preference was compared between two amplitude compression algorithms, variable speed compression (VSC) and fast-acting compression (FAC), using neutral and emotional speech.

Research Design The study used a single-blinded, repeated-measures design.

Study Sample Twenty hearing-impaired (HI) listeners with a bilaterally symmetrical, mild-to-moderately severe sensorineural hearing loss and 21 listeners with normal hearing (NH) participated.

Intervention Speech was processed using FAC and VSC algorithms.

Data Collection and Analysis A paired-comparison paradigm assessed subjective preference for FAC versus VSC using emotional and neutral speech materials. The significance of subjective preference for compression algorithm (FAC or VSC) was evaluated using a linear mixed effects model at each combination of stimulus type (emotional or neutral speech) and hearing group (NH or HI).
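As a rough illustration of how paired-comparison preferences can be summarized, the sketch below tallies how often one algorithm is chosen and checks that proportion against chance with an exact two-sided binomial test. This is a deliberately simpler stand-in, not the linear mixed effects model used in the study: it ignores repeated measures within listeners. The counts and the helper name `binomial_two_sided_p` are hypothetical and are not taken from the article.

```python
from math import comb


def binomial_two_sided_p(k: int, n: int, p: float = 0.5) -> float:
    """Exact two-sided binomial p-value.

    Sums the probabilities of all outcomes that are no more likely
    than the observed count k under the null hypothesis.
    """
    probs = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    observed = probs[k]
    # Small tolerance so ties with the observed probability are included.
    total = sum(q for q in probs if q <= observed * (1 + 1e-9))
    return min(1.0, total)


# Hypothetical tallies (NOT data from the study): number of paired
# comparisons in which VSC was preferred over FAC for one condition.
vsc_preferred = 15
total_trials = 20

proportion = vsc_preferred / total_trials
p_value = binomial_two_sided_p(vsc_preferred, total_trials)

print(f"VSC preferred on {proportion:.0%} of trials, p = {p_value:.4f}")
```

A mixed-effects analysis would additionally model each listener as a random effect, which matters when every listener contributes several comparisons.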

Results HI listeners showed a preference for VSC over FAC when listening to emotional speech. The same listeners showed a nonsignificant preference for VSC over FAC when listening to neutral speech. NH listeners showed a preference for VSC over FAC for both neutral and emotional speech materials.

Conclusion These results suggest that the subjective sound quality of emotional speech is more susceptible than that of neutral speech to changes in the signal introduced by FAC. Clinicians should consider including emotional speech materials when evaluating listener preference for different compression speeds in the clinic.

Keywords

hearing aids - compression - emotions - sound quality

Note

The data from this manuscript were presented during the 47th Annual Scientific and Technology Conference of the American Auditory Society in Scottsdale, AZ, March 5-7, 2020.

Publication History

Received: 24 May 2020

Accepted: 19 October 2020

Article published online: 25 May 2021

Thieme Medical Publishers, Inc.
333 Seventh Avenue, 18th Floor, New York, NY 10001, USA
