Visual speech benefit in clear and degraded speech depends on the auditory intelligibility of the talker and the number of background talkers

Blackburn, CL; Kitterick, PT; Jones, G; Sumner, CJ; Stacey, PC


Blackburn, CL ORCID: https://orcid.org/0000-0003-0805-1059, Kitterick, PT, Jones, G ORCID: https://orcid.org/0000-0003-3867-9947, Sumner, CJ ORCID: https://orcid.org/0000-0002-2573-7418 and Stacey, PC ORCID: https://orcid.org/0000-0002-6018-8979, 2019. Visual speech benefit in clear and degraded speech depends on the auditory intelligibility of the talker and the number of background talkers. Trends in Hearing, 23. ISSN 2331-2165

Text: 13652_Stacey.pdf - Published version (1MB)

Official URL: http://doi.org/10.1177/2331216519837866

Abstract

Perceiving speech in background noise presents a significant challenge to listeners. Intelligibility can be improved by seeing the face of a talker. This is of particular value to hearing impaired people and users of cochlear implants. It is well known that auditory-only speech understanding depends on factors beyond audibility. How these factors impact on the audio-visual integration of speech is poorly understood. We investigated audio-visual integration when either the interfering background speech (Experiment 1) or intelligibility of the target talkers (Experiment 2) was manipulated. Clear speech was also contrasted with sine-wave vocoded speech to mimic the loss of temporal fine structure with a cochlear implant. Experiment 1 showed that for clear speech, the visual speech benefit was unaffected by the number of background talkers. For vocoded speech, a larger benefit was found when there was only one background talker. Experiment 2 showed that visual speech benefit depended upon the audio intelligibility of the talker and increased as intelligibility decreased. Degrading the speech by vocoding resulted in even greater benefit from visual speech information. A single “independent noise” signal detection theory model predicted the overall visual speech benefit in some conditions but could not predict the different levels of benefit across variations in the background or target talkers. This suggests that, similar to audio-only speech intelligibility, the integration of audio-visual speech cues may be functionally dependent on factors other than audibility and task difficulty, and that clinicians and researchers should carefully consider the characteristics of their stimuli when assessing audio-visual integration.
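The "independent noise" signal detection theory model mentioned in the abstract can be illustrated with a minimal sketch. It assumes the textbook form of such a model, in which the auditory and visual channels carry independent noise and the predicted audio-visual sensitivity is the root sum of squares of the unimodal sensitivities. The abstract does not give the exact formulation used in the paper, so the function names, the 2AFC-style mapping between proportion correct and d', and the clamping values below are illustrative assumptions rather than the authors' method.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def pc_to_dprime(pc: float) -> float:
    """Map proportion correct to d'.

    Assumption: a 2AFC-style mapping pc = Phi(d' / sqrt(2)). Open-set
    speech scores would need a different mapping; this is illustrative.
    """
    pc = min(max(pc, 0.01), 0.99)  # avoid infinite z-scores at 0 or 1
    # Clamp at zero so below-chance scores do not become positive when squared.
    return max(0.0, sqrt(2) * _STD_NORMAL.inv_cdf(pc))


def dprime_to_pc(dprime: float) -> float:
    """Inverse of pc_to_dprime under the same assumed mapping."""
    return _STD_NORMAL.cdf(dprime / sqrt(2))


def predict_av_dprime(d_audio: float, d_visual: float) -> float:
    """Independent-noise prediction: d'_AV = sqrt(d'_A^2 + d'_V^2)."""
    return sqrt(d_audio ** 2 + d_visual ** 2)


def predicted_visual_benefit(pc_audio: float, pc_visual: float) -> float:
    """Predicted audio-visual score minus the audio-only score."""
    d_av = predict_av_dprime(pc_to_dprime(pc_audio), pc_to_dprime(pc_visual))
    return dprime_to_pc(d_av) - pc_audio


if __name__ == "__main__":
    # Hypothetical scores, not data from the study.
    print(predicted_visual_benefit(pc_audio=0.60, pc_visual=0.70))
```

Under these assumptions the predicted benefit depends only on the unimodal sensitivities. That is one way to read the abstract's finding: a single model of this kind cannot produce different levels of benefit for different background or target talkers unless those sensitivities themselves differ.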

Item Type:	Journal article
Publication Title:	Trends in Hearing
Creators:	Blackburn, C.L., Kitterick, P.T., Jones, G., Sumner, C.J. and Stacey, P.C.
Publisher:	Sage
Date:	26 March 2019
Volume:	23
ISSN:	2331-2165
Identifiers:	DOI: 10.1177/2331216519837866
Rights:	Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
Divisions:	Schools > School of Social Sciences
Record created by:	Jonathan Gallacher
Date Added:	01 Apr 2019 09:45
Last Modified:	01 Apr 2019 09:45
URI:	https://irep.ntu.ac.uk/id/eprint/36151