TY - JOUR
T1 - Forming clusters of multiple sounds improves talker identification in an auditory scene
AU - Yost, William A.
AU - Torben Pastore, M.
AU - Robinson, Phillip
N1 - Funding Information:
Funding for this research came from grants from NIH (NIDCD: R01DC015214 to WAY and F32DC016808 to MTP) and Facebook Reality Laboratories (WAY & MTP). We appreciate the assistance of Kathryn Pulling, Cassie Troksa, and the advice of Dr. Yi Zhou.
Publisher Copyright:
© 2021 Acoustical Society of America.
PY - 2020
Y1 - 2020
N2 - In several recent papers we have shown that listeners perceive differences in the size of a small auditory scene (< 4 sources) for short-duration sounds (e.g., consonant-vowel, CV, pairs) presented at about the same time, but such differences are not perceived for larger scenes. Spatial separation of a small number of sound sources affects performance much more than spatially separating a larger number of sound sources. In this presentation, the number of sound sources is small, but not the number of sounds (CVs). For example, listeners are poor at determining if a target talker is the same as a cue talker, when the target is presented at one sound-source location and several distractor talkers are presented each at different sound-source locations. If all distractor talkers are “clustered (mixed)” at a single source spatially separated from the target sound source, then listeners are better at determining if the target talker was the same as the cue. Several additional scenarios were tested, all indicating that “clustering” sounds into a small number of sound sources improves target-talker identification compared to when sounds are presented from a larger number of sources.
AB - In several recent papers we have shown that listeners perceive differences in the size of a small auditory scene (< 4 sources) for short-duration sounds (e.g., consonant-vowel, CV, pairs) presented at about the same time, but such differences are not perceived for larger scenes. Spatial separation of a small number of sound sources affects performance much more than spatially separating a larger number of sound sources. In this presentation, the number of sound sources is small, but not the number of sounds (CVs). For example, listeners are poor at determining if a target talker is the same as a cue talker, when the target is presented at one sound-source location and several distractor talkers are presented each at different sound-source locations. If all distractor talkers are “clustered (mixed)” at a single source spatially separated from the target sound source, then listeners are better at determining if the target talker was the same as the cue. Several additional scenarios were tested, all indicating that “clustering” sounds into a small number of sound sources improves target-talker identification compared to when sounds are presented from a larger number of sources.
UR - http://www.scopus.com/inward/record.url?scp=85106189754&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85106189754&partnerID=8YFLogxK
U2 - 10.1121/2.0001386
DO - 10.1121/2.0001386
M3 - Conference article
AN - SCOPUS:85106189754
SN - 1939-800X
VL - 42
JO - Proceedings of Meetings on Acoustics
JF - Proceedings of Meetings on Acoustics
IS - 1
M1 - 050007
T2 - 179th Meeting of the Acoustical Society of America, ASA 2020
Y2 - 7 December 2020 through 11 December 2020
ER -