TY - GEN
T1 - Noise robust dysarthric speech classification using domain adaptation
AU - Wisler, Alan
AU - Berisha, Visar
AU - Spanias, Andreas
AU - Liss, Julie
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/9/22
Y1 - 2016/9/22
N2 - This paper will investigate viability of a screening application that could be used to identify individuals with Dysarthria from among a larger population using sentence-level speech data. This task presents a number of challenged particularly if we aim to identify the disorder in the earlier stages before the more significant symptoms have begun to manifest themselves. A principal challenge in this task is acheiving robustness to the large number of confounding variables such as gender, age, accent, speaking style, and recording conditions. All of these variables will affect an individuals speech in a manner unrelated to the disorder, and identifying what information is relevant to the disorder amongst these confounding variables given the limited amount of data that is available in this regime presents a major engineering challenge. In this paper we will focus on achieving robustness to different types and levels of noise by employing a feature selection algorithm that attempts to minimize a non-parametric upper bound of the error in the noisy condition. This is a crucial problem to solve as the clean recording conditions used in data collection are typically a poor reflection of the type of data that will be encountered upon deployment.
AB - This paper will investigate viability of a screening application that could be used to identify individuals with Dysarthria from among a larger population using sentence-level speech data. This task presents a number of challenged particularly if we aim to identify the disorder in the earlier stages before the more significant symptoms have begun to manifest themselves. A principal challenge in this task is acheiving robustness to the large number of confounding variables such as gender, age, accent, speaking style, and recording conditions. All of these variables will affect an individuals speech in a manner unrelated to the disorder, and identifying what information is relevant to the disorder amongst these confounding variables given the limited amount of data that is available in this regime presents a major engineering challenge. In this paper we will focus on achieving robustness to different types and levels of noise by employing a feature selection algorithm that attempts to minimize a non-parametric upper bound of the error in the noisy condition. This is a crucial problem to solve as the clean recording conditions used in data collection are typically a poor reflection of the type of data that will be encountered upon deployment.
UR - http://www.scopus.com/inward/record.url?scp=84991781143&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84991781143&partnerID=8YFLogxK
U2 - 10.1109/DMIAF.2016.7574918
DO - 10.1109/DMIAF.2016.7574918
M3 - Conference contribution
AN - SCOPUS:84991781143
T3 - 2016 Digital Media Industry and Academic Forum, DMIAF 2016 - Proceedings
SP - 135
EP - 138
BT - 2016 Digital Media Industry and Academic Forum, DMIAF 2016 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 Digital Media Industry and Academic Forum, DMIAF 2016
Y2 - 4 July 2016 through 6 July 2016
ER -