TY - GEN
T1 - Removing data with noisy responses in regression analysis
AU - Wisler, Alan
AU - Berisha, Visar
AU - Ramamurthy, Karthikeyan
AU - Spanias, Andreas
AU - Liss, Julie
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/8/4
Y1 - 2015/8/4
N2 - In regression analysis, outliers in the data can induce a bias in the learned function, resulting in larger errors. In this paper we derive an empirically estimable bound on the regression error based on a Euclidean minimum spanning tree generated from the data. Using this bound as motivation, we propose an iterative approach to remove data with noisy responses from the training set. We evaluate the performance of the algorithm on experiments with real-world pathological speech (speech from individuals with neurogenic disorders). Comparative results show that removing noisy examples during training using the proposed approach yields better predictive performance on out-of-sample data.
AB - In regression analysis, outliers in the data can induce a bias in the learned function, resulting in larger errors. In this paper we derive an empirically estimable bound on the regression error based on a Euclidean minimum spanning tree generated from the data. Using this bound as motivation, we propose an iterative approach to remove data with noisy responses from the training set. We evaluate the performance of the algorithm on experiments with real-world pathological speech (speech from individuals with neurogenic disorders). Comparative results show that removing noisy examples during training using the proposed approach yields better predictive performance on out-of-sample data.
KW - Friedman-Rafsky statistic
KW - minimum spanning tree
KW - noisy data
KW - outlier removal
KW - robust regression
UR - http://www.scopus.com/inward/record.url?scp=84946046733&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84946046733&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2015.7178334
DO - 10.1109/ICASSP.2015.7178334
M3 - Conference contribution
AN - SCOPUS:84946046733
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 2066
EP - 2070
BT - 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015
Y2 - 19 April 2014 through 24 April 2014
ER -