TY - GEN
T1 - A least squares formulation for canonical correlation analysis
AU - Sun, Liang
AU - Ji, Shuiwang
AU - Ye, Jieping
PY - 2008
Y1 - 2008
N2 - Canonical Correlation Analysis (CCA) is a well-known technique for finding the correlations between two sets of multi-dimensional variables. It projects both sets of variables into a lower-dimensional space in which they are maximally correlated. CCA is commonly applied for supervised dimensionality reduction, in which one of the multi-dimensional variables is derived from the class label. It has been shown that CCA can be formulated as a least squares problem in the binary-class case. However, their relationship in the more general setting remains unclear. In this paper, we show that, under a mild condition which tends to hold for high-dimensional data, CCA in multi-label classifications can be formulated as a least squares problem. Based on this equivalence relationship, we propose several CCA extensions including sparse CCA using 1-norm regularization. Experiments on multi-label data sets confirm the established equivalence relationship. Results also demonstrate the effectiveness of the proposed CCA extensions.
AB - Canonical Correlation Analysis (CCA) is a well-known technique for finding the correlations between two sets of multi-dimensional variables. It projects both sets of variables into a lower-dimensional space in which they are maximally correlated. CCA is commonly applied for supervised dimensionality reduction, in which one of the multi-dimensional variables is derived from the class label. It has been shown that CCA can be formulated as a least squares problem in the binary-class case. However, their relationship in the more general setting remains unclear. In this paper, we show that, under a mild condition which tends to hold for high-dimensional data, CCA in multi-label classifications can be formulated as a least squares problem. Based on this equivalence relationship, we propose several CCA extensions including sparse CCA using 1-norm regularization. Experiments on multi-label data sets confirm the established equivalence relationship. Results also demonstrate the effectiveness of the proposed CCA extensions.
UR - http://www.scopus.com/inward/record.url?scp=56449106936&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=56449106936&partnerID=8YFLogxK
U2 - 10.1145/1390156.1390285
DO - 10.1145/1390156.1390285
M3 - Conference contribution
AN - SCOPUS:56449106936
SN - 9781605582054
T3 - Proceedings of the 25th International Conference on Machine Learning
SP - 1024
EP - 1031
BT - Proceedings of the 25th International Conference on Machine Learning
PB - Association for Computing Machinery (ACM)
T2 - 25th International Conference on Machine Learning
Y2 - 5 July 2008 through 9 July 2008
ER -