Estimating large covariance matrix with network topology for high-dimensional biomedical data

Shuo Chen, Jian Kang, Yishi Xing, Yunpeng Zhao, Donald K. Milton

Research output: Contribution to journalArticlepeer-review

8 Scopus citations


Interactions between features of high-dimensional biomedical data often exhibit complex and organized, yet latent, network topological structures. Estimating the non-sparse large covariance matrix of these high-dimensional biomedical data while preserving and recognizing the latent network topology are challenging. A two step procedure is proposed that first detects latent network topological structures from the sample correlation matrix by implementing new penalized optimization and then regularizes the covariance matrix by leveraging the detected network topological information. The network topology guided regularization can reduce false positive and false negative rates simultaneously because it allows edges to borrow strengths from each other precisely. Empirical data examples demonstrate that organized latent network topological structures widely exist in high-dimensional biomedical data across platforms and identifying these network structures can effectively improve estimating covariance matrix and understanding interactive relationships between biomedical features.

Original languageEnglish (US)
Pages (from-to)82-95
Number of pages14
JournalComputational Statistics and Data Analysis
StatePublished - Nov 2018


  • Correlation matrix
  • Graph
  • Parsimony
  • Shrinkage
  • Thresholding

ASJC Scopus subject areas

  • Statistics and Probability
  • Computational Mathematics
  • Computational Theory and Mathematics
  • Applied Mathematics


Dive into the research topics of 'Estimating large covariance matrix with network topology for high-dimensional biomedical data'. Together they form a unique fingerprint.

Cite this