Three methods for optimization of cross-laboratory and cross-platform microarray expression data

Phillip Stafford, Marcel Brun

Research output: Contribution to journalArticlepeer-review

46 Scopus citations


Microarray gene expression data becomes more valuable as our confidence in the results grows. Guaranteeing data quality becomes increasingly important as microarrays are being used to diagnose and treat patients (1-4). The MAQC Quality Control Consortium, the FDA's Critical Path Initiative, NCI's caBIG and others are implementing procedures that will broadly enhance data quality. As GEO continues to grow, its usefulness is constrained by the level of correlation across experiments and general applicability. Although RNA preparation and array platform play important roles in data accuracy, pre-processing is a user-selected factor that has an enormous effect. Normalization of expression data is necessary, but the methods have specific and pronounced effects on precision, accuracy and historical correlation. As a case study, we present a microarray calibration process using normalization as the adjustable parameter. We examine the impact of eight normalizations across both Agilent and Affymetrix expression platforms on three expression readouts: (1) sensitivity and power, (2) functional/ biological interpretation and (3) feature selection and classification error. The reader is encouraged to measure their own discordant data, whether cross-laboratory, cross-platform or across any other variance source, and to use their results to tune the adjustable parameters of their laboratory to ensure increased correlation.

Original languageEnglish (US)
Article numbere72
JournalNucleic acids research
Issue number10
StatePublished - May 2007

ASJC Scopus subject areas

  • Genetics


Dive into the research topics of 'Three methods for optimization of cross-laboratory and cross-platform microarray expression data'. Together they form a unique fingerprint.

Cite this