A direct approach to data fusion

Zvi Gilula, Robert E. Mcculloch, Peter E. Rossi

Research output: Contribution to journalArticlepeer-review

40 Scopus citations


The generic data fusion problem is to make inferences about the joint distribution of two sets of variables without any direct observations of the joint distribution. Instead, information is available only for each set separately along with some other set of common variables. The standard approach to data fusion creates a fused data set with the variables of interest and the common variables. This article develops an approach that directly estimates the joint distribution of just the variables of interest. For the case of either discrete or continuous variables, the approach yields a solution that can be implemented with standard statistical models and software. In typical marketing applications, the common variables are psychographic or demographic variables, and the variables to be fused involve media viewing and product purchase. For this example, the approach directly estimates the joint distribution of media viewing and product purchase without including the common variables. This is the object required for marketing decisions. In marketing applications, fusion of discrete variables is required. The authors develop a method for relaxing the assumption of conditional independence for this case. They illustrate their approach with product-purchase and media-viewing data from a large survey of British consumers.

Original languageEnglish (US)
Pages (from-to)73-83
Number of pages11
JournalJournal of Marketing Research
Issue number1
StatePublished - Feb 2006
Externally publishedYes

ASJC Scopus subject areas

  • Business and International Management
  • Economics and Econometrics
  • Marketing


Dive into the research topics of 'A direct approach to data fusion'. Together they form a unique fingerprint.

Cite this