Untangle the structural and random zeros in statistical modelings

W. Tang, H. He, W. J. Wang, D. G. Chen

Research output: Contribution to journalArticlepeer-review

11 Scopus citations

Abstract

Count data with structural zeros are common in public health applications. There are considerable researches focusing on zero-inflated models such as zero-inflated Poisson (ZIP) and zero-inflated Negative Binomial (ZINB) models for such zero-inflated count data when used as response variable. However, when such variables are used as predictors, the difference between structural and random zeros is often ignored and may result in biased estimates. One remedy is to include an indicator of the structural zero in the model as a predictor if observed. However, structural zeros are often not observed in practice, in which case no statistical method is available to address the bias issue. This paper is aimed to fill this methodological gap by developing parametric methods to model zero-inflated count data when used as predictors based on the maximum likelihood approach. The response variable can be any type of data including continuous, binary, count or even zero-inflated count responses. Simulation studies are performed to assess the numerical performance of this new approach when sample size is small to moderate. A real data example is also used to demonstrate the application of this method.

Original languageEnglish (US)
Pages (from-to)1714-1733
Number of pages20
JournalJournal of Applied Statistics
Volume45
Issue number9
DOIs
StatePublished - Jul 4 2018
Externally publishedYes

Keywords

  • Generalized linear models
  • maximum likelihood
  • structural zeros
  • zero-inflated Poisson
  • zero-inflated explanatory variables

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Untangle the structural and random zeros in statistical modelings'. Together they form a unique fingerprint.

Cite this