Local Gaussian Process Extrapolation for BART Models with Applications to Causal Inference

Meijia Wang, Jingyu He, P. Richard Hahn

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Bayesian additive regression trees (BART) is a semi-parametric regression model offering state-of-the-art performance on out-of-sample prediction. Despite this success, standard implementations of BART typically suffer from inaccurate prediction and overly narrow prediction intervals at points outside the range of the training data. This article proposes a novel extrapolation strategy that grafts Gaussian processes to the leaf nodes in BART for predicting points outside the range of the observed data. The new method is compared to standard BART implementations and recent frequentist resampling-based methods for predictive inference. We apply the new approach to a challenging problem from causal inference, wherein for some regions of predictor space, only treated or untreated units are observed (but not both). In simulation studies, the new approach boasts superior performance compared to popular alternatives, such as Jackknife+. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)724-735
Number of pages12
JournalJournal of Computational and Graphical Statistics
Volume33
Issue number2
DOIs
StatePublished - 2024

Keywords

  • Extrapolation
  • Gaussian process
  • Predictive interval
  • Tree
  • XBART
  • XBCF

ASJC Scopus subject areas

  • Statistics and Probability
  • Discrete Mathematics and Combinatorics
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Local Gaussian Process Extrapolation for BART Models with Applications to Causal Inference'. Together they form a unique fingerprint.

Cite this