Missing data in bioarchaeology II: A test of ordinal and continuous data imputation

Amanda Wissler, Kelly E. Blevins, Jane E. Buikstra

Research output: Contribution to journal, Article, peer-review

7 Scopus citations

Abstract

OBJECTIVES: Previous research has shown that while missing data are common in bioarchaeological studies, they are seldom handled using statistically rigorous methods. The primary objective of this article is to evaluate the ability of imputation to manage missing data and to encourage the use of advanced statistical methods in bioarchaeology and paleopathology. An overview of missing data management in biological anthropology is provided, followed by a test of imputation and deletion methods for handling missing data.

MATERIALS AND METHODS: Missing data were simulated on complete datasets of ordinal (n = 287) and continuous (n = 369) bioarchaeological data. Missing values were imputed using five imputation methods (mean, predictive mean matching, random forest, expectation maximization, and stochastic regression), and the success of each at recovering the parameters of the original dataset was compared with that of pairwise and listwise deletion.

RESULTS: In all instances, listwise deletion was least successful at approximating the original parameters. Imputation of continuous data was more effective than imputation of ordinal data. Overall, no one method performed best, and the amount of missing data proved a stronger predictor of imputation success than the choice of method.

DISCUSSION: These findings support the use of imputation methods over deletion for handling missing bioarchaeological and paleopathological data, especially when the data are continuous. Whereas deletion methods reduce sample size, imputation maintains sample size, improving statistical power and preventing bias from being introduced into the dataset.
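The comparison described in the methods can be illustrated with a minimal sketch. This is not the authors' code or data: the dataset is synthetic (369 rows to echo the continuous sample size, with an assumed four variables), the missingness is assumed to be completely at random at an assumed rate of 20%, and the helper names `simulate_mcar`, `listwise_delete`, `mean_impute`, and `column_means` are hypothetical. Only mean imputation and listwise deletion are shown; the other four imputation methods would need a dedicated statistics library.

```python
import random
import statistics


def simulate_mcar(rows, proportion, rng):
    """Copy rows, replacing each value with None with probability `proportion`."""
    return [[None if rng.random() < proportion else v for v in row] for row in rows]


def listwise_delete(rows):
    """Drop every row that has at least one missing value."""
    return [row for row in rows if all(v is not None for v in row)]


def mean_impute(rows):
    """Fill each missing value with the mean of the observed values in its column."""
    n_cols = len(rows[0])
    col_means = [
        statistics.fmean(row[j] for row in rows if row[j] is not None)
        for j in range(n_cols)
    ]
    return [[col_means[j] if v is None else v for j, v in enumerate(row)] for row in rows]


def column_means(rows):
    """Mean of each column in a fully observed table."""
    return [statistics.fmean(col) for col in zip(*rows)]


# Synthetic "complete" dataset (assumption: four continuous variables).
rng = random.Random(0)
complete = [[rng.gauss(50.0, 10.0) for _ in range(4)] for _ in range(369)]

# Remove roughly 20% of values at random, then handle the gaps two ways.
incomplete = simulate_mcar(complete, 0.2, rng)
deleted = listwise_delete(incomplete)
imputed = mean_impute(incomplete)

print("rows after listwise deletion:", len(deleted))
print("rows after mean imputation:", len(imputed))
print("original column means:", column_means(complete))
print("means after deletion:", column_means(deleted))
print("means after imputation:", column_means(imputed))
```

Running the sketch shows the sample-size contrast the abstract describes: listwise deletion discards every row with any gap, while imputation keeps all 369 rows. Mean imputation reproduces the observed column means exactly but shrinks the variance, which is one reason the study also tests stochastic and model-based methods.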

Original language: English (US)
Pages (from-to): 349-364
Number of pages: 16
Journal: American Journal of Biological Anthropology
Volume: 179
Issue number: 3
State: Published - Nov 1 2022
Externally published: Yes

Keywords

  • bioarchaeology
  • imputation
  • missing data
  • paleopathology

ASJC Scopus subject areas

  • Anthropology
  • Genetics
  • Epidemiology
  • Anatomy
  • Archaeology
  • Palaeontology
