Abstract
This study explores advanced techniques in machine learning to develop a short tree-based adaptive classification test based on an existing lengthy instrument. A case study was carried out for an assessment of risk for juvenile delinquency. Two unique facts of this case are (a) the items in the original instrument measure a large number of distinctive constructs; (b) the target outcomes are of low prevalence, which renders imbalanced training data. Due to the high dimensionality of the items, traditional item response theory (IRT)-based adaptive testing approaches may not work well, whereas decision trees, which are developed in the machine learning discipline, present as a promising alternative solution for adaptive tests. A cross-validation study was carried out to compare eight tree-based adaptive test constructions with five benchmark methods using data from a sample of 3,975 subjects. The findings reveal that the best-performing tree-based adaptive tests yielded better classification accuracy than the benchmark method IRT scoring with optimal cutpoints, and yielded comparable or better classification accuracy than the best benchmark method, random forest with balanced sampling. The competitive classification accuracy of the tree-based adaptive tests also come with an over 30-fold reduction in the length of the instrument, only administering between 3 to 6 items to any individual. This study suggests that tree-based adaptive tests have an enormous potential when used to shorten instruments that measure a large variety of constructs.
Original language | English (US) |
---|---|
Pages (from-to) | 499-514 |
Number of pages | 16 |
Journal | Applied Psychological Measurement |
Volume | 44 |
Issue number | 7-8 |
DOIs | |
State | Published - Oct 1 2020 |
Keywords
- adaptive test
- classification tree
- machine learning
ASJC Scopus subject areas
- Social Sciences (miscellaneous)
- Psychology (miscellaneous)
Fingerprint
Dive into the research topics of 'Using Machine Learning Methods to Develop a Short Tree-Based Adaptive Classification Test: Case Study With a High-Dimensional Item Pool and Imbalanced Data'. Together they form a unique fingerprint.Datasets
-
Honduras_tree_APM_online_supplement_V4_for_publisher – Supplemental material for Using Machine Learning Methods to Develop a Short Tree-Based Adaptive Classification Test: Case Study With a High-Dimensional Item Pool and Imbalanced Data
Zheng, Y. (Contributor), Cheon, H. (Contributor) & Katz, C. (Contributor), figshare SAGE Publications, Jan 1 2020
DOI: 10.25384/sage.12514493.v1, https://doi.org/10.25384%2Fsage.12514493.v1
Dataset
-
Using Machine Learning Methods to Develop a Short Tree-Based Adaptive Classification Test: Case Study With a High-Dimensional Item Pool and Imbalanced Data
Zheng, Y. (Creator), Cheon, H. (Contributor) & Katz, C. (Creator), SAGE Journals, 2020
DOI: 10.25384/sage.c.5029571, https://sage.figshare.com/collections/Using_Machine_Learning_Methods_to_Develop_a_Short_Tree-Based_Adaptive_Classification_Test_Case_Study_With_a_High-Dimensional_Item_Pool_and_Imbalanced_Data/5029571
Dataset
-
Using Machine Learning Methods to Develop a Short Tree-Based Adaptive Classification Test: Case Study With a High-Dimensional Item Pool and Imbalanced Data
Zheng, Y. (Creator), Cheon, H. (Contributor) & Katz, C. (Creator), figshare SAGE Publications, 2020
DOI: 10.25384/sage.c.5029571.v1, https://sage.figshare.com/collections/Using_Machine_Learning_Methods_to_Develop_a_Short_Tree-Based_Adaptive_Classification_Test_Case_Study_With_a_High-Dimensional_Item_Pool_and_Imbalanced_Data/5029571/1
Dataset