TY - JOUR
T1 - Computational graph-based framework for integrating econometric models and machine learning algorithms in emerging data-driven analytical environments
AU - Kim, Taehooie
AU - Zhou, Xuesong
AU - Pendyala, Ram M.
N1 - Publisher Copyright:
© 2021 Hong Kong Society for Transportation Studies Limited.
PY - 2022
Y1 - 2022
N2 - In an era of big data and emergence of disrupting mobility technologies, statistical models have been utilized to uncover the influence of significant factors, and machine learning algorithms have been used to explore complex patterns in large datasets. Focusing on discrete choice modeling applications, this research aims to introduce computational graph (CG)-based frameworks for integrating the strengths of econometric models and machine learning algorithms. Specifically, multinomial logit (MNL), nested logit (NL), and integrated choice and latent variable (ICLV) models are selected to demonstrate the performance of the graph-oriented functional representation. Furthermore, the calculation of gradients in the log-likelihood function is accomplished using automatic differentiation (AD). Using the 2017 National Household Travel Survey data and synthetic datasets, we compare estimation results from the proposed methods with those obtained from Biogeme and Apollo. The results indicate that the CG-based choice modeling approach can produce consistent estimates of parameters with substantial computational efficiency.
AB - In an era of big data and emergence of disrupting mobility technologies, statistical models have been utilized to uncover the influence of significant factors, and machine learning algorithms have been used to explore complex patterns in large datasets. Focusing on discrete choice modeling applications, this research aims to introduce computational graph (CG)-based frameworks for integrating the strengths of econometric models and machine learning algorithms. Specifically, multinomial logit (MNL), nested logit (NL), and integrated choice and latent variable (ICLV) models are selected to demonstrate the performance of the graph-oriented functional representation. Furthermore, the calculation of gradients in the log-likelihood function is accomplished using automatic differentiation (AD). Using the 2017 National Household Travel Survey data and synthetic datasets, we compare estimation results from the proposed methods with those obtained from Biogeme and Apollo. The results indicate that the CG-based choice modeling approach can produce consistent estimates of parameters with substantial computational efficiency.
KW - Computational graphs (CGs)
KW - and gradient calculation
KW - automatic differentiation (AD)
KW - integrated choice and latent variable (ICLV)
KW - multinomial logit (MNL)
KW - nested logit (NL)
UR - http://www.scopus.com/inward/record.url?scp=85107847492&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85107847492&partnerID=8YFLogxK
U2 - 10.1080/23249935.2021.1938744
DO - 10.1080/23249935.2021.1938744
M3 - Article
AN - SCOPUS:85107847492
SN - 2324-9935
VL - 18
SP - 1346
EP - 1375
JO - Transportmetrica A: Transport Science
JF - Transportmetrica A: Transport Science
IS - 3
ER -