TY - JOUR
T1 - Assessing the role of geographic context in transportation mode detection from GPS data
AU - Roy, Avipsa
AU - Fuller, Daniel
AU - Nelson, Trisalyn
AU - Kedron, Peter
N1 - Funding Information:
The authors would like to thank INTERACT team for providing valuable feedback and supporting the work. The study is supported by a grant #IP2-1507071C from the Canadian Institutes of Health Research. This study was approved by the Memorial University Interdisciplinary Committee on Ethics in Human Research (20180188-EX).
Funding Information:
The authors would like to thank INTERACT team for providing valuable feedback and supporting the work. The study is supported by a grant # IP2-1507071C from the Canadian Institutes of Health Research . This study was approved by the Memorial University Interdisciplinary Committee on Ethics in Human Research ( 20180188-EX ).
Publisher Copyright:
© 2022 The Authors
PY - 2022/4
Y1 - 2022/4
N2 - The increasing availability of health monitoring devices and smartphones has created an opportunity for researchers to access high-resolution (spatial and temporal) mobility data for understanding travel behavior in cities. Although information from GPS data has been used in several studies to detect transportation modes, there is a research gap in understanding the role of geographic context in transportation mode detection. Integrating the geography in which mobility occurs, provides context clues that may allow models predicting transportation modes to be more generalizable. Our goals are first, to develop a data-driven modeling framework for transportation mode detection using GPS mobility data along with geographic context, and second, to assess how model accuracy and generalizability varies upon adding geographic context. To this extent we extracted features from raw GPS mobility data (speed, altitude, turning angle and net displacement) and integrated context in the form of geographic features to classify active (i.e. walk/bike), public (i.e. bus/train), and private (i.e. car) transportation modes in three different Canadian cities - Montreal, St. Johns, and Vancouver. To assess the role of integrating geographic context in mode detection, we adopted two different modeling approaches – generalized and context-specific, and compared results using random forests, extreme gradient boost, and multilayer perceptron classifiers. Our results indicate that for context-specific models the highest classification accuracy improved by 64% for Montreal, by 74% for St. John's and by 77% for Vancouver compared to the generalized model. We also found that the multilayer perceptron (96%) achieved the highest classification accuracy upon adding contextual variables compared to random forests (94.6%) and extreme gradient boost (93.3%) classifier. Our study highlights that adding contextual information specific to a city's geography can improve the predictive accuracy of transportation mode detection models, however, in case of limited knowledge about the geographic setting of a study area, a generalized model combining GPS data from several cities may still be useful for predicting modes from trip data.
AB - The increasing availability of health monitoring devices and smartphones has created an opportunity for researchers to access high-resolution (spatial and temporal) mobility data for understanding travel behavior in cities. Although information from GPS data has been used in several studies to detect transportation modes, there is a research gap in understanding the role of geographic context in transportation mode detection. Integrating the geography in which mobility occurs, provides context clues that may allow models predicting transportation modes to be more generalizable. Our goals are first, to develop a data-driven modeling framework for transportation mode detection using GPS mobility data along with geographic context, and second, to assess how model accuracy and generalizability varies upon adding geographic context. To this extent we extracted features from raw GPS mobility data (speed, altitude, turning angle and net displacement) and integrated context in the form of geographic features to classify active (i.e. walk/bike), public (i.e. bus/train), and private (i.e. car) transportation modes in three different Canadian cities - Montreal, St. Johns, and Vancouver. To assess the role of integrating geographic context in mode detection, we adopted two different modeling approaches – generalized and context-specific, and compared results using random forests, extreme gradient boost, and multilayer perceptron classifiers. Our results indicate that for context-specific models the highest classification accuracy improved by 64% for Montreal, by 74% for St. John's and by 77% for Vancouver compared to the generalized model. We also found that the multilayer perceptron (96%) achieved the highest classification accuracy upon adding contextual variables compared to random forests (94.6%) and extreme gradient boost (93.3%) classifier. Our study highlights that adding contextual information specific to a city's geography can improve the predictive accuracy of transportation mode detection models, however, in case of limited knowledge about the geographic setting of a study area, a generalized model combining GPS data from several cities may still be useful for predicting modes from trip data.
KW - GPS
KW - Geographic context
KW - Model generalizability
KW - Supervised learning
KW - Travel mode detection
UR - http://www.scopus.com/inward/record.url?scp=85127330739&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85127330739&partnerID=8YFLogxK
U2 - 10.1016/j.jtrangeo.2022.103330
DO - 10.1016/j.jtrangeo.2022.103330
M3 - Article
AN - SCOPUS:85127330739
SN - 0966-6923
VL - 100
JO - Journal of Transport Geography
JF - Journal of Transport Geography
M1 - 103330
ER -