TY - GEN
T1 - An automated framework for explaining facts extracted from mobility datasets
AU - Tahir, Anique
AU - Sun, Yuhan
AU - Sarwat, Mohamed
N1 - Funding Information:
VIII. ACKNOWLEDGEMENT This work is supported in part by the National Science Foundation (NSF) under Grant 1845789, the Salt River Project Agricultural Improvement and Power District (SRP), and the DOD-ARMY Training and Doctrine Command (TRADOC).
Publisher Copyright:
© 2019 IEEE.
PY - 2019/6
Y1 - 2019/6
N2 - When a data scientist analyzes mobility data (e.g., using a data visualization tool), she may find out some interesting facts in the dataset. An example of a fact can be: 'The number of Taxi trips in NYC on January 23, 2016, dropped drastically as compared to other days of the same month'. However, the data scientist may be left clueless if they cannot find a crisp explanation to such a fact. Furthermore, the tedious task of finding an explanation by manually scraping the data becomes even impossible with big data. Existing techniques are designed for non-spatial data which cannot be applied to spatial data because it does not consider the spatial proximity. In this paper, we propose an automatic framework which guides the data scientist to explain the fact discovered from mobility data. Our approach expands on the aggravation and intervention techniques while using spatial partitioning/clustering to improve explanations for spatial data. Experiments show that the proposed approach outperforms the state-of-The-Art approaches in finding the explanation for facts extracted from NYC taxi real mobility dataset.
AB - When a data scientist analyzes mobility data (e.g., using a data visualization tool), she may find out some interesting facts in the dataset. An example of a fact can be: 'The number of Taxi trips in NYC on January 23, 2016, dropped drastically as compared to other days of the same month'. However, the data scientist may be left clueless if they cannot find a crisp explanation to such a fact. Furthermore, the tedious task of finding an explanation by manually scraping the data becomes even impossible with big data. Existing techniques are designed for non-spatial data which cannot be applied to spatial data because it does not consider the spatial proximity. In this paper, we propose an automatic framework which guides the data scientist to explain the fact discovered from mobility data. Our approach expands on the aggravation and intervention techniques while using spatial partitioning/clustering to improve explanations for spatial data. Experiments show that the proposed approach outperforms the state-of-The-Art approaches in finding the explanation for facts extracted from NYC taxi real mobility dataset.
KW - Database
KW - Mobile spatial data
KW - Spatial explanation
UR - http://www.scopus.com/inward/record.url?scp=85071001797&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85071001797&partnerID=8YFLogxK
U2 - 10.1109/MDM.2019.00-48
DO - 10.1109/MDM.2019.00-48
M3 - Conference contribution
AN - SCOPUS:85071001797
T3 - Proceedings - IEEE International Conference on Mobile Data Management
SP - 269
EP - 278
BT - Proceedings - 2019 20th International Conference on Mobile Data Management, MDM 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 20th International Conference on Mobile Data Management, MDM 2019
Y2 - 10 June 2019 through 13 June 2019
ER -