TY - GEN
T1 - Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities
T2 - 6th IEEE International Congress on Big Data, BigData Congress 2017
AU - Prasad, Sushil K.
AU - Aghajarian, Danial
AU - McDermott, Michael
AU - Shah, Dhara
AU - Mokbel, Mohamed
AU - Puri, Satish
AU - Rey, Sergio J.
AU - Shekhar, Shashi
AU - Xe, Yiqun
AU - Vatsavai, Ranga Raju
AU - Wang, Fusheng
AU - Liang, Yanhui
AU - Vo, Hoang
AU - Wang, Shaowen
N1 - Funding Information:
Prasad’s group research is partially supported by NSF grant 1205650. Rey’s group research is partially supported by NSF grant 1421935. Fusheng Wang’s group research is partially supported by NSF grants ACI 1443054 and IIS 1350885. Shekhar’s group research is partially supported by NSF grant IIS-1320580 and USDOD grant HM0210-13-1-0005.
Publisher Copyright:
© 2017 IEEE.
Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2017/9/7
Y1 - 2017/9/7
N2 - This vision paper reviews the current state-ofart and lays out emerging research challenges in parallel processing of spatial-temporal large datasets relevant to a variety of scientific communities. The spatio-temporal data, whether captured through remote sensors (global earth observations), ground and ocean sensors (e.g., soil moisture sensors, buoys), social media and hand-held, traffic-related sensors and cameras, medical imaging (e.g., MRI), or large scale simulations (e.g., climate) have always been 'big.' A common thread among all these big collections of datasets is that they are spatial and temporal. Processing and analyzing these datasets requires high-performance computing (HPC) infrastructures. Various agencies, scientific communities and increasingly the society at large rely on spatial data management, analysis, and spatial data mining to gain insights and produce actionable plans. Therefore, an ecosystem of integrated and reliable software infrastructure is required for spatialtemporal big data management and analysis that will serve as crucial tools for solving a wide set of research problems from different scientific and engineering areas and to empower users with next-generation tools. This vision requires a multidisciplinary effort to significantly advance domain research and have a broad impact on the society. The areas of research discussed in this paper include (i) spatial data mining, (ii) data analytics over remote sensing data, (iii) processing medical images, (iv) spatial econometrics analyses, (v) Map-Reducebased systems for spatial computation and visualization, (vi) CyberGIS systems, and (vii) foundational parallel algorithms and data structures for polygonal datasets, and why HPC infrastructures, including harnessing graphics accelerators, are needed for time-critical applications.
AB - This vision paper reviews the current state-ofart and lays out emerging research challenges in parallel processing of spatial-temporal large datasets relevant to a variety of scientific communities. The spatio-temporal data, whether captured through remote sensors (global earth observations), ground and ocean sensors (e.g., soil moisture sensors, buoys), social media and hand-held, traffic-related sensors and cameras, medical imaging (e.g., MRI), or large scale simulations (e.g., climate) have always been 'big.' A common thread among all these big collections of datasets is that they are spatial and temporal. Processing and analyzing these datasets requires high-performance computing (HPC) infrastructures. Various agencies, scientific communities and increasingly the society at large rely on spatial data management, analysis, and spatial data mining to gain insights and produce actionable plans. Therefore, an ecosystem of integrated and reliable software infrastructure is required for spatialtemporal big data management and analysis that will serve as crucial tools for solving a wide set of research problems from different scientific and engineering areas and to empower users with next-generation tools. This vision requires a multidisciplinary effort to significantly advance domain research and have a broad impact on the society. The areas of research discussed in this paper include (i) spatial data mining, (ii) data analytics over remote sensing data, (iii) processing medical images, (iv) spatial econometrics analyses, (v) Map-Reducebased systems for spatial computation and visualization, (vi) CyberGIS systems, and (vii) foundational parallel algorithms and data structures for polygonal datasets, and why HPC infrastructures, including harnessing graphics accelerators, are needed for time-critical applications.
KW - CyberGIS
KW - High performance computing
KW - Map-reduce systems
KW - Medical images
KW - Parallel algorithms and data structures
KW - Remote sensing data
KW - Spatial data mining
KW - Spatial econometrics
UR - http://www.scopus.com/inward/record.url?scp=85032384688&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85032384688&partnerID=8YFLogxK
U2 - 10.1109/BigDataCongress.2017.39
DO - 10.1109/BigDataCongress.2017.39
M3 - Conference contribution
AN - SCOPUS:85032384688
T3 - Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017
SP - 232
EP - 250
BT - Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017
A2 - Karypis, George
A2 - Zhang, Jia
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 25 June 2017 through 30 June 2017
ER -