TY - GEN
T1 - Assessment of a new GeoAI foundation model for flood inundation mapping
AU - Li, Wenwen
AU - Lee, Hyunho
AU - Wang, Sizhe
AU - Hsu, Chia Yu
AU - Arundel, Samantha T.
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/11/13
Y1 - 2023/11/13
N2 - Vision foundation models are a new frontier in Geospatial Artificial Intelligence (GeoAI), an interdisciplinary research area that applies and extends AI for geospatial problem solving and geographic knowledge discovery, because of their potential to enable powerful image analysis by learning and extracting important image features from vast amounts of geospatial data. This paper evaluates the performance of the first-of-its-kind geospatial foundation model, IBM-NASA's Prithvi, to support a crucial geospatial analysis task: flood inundation mapping. This model is compared with convolutional neural network and vision transformer-based architectures in terms of mapping accuracy for flooded areas. A benchmark dataset, Sen1Floods11, is used in the experiments, and the models' predictability, generalizability, and transferability are evaluated based on both a test dataset and a dataset that is completely unseen by the model. Results show the good transferability of the Prithvi model, highlighting its performance advantages in segmenting flooded areas in previously unseen regions. The findings also indicate areas for improvement for the Prithvi model in terms of adopting multi-scale representation learning, developing more end-to-end pipelines for high-level image analysis tasks, and offering more flexibility in terms of input data bands.
AB - Vision foundation models are a new frontier in Geospatial Artificial Intelligence (GeoAI), an interdisciplinary research area that applies and extends AI for geospatial problem solving and geographic knowledge discovery, because of their potential to enable powerful image analysis by learning and extracting important image features from vast amounts of geospatial data. This paper evaluates the performance of the first-of-its-kind geospatial foundation model, IBM-NASA's Prithvi, to support a crucial geospatial analysis task: flood inundation mapping. This model is compared with convolutional neural network and vision transformer-based architectures in terms of mapping accuracy for flooded areas. A benchmark dataset, Sen1Floods11, is used in the experiments, and the models' predictability, generalizability, and transferability are evaluated based on both a test dataset and a dataset that is completely unseen by the model. Results show the good transferability of the Prithvi model, highlighting its performance advantages in segmenting flooded areas in previously unseen regions. The findings also indicate areas for improvement for the Prithvi model in terms of adopting multi-scale representation learning, developing more end-to-end pipelines for high-level image analysis tasks, and offering more flexibility in terms of input data bands.
KW - Artificial Intelligence
KW - GeoAI
KW - Segformer
KW - U-Net
KW - semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85180010544&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85180010544&partnerID=8YFLogxK
U2 - 10.1145/3615886.3627747
DO - 10.1145/3615886.3627747
M3 - Conference contribution
AN - SCOPUS:85180010544
T3 - GeoAI 2023 - Proceedings of the 6th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery
SP - 102
EP - 109
BT - GeoAI 2023 - Proceedings of the 6th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery
A2 - Newsam, Shawn
A2 - Yang, Lexie
A2 - Mai, Gengchen
A2 - Martins, Bruno
A2 - Lunga, Dalton
A2 - Gao, Song
PB - Association for Computing Machinery, Inc
T2 - 6th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery, GeoAI 2023
Y2 - 13 November 2023
ER -