TY - JOUR
T1 - Efficient Continuous Big Data Integrity Checking for Decentralized Storage
AU - Yu, Haiyang
AU - Hu, Qi
AU - Yang, Zhen
AU - Liu, Huan
N1 - Funding Information:
Manuscript received September 14, 2020; revised January 24, 2021 and March 7, 2021; accepted March 20, 2021. Date of publication March 23, 2021; date of current version July 7, 2021. This work was supported by the National Natural Science Foundation of China under Grant 61671030, Industrial Internet Innovation Development Project, and China Postdoctoral Science Foundation under Grant 2019M660377. This work was supported by the National Key Research and Development Program of China (2020YFB2009501). It was also supported by Engineering Research Center of Intelligent Perception and Autonomous Control, Ministry of Education. Recommended for acceptance by Dr. Yulei Wu. (Corresponding author: Zhen Yang.) Haiyang Yu, Qi Hu, and Zhen Yang are with the Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China (e-mail: [email protected]; [email protected]; yangzhen@bjut. edu.cn Member).
Publisher Copyright:
© 2013 IEEE.
PY - 2021/4/1
Y1 - 2021/4/1
N2 - Decentralized storage powered by blockchain is becoming a new trend that allows data owners to outsource their data to remote storage resources offered by various storage providers. Unfortunately, unqualified storage providers easily encounter unpredictable downtime due to security threats, such as malicious attacks or system failures, which is unacceptable in many real-time or data-driven applications. As a result, continuous data integrity should be guaranteed in decentralized storage, which ensures that data is intact and available for the entire storage period. However, this requires frequent checking for long time periods and incurs heavy burdens of both communication and computation, especially in a big data scenario. In this paper, we propose an efficient continuous big data integrity checking approach for decentralized storage. We design a data-time sampling strategy that randomly checks the integrity of multiple files at each time slot with high checking probability. Furthermore, to tackle the fairness problem derived from the sampling strategy, we propose a fair approach by designing an arbitration algorithm with the verifiable random function. Security analysis shows the security of our approach under the random oracle model. Evaluation and experiments demonstrate that our approach is more efficient in the big data scenario compared with the state-of-the-arts.
AB - Decentralized storage powered by blockchain is becoming a new trend that allows data owners to outsource their data to remote storage resources offered by various storage providers. Unfortunately, unqualified storage providers easily encounter unpredictable downtime due to security threats, such as malicious attacks or system failures, which is unacceptable in many real-time or data-driven applications. As a result, continuous data integrity should be guaranteed in decentralized storage, which ensures that data is intact and available for the entire storage period. However, this requires frequent checking for long time periods and incurs heavy burdens of both communication and computation, especially in a big data scenario. In this paper, we propose an efficient continuous big data integrity checking approach for decentralized storage. We design a data-time sampling strategy that randomly checks the integrity of multiple files at each time slot with high checking probability. Furthermore, to tackle the fairness problem derived from the sampling strategy, we propose a fair approach by designing an arbitration algorithm with the verifiable random function. Security analysis shows the security of our approach under the random oracle model. Evaluation and experiments demonstrate that our approach is more efficient in the big data scenario compared with the state-of-the-arts.
KW - Big data
KW - data integrity checking
KW - decentralized storage
KW - sampling
KW - verifiable random function
UR - http://www.scopus.com/inward/record.url?scp=85103250452&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85103250452&partnerID=8YFLogxK
U2 - 10.1109/TNSE.2021.3068261
DO - 10.1109/TNSE.2021.3068261
M3 - Article
AN - SCOPUS:85103250452
SN - 2327-4697
VL - 8
SP - 1658
EP - 1673
JO - IEEE Transactions on Network Science and Engineering
JF - IEEE Transactions on Network Science and Engineering
IS - 2
M1 - 9384270
ER -