TY - JOUR
T1 - Deep reinforcement learning with spatio-temporal traffic forecasting for data-driven base station sleep control
AU - Wu, Qiong
AU - Chen, Xu
AU - Zhou, Zhi
AU - Chen, Liang
AU - Zhang, Junshan
N1 - Funding Information:
Manuscript received May 24, 2019; revised April 21, 2020; accepted January 17, 2021; approved by IEEE/ACM TRANSACTIONS ON NETWORKING Editor E. Uysal. Date of publication January 29, 2021; date of current version April 16, 2021. This work was supported in part by the National Key Research and Development under Grant U20A20159 and Grant 61972432, in part by the Program for Guangdong Introducing Innovative and Entrepreneurial Teams under Grant 2017ZT07X355, and in part by the Pearl River Talent Recruitment Program under Grant 2017GC010465. (Corresponding author: Xu Chen.) Qiong Wu, Xu Chen, and Zhi Zhou are with the School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou 510006, China (e-mail: chenxu35@mail.sysu.edu.cn).
Publisher Copyright:
© 1993-2012 IEEE.
PY - 2021/4
Y1 - 2021/4
N2 - To meet the ever increasing mobile traffic demand in 5G era, base stations (BSs) have been densely deployed in radio access networks (RANs) to increase the network coverage and capacity. However, as the high density of BSs is designed to accommodate peak traffic, it would consume an unnecessarily large amount of energy if BSs are on during off-peak time. To save the energy consumption of cellular networks, an effective way is to deactivate some idle base stations that do not serve any traffic demand. In this paper, we develop a traffic-aware dynamic BS sleep control framework, named DeepBSC, which presents a novel data-driven learning approach to determine the BS active/sleep modes while meeting lower energy consumption and satisfactory Quality of Service (QoS) requirements. Specifically, the traffic demands are predicted by the proposed GS-STN model, which leverages the geographical and semantic spatial-temporal correlations of mobile traffic. With accurate mobile traffic forecasting, the BS sleep control problem is cast as a Markov Decision Process that is solved by Actor-Critic reinforcement learning methods. To reduce the variance of cost estimation in the dynamic environment, we propose a benchmark transformation method that provides robust performance indicator for policy update. To expedite the training process, we adopt a Deep Deterministic Policy Gradient (DDPG) approach, together with an explorer network, which can strengthen the exploration further. Extensive experiments with a real-world dataset corroborate that our proposed framework significantly outperforms the existing methods.
AB - To meet the ever increasing mobile traffic demand in 5G era, base stations (BSs) have been densely deployed in radio access networks (RANs) to increase the network coverage and capacity. However, as the high density of BSs is designed to accommodate peak traffic, it would consume an unnecessarily large amount of energy if BSs are on during off-peak time. To save the energy consumption of cellular networks, an effective way is to deactivate some idle base stations that do not serve any traffic demand. In this paper, we develop a traffic-aware dynamic BS sleep control framework, named DeepBSC, which presents a novel data-driven learning approach to determine the BS active/sleep modes while meeting lower energy consumption and satisfactory Quality of Service (QoS) requirements. Specifically, the traffic demands are predicted by the proposed GS-STN model, which leverages the geographical and semantic spatial-temporal correlations of mobile traffic. With accurate mobile traffic forecasting, the BS sleep control problem is cast as a Markov Decision Process that is solved by Actor-Critic reinforcement learning methods. To reduce the variance of cost estimation in the dynamic environment, we propose a benchmark transformation method that provides robust performance indicator for policy update. To expedite the training process, we adopt a Deep Deterministic Policy Gradient (DDPG) approach, together with an explorer network, which can strengthen the exploration further. Extensive experiments with a real-world dataset corroborate that our proposed framework significantly outperforms the existing methods.
KW - Base station sleep control
KW - Deep reinforcement learning
KW - Spatio-temporal traffic forecasting
UR - http://www.scopus.com/inward/record.url?scp=85100472461&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100472461&partnerID=8YFLogxK
U2 - 10.1109/TNET.2021.3053771
DO - 10.1109/TNET.2021.3053771
M3 - Article
AN - SCOPUS:85100472461
SN - 1063-6692
VL - 29
SP - 935
EP - 948
JO - IEEE/ACM Transactions on Networking
JF - IEEE/ACM Transactions on Networking
IS - 2
M1 - 9340607
ER -