TY - JOUR
T1 - Twin-delayed deep deterministic policy gradient for low-frequency oscillation damping control
AU - Cui, Qiushi
AU - Kim, Gyoungjae
AU - Weng, Yang
N1 - Funding Information:
Funding: This research was funded by the Collaborative Research of Learning and Optimizing Power Systems: A Geometric Approach, Award Number: 1810537, as well as the CAREER award of Faithful, Reducible, and Invertible Learning in Distribution System for Power Flow, Award Number: 2048288.
Publisher Copyright:
© 2021 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2021/10/1
Y1 - 2021/10/1
N2 - Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To address the overestimation bias issue, we redesign the learning structure of the deep deterministic policy gradient (DDPG). Then we develop a damping control twin-delayed deep deterministic policy gradient method to handle the damping control issue under unknown latency in the power network. The purpose is to address the damping control issue under unknown latency in the power network. This paper will create a novel reward algorithm, taking into account the machine speed deviation, the episode termination prevention, and the feedback from action space. In this way, the system optimally damps down frequency oscillations while maintaining the system’s stability and reliable operation within defined limits. The simulation results verify the proposed algorithm in various perspectives, including the latency sensitivity analysis under high renewable energy penetration and the comparison with conventional and machine learning control algorithms. The proposed method shows a fast learning curve and good control performance under varying communication latency.
AB - Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To address the overestimation bias issue, we redesign the learning structure of the deep deterministic policy gradient (DDPG). Then we develop a damping control twin-delayed deep deterministic policy gradient method to handle the damping control issue under unknown latency in the power network. The purpose is to address the damping control issue under unknown latency in the power network. This paper will create a novel reward algorithm, taking into account the machine speed deviation, the episode termination prevention, and the feedback from action space. In this way, the system optimally damps down frequency oscillations while maintaining the system’s stability and reliable operation within defined limits. The simulation results verify the proposed algorithm in various perspectives, including the latency sensitivity analysis under high renewable energy penetration and the comparison with conventional and machine learning control algorithms. The proposed method shows a fast learning curve and good control performance under varying communication latency.
KW - Damping control
KW - Latency
KW - Low-frequency oscillations
KW - Twin-delayed deep deterministic policy gradient
KW - Wide-area measurement systems
UR - http://www.scopus.com/inward/record.url?scp=85117327438&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85117327438&partnerID=8YFLogxK
U2 - 10.3390/en14206695
DO - 10.3390/en14206695
M3 - Article
AN - SCOPUS:85117327438
SN - 1996-1073
VL - 14
JO - Energies
JF - Energies
IS - 20
M1 - 6695
ER -