Twin-delayed deep deterministic policy gradient for low-frequency oscillation damping control

Qiushi Cui; Gyoungjae Kim; Yang Weng

doi:10.3390/en14206695

Twin-delayed deep deterministic policy gradient for low-frequency oscillation damping control

Qiushi Cui, Gyoungjae Kim, Yang Weng

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To address the overestimation bias issue, we redesign the learning structure of the deep deterministic policy gradient (DDPG). Then we develop a damping control twin-delayed deep deterministic policy gradient method to handle the damping control issue under unknown latency in the power network. The purpose is to address the damping control issue under unknown latency in the power network. This paper will create a novel reward algorithm, taking into account the machine speed deviation, the episode termination prevention, and the feedback from action space. In this way, the system optimally damps down frequency oscillations while maintaining the system’s stability and reliable operation within defined limits. The simulation results verify the proposed algorithm in various perspectives, including the latency sensitivity analysis under high renewable energy penetration and the comparison with conventional and machine learning control algorithms. The proposed method shows a fast learning curve and good control performance under varying communication latency.

Original language	English (US)
Article number	6695
Journal	Energies
Volume	14
Issue number	20
DOIs	https://doi.org/10.3390/en14206695
State	Published - Oct 1 2021

Keywords

Damping control
Latency
Low-frequency oscillations
Twin-delayed deep deterministic policy gradient
Wide-area measurement systems

ASJC Scopus subject areas

Renewable Energy, Sustainability and the Environment
Fuel Technology
Energy Engineering and Power Technology
Energy (miscellaneous)
Control and Optimization
Electrical and Electronic Engineering

Access to Document

10.3390/en14206695

Cite this

@article{3e9eafd2edf946fbaa54e524d938ac98,

title = "Twin-delayed deep deterministic policy gradient for low-frequency oscillation damping control",

abstract = "Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To address the overestimation bias issue, we redesign the learning structure of the deep deterministic policy gradient (DDPG). Then we develop a damping control twin-delayed deep deterministic policy gradient method to handle the damping control issue under unknown latency in the power network. The purpose is to address the damping control issue under unknown latency in the power network. This paper will create a novel reward algorithm, taking into account the machine speed deviation, the episode termination prevention, and the feedback from action space. In this way, the system optimally damps down frequency oscillations while maintaining the system{\textquoteright}s stability and reliable operation within defined limits. The simulation results verify the proposed algorithm in various perspectives, including the latency sensitivity analysis under high renewable energy penetration and the comparison with conventional and machine learning control algorithms. The proposed method shows a fast learning curve and good control performance under varying communication latency.",

keywords = "Damping control, Latency, Low-frequency oscillations, Twin-delayed deep deterministic policy gradient, Wide-area measurement systems",

author = "Qiushi Cui and Gyoungjae Kim and Yang Weng",

note = "Funding Information: Funding: This research was funded by the Collaborative Research of Learning and Optimizing Power Systems: A Geometric Approach, Award Number: 1810537, as well as the CAREER award of Faithful, Reducible, and Invertible Learning in Distribution System for Power Flow, Award Number: 2048288. Publisher Copyright: {\textcopyright} 2021 by the authors. Licensee MDPI, Basel, Switzerland.",

year = "2021",

month = oct,

day = "1",

doi = "10.3390/en14206695",

language = "English (US)",

volume = "14",

journal = "Energies",

issn = "1996-1073",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "20",

}

TY - JOUR

T1 - Twin-delayed deep deterministic policy gradient for low-frequency oscillation damping control

AU - Cui, Qiushi

AU - Kim, Gyoungjae

AU - Weng, Yang

N1 - Funding Information: Funding: This research was funded by the Collaborative Research of Learning and Optimizing Power Systems: A Geometric Approach, Award Number: 1810537, as well as the CAREER award of Faithful, Reducible, and Invertible Learning in Distribution System for Power Flow, Award Number: 2048288. Publisher Copyright: © 2021 by the authors. Licensee MDPI, Basel, Switzerland.

PY - 2021/10/1

Y1 - 2021/10/1

N2 - Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To address the overestimation bias issue, we redesign the learning structure of the deep deterministic policy gradient (DDPG). Then we develop a damping control twin-delayed deep deterministic policy gradient method to handle the damping control issue under unknown latency in the power network. The purpose is to address the damping control issue under unknown latency in the power network. This paper will create a novel reward algorithm, taking into account the machine speed deviation, the episode termination prevention, and the feedback from action space. In this way, the system optimally damps down frequency oscillations while maintaining the system’s stability and reliable operation within defined limits. The simulation results verify the proposed algorithm in various perspectives, including the latency sensitivity analysis under high renewable energy penetration and the comparison with conventional and machine learning control algorithms. The proposed method shows a fast learning curve and good control performance under varying communication latency.

AB - Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To address the overestimation bias issue, we redesign the learning structure of the deep deterministic policy gradient (DDPG). Then we develop a damping control twin-delayed deep deterministic policy gradient method to handle the damping control issue under unknown latency in the power network. The purpose is to address the damping control issue under unknown latency in the power network. This paper will create a novel reward algorithm, taking into account the machine speed deviation, the episode termination prevention, and the feedback from action space. In this way, the system optimally damps down frequency oscillations while maintaining the system’s stability and reliable operation within defined limits. The simulation results verify the proposed algorithm in various perspectives, including the latency sensitivity analysis under high renewable energy penetration and the comparison with conventional and machine learning control algorithms. The proposed method shows a fast learning curve and good control performance under varying communication latency.

KW - Damping control

KW - Latency

KW - Low-frequency oscillations

KW - Twin-delayed deep deterministic policy gradient

KW - Wide-area measurement systems

UR - http://www.scopus.com/inward/record.url?scp=85117327438&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85117327438&partnerID=8YFLogxK

U2 - 10.3390/en14206695

DO - 10.3390/en14206695

M3 - Article

AN - SCOPUS:85117327438

SN - 1996-1073

VL - 14

JO - Energies

JF - Energies

IS - 20

M1 - 6695

ER -

Twin-delayed deep deterministic policy gradient for low-frequency oscillation damping control

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this