Reinforcement-Learning-Based Tracking Control of Waste Water Treatment Process under Realistic System Conditions and Control Performance Requirements

Qinmin Yang; Weiwei Cao; Wenchao Meng; Jennie Si

doi:10.1109/TSMC.2021.3122802

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Contribution to journal › Article › peer-review

37 Scopus citations

Abstract

The tracking control of a wastewater treatment process (WWTP) is considered. The process is highly nonlinear, with strong coupling, difficult to model mathematically, and the operation is subject to unknown disturbances. We address this multivariable tracking control problem by applying the direct heuristic dynamic programming (dHDP)-based reinforcement learning control. The control goal is to track a desired reference of the dissolved oxygen (DO) concentration of the 5th aerobic zone (SO5) and nitrate concentration of the 2nd anoxic zone (SNO2) by manipulating the oxygen transfer coefficient of the 5th aerobic zone (KLa5) and internal recycle flow rate (Qa). The dHDP aims at achieving a minimal accumulated WWTP tracking error while dealing with strong coupling between the SO5 and SNO2 and eliminating unknown disturbances in the process. The proposed dHDP approach devises an optimal control strategy entirely driven by WWTP process data as an online learning control method. We have conducted extensive and systematic simulations based on the well-known BSM1 platform of the WWTP controlled by dHDP to compare and contrast performances with other methods.
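
As a rough illustration of the control objective stated in the abstract, the tracking errors and an accumulated cost of the kind that dHDP-style methods typically minimize can be written as follows. The symbols e(t), u(t), r(t), the weighting matrix Q, the discount factor \alpha, and the quadratic form of the stage cost are assumptions introduced here for illustration; this record does not state the cost definition used in the paper.

```latex
% Tracking errors of the two controlled variables and the two manipulated
% inputs named in the abstract (notation assumed)
e(t) = \begin{bmatrix}
         S_{O,5}(t)  - S_{O,5}^{\mathrm{ref}}(t) \\
         S_{NO,2}(t) - S_{NO,2}^{\mathrm{ref}}(t)
       \end{bmatrix},
\qquad
u(t) = \begin{bmatrix} K_L a_5(t) \\ Q_a(t) \end{bmatrix}

% Assumed quadratic stage cost with a positive-definite weight Q, and the
% discounted accumulated tracking cost to be minimized
r(t) = e(t)^{\top} Q \, e(t),
\qquad
J(t) = \sum_{k=0}^{\infty} \alpha^{k} \, r(t+k),
\quad 0 < \alpha < 1
```

Under these assumptions, off-diagonal entries of Q would be one way to penalize the coupling between SO5 and SNO2 explicitly; a diagonal Q treats the two tracking errors independently.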

Original language	English (US)
Pages (from-to)	5284-5294
Number of pages	11
Journal	IEEE Transactions on Systems, Man, and Cybernetics: Systems
Volume	52
Issue number	8
DOIs	https://doi.org/10.1109/TSMC.2021.3122802
State	Published - Aug 1 2022

Keywords

Action strategy approximation
cost function estimation
direct heuristic dynamic programming (direct HDP or dHDP)
online learning
tracking control
wastewater treatment process (WWTP)

ASJC Scopus subject areas

Software
Control and Systems Engineering
Human-Computer Interaction
Computer Science Applications
Electrical and Electronic Engineering

Access to Document

10.1109/TSMC.2021.3122802

Cite this

Reinforcement-Learning-Based Tracking Control of Waste Water Treatment Process under Realistic System Conditions and Control Performance Requirements. / Yang, Qinmin; Cao, Weiwei; Meng, Wenchao et al.
In: IEEE Transactions on Systems, Man, and Cybernetics: Systems, Vol. 52, No. 8, 01.08.2022, p. 5284-5294.

@article{03db0d67d1894eba9670df85d3258847,

title = "Reinforcement-Learning-Based Tracking Control of Waste Water Treatment Process under Realistic System Conditions and Control Performance Requirements",

abstract = "The tracking control of a wastewater treatment process (WWTP) is considered. The process is highly nonlinear, with strong coupling, difficult to model mathematically, and the operation is subject to unknown disturbances. We address this multivariable tracking control problem by applying the direct heuristic dynamic programming (dHDP)-based reinforcement learning control. The control goal is to track a desired reference of the dissolved oxygen (DO) concentration of the 5th aerobic zone (SO5) and nitrate concentration of the 2nd anoxic zone (SNO2) by manipulating the oxygen transfer coefficient of the 5th aerobic zone (KLa5) and internal recycle flow rate (Qa). The dHDP aims at achieving a minimal accumulated WWTP tracking error while dealing with strong coupling between the SO5 and SNO2 and eliminating unknown disturbances in the process. The proposed dHDP approach devises an optimal control strategy entirely driven by WWTP process data as an online learning control method. We have conducted extensive and systematic simulations based on the well-known BSM1 platform of the WWTP controlled by dHDP to compare and contrast performances with other methods.",

keywords = "Action strategy approximation, cost function estimation, direct heuristic dynamic programming (direct HDP or dHDP), online learning, tracking control, wastewater treatment process (WWTP)",

author = "Qinmin Yang and Weiwei Cao and Wenchao Meng and Jennie Si",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.",

year = "2022",

month = aug,

day = "1",

doi = "10.1109/TSMC.2021.3122802",

language = "English (US)",

volume = "52",

pages = "5284--5294",

journal = "IEEE Transactions on Systems, Man, and Cybernetics: Systems",

issn = "2168-2216",

publisher = "IEEE Advancing Technology for Humanity",

number = "8",

}

TY - JOUR

T1 - Reinforcement-Learning-Based Tracking Control of Waste Water Treatment Process under Realistic System Conditions and Control Performance Requirements

AU - Yang, Qinmin

AU - Cao, Weiwei

AU - Meng, Wenchao

AU - Si, Jennie

PY - 2022/8/1

Y1 - 2022/8/1

N2 - The tracking control of a wastewater treatment process (WWTP) is considered. The process is highly nonlinear, with strong coupling, difficult to model mathematically, and the operation is subject to unknown disturbances. We address this multivariable tracking control problem by applying the direct heuristic dynamic programming (dHDP)-based reinforcement learning control. The control goal is to track a desired reference of the dissolved oxygen (DO) concentration of the 5th aerobic zone (SO5) and nitrate concentration of the 2nd anoxic zone (SNO2) by manipulating the oxygen transfer coefficient of the 5th aerobic zone (KLa5) and internal recycle flow rate (Qa). The dHDP aims at achieving a minimal accumulated WWTP tracking error while dealing with strong coupling between the SO5 and SNO2 and eliminating unknown disturbances in the process. The proposed dHDP approach devises an optimal control strategy entirely driven by WWTP process data as an online learning control method. We have conducted extensive and systematic simulations based on the well-known BSM1 platform of the WWTP controlled by dHDP to compare and contrast performances with other methods.

AB - The tracking control of a wastewater treatment process (WWTP) is considered. The process is highly nonlinear, with strong coupling, difficult to model mathematically, and the operation is subject to unknown disturbances. We address this multivariable tracking control problem by applying the direct heuristic dynamic programming (dHDP)-based reinforcement learning control. The control goal is to track a desired reference of the dissolved oxygen (DO) concentration of the 5th aerobic zone (SO5) and nitrate concentration of the 2nd anoxic zone (SNO2) by manipulating the oxygen transfer coefficient of the 5th aerobic zone (KLa5) and internal recycle flow rate (Qa). The dHDP aims at achieving a minimal accumulated WWTP tracking error while dealing with strong coupling between the SO5 and SNO2 and eliminating unknown disturbances in the process. The proposed dHDP approach devises an optimal control strategy entirely driven by WWTP process data as an online learning control method. We have conducted extensive and systematic simulations based on the well-known BSM1 platform of the WWTP controlled by dHDP to compare and contrast performances with other methods.

KW - Action strategy approximation

KW - cost function estimation

KW - direct heuristic dynamic programming (direct HDP or dHDP)

KW - online learning

KW - tracking control

KW - wastewater treatment process (WWTP)

UR - http://www.scopus.com/inward/record.url?scp=85132903841&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85132903841&partnerID=8YFLogxK

U2 - 10.1109/TSMC.2021.3122802

DO - 10.1109/TSMC.2021.3122802

M3 - Article

AN - SCOPUS:85132903841

SN - 2168-2216

VL - 52

SP - 5284

EP - 5294

JO - IEEE Transactions on Systems, Man, and Cybernetics: Systems

JF - IEEE Transactions on Systems, Man, and Cybernetics: Systems

IS - 8

ER -
