TY - JOUR
T1 - Toward Expedited Impedance Tuning of a Robotic Prosthesis for Personalized Gait Assistance by Reinforcement Learning Control
AU - Li, Minhan
AU - Wen, Yue
AU - Gao, Xiang
AU - Si, Jennie
AU - Huang, He
N1 - Funding Information:
This work was supported in part by the National Science Foundation under Grant 1563454, Grant 1563921, Grant 1808752, and Grant 1808898.
Publisher Copyright:
© 2004-2012 IEEE.
PY - 2022/2/1
Y1 - 2022/2/1
N2 - Personalizing medical devices such as lower limb wearable robots is challenging. While the initial feasibility of automating the process of knee prosthesis control parameter tuning has been demonstrated in a principled way, the next critical issue is to improve tuning efficiency and speed it up for the human user, in clinic settings, while maintaining human safety. We, therefore, propose a policy iteration with constraint embedded (PICE) method as an innovative solution to the problem under the framework of reinforcement learning. Central to PICE is the use of a projected Bellman equation with a constraint of assuring positive semidefiniteness of performance values during policy evaluation. Additionally, we developed both online and offline PICE implementations that provide additional flexibility for the designer to fully utilize measurement data, either from on-policy or off-policy, to further improve PICE tuning efficiency. Our human subject testing showed that the PICE provided effective policies with significantly reduced tuning time. For the first time, we also experimentally evaluated and demonstrated the robustness of the deployed policies by applying them to different tasks and users. Putting it together, our new way of problem solving has been effective as PICE has demonstrated its potential toward truly automating the process of control parameter tuning for robotic knee prosthesis users.
AB - Personalizing medical devices such as lower limb wearable robots is challenging. While the initial feasibility of automating the process of knee prosthesis control parameter tuning has been demonstrated in a principled way, the next critical issue is to improve tuning efficiency and speed it up for the human user, in clinic settings, while maintaining human safety. We, therefore, propose a policy iteration with constraint embedded (PICE) method as an innovative solution to the problem under the framework of reinforcement learning. Central to PICE is the use of a projected Bellman equation with a constraint of assuring positive semidefiniteness of performance values during policy evaluation. Additionally, we developed both online and offline PICE implementations that provide additional flexibility for the designer to fully utilize measurement data, either from on-policy or off-policy, to further improve PICE tuning efficiency. Our human subject testing showed that the PICE provided effective policies with significantly reduced tuning time. For the first time, we also experimentally evaluated and demonstrated the robustness of the deployed policies by applying them to different tasks and users. Putting it together, our new way of problem solving has been effective as PICE has demonstrated its potential toward truly automating the process of control parameter tuning for robotic knee prosthesis users.
KW - Impedance control
KW - knee prosthesis
KW - policy iteration
KW - rehabilitation robotics
KW - reinforcement learning (RL)
UR - http://www.scopus.com/inward/record.url?scp=85107219950&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85107219950&partnerID=8YFLogxK
U2 - 10.1109/TRO.2021.3078317
DO - 10.1109/TRO.2021.3078317
M3 - Article
AN - SCOPUS:85107219950
SN - 1552-3098
VL - 38
SP - 407
EP - 420
JO - IEEE Transactions on Robotics
JF - IEEE Transactions on Robotics
IS - 1
ER -