Optimal tracking control for robot manipulators with input constraint based reinforcement learning

Nguyen Duc Dien; Nguyen Tan Luy; Lai Khac Lai; Tran Thanh Hai

doi:10.15625/1813-9663/18099

Author affiliations

Authors

Nguyen Duc Dien University of Economics - Technology for Industry, 456 Minh Khai Street, Hai Ba Trung District, Ha Noi, Viet Nam
Nguyen Tan Luy Ho Chi Minh City University of Technology–VNU, 268 Ly Thuong Kiet Street, Ward 14, District 10, Ho Chi Minh City, Viet Nam
Lai Khac Lai Thai Nguyen University of Technology, 666 Street 3/2, Thai Nguyen City, Thai Nguyen Province, Viet Nam
Tran Thanh Hai Industrial University of Ho Chi Minh City, 12 Nguyen Van Bao Street, Ward 4, Go Vap District, Ho Chi Minh City, Viet Nam

DOI:

https://doi.org/10.15625/1813-9663/18099

Keywords:

Reinforcement learning, saturation torques, saturated optimal tracking control, robot.

Abstract

This paper introduces an optimal tracking controller for robot manipulators with saturation torques. The robot model is presented as a strict-feedback nonlinear system. Firstly, the position tracking control problem is transformed into the optimal tracking control problem. Subsequently, the saturated optimal control law is designed. The optimal control law is determined through the solution of the Hamilton-Jacobi-Bellman (HJB) equation. We use a reinforcement learning algorithm with only one neural network (NN) to approximate the solution of the equation HJB. The technique of experience replay is used to relax a persistent citation condition. By Lyapunov analysis, the tracking and the approximation errors are uniformly ultimately bounded (UUB). Finally, the simulation on a robot manipulator with saturation torques is performed to verify the efficiency of the proposed controller.

Metrics

PDF views

217

References

J. Baek, W. Kwon, and C. Kang, “A new widely and stably adaptive sliding-mode control with nonsingular terminal sliding variable for robot manipulators,” IEEE Access, vol. 8, pp. 43 443– 43 454, 2020. DOI: https://doi.org/10.1109/ACCESS.2020.2977434

D. Chwa and H. Kwon, “Nonlinear robust control of unknown robot manipulator systems with actuators and disturbances using system identification and integral sliding mode disturbance observer,” IEEE Access, vol. 10, pp. 35 410–35 421, 2022. DOI: https://doi.org/10.1109/ACCESS.2022.3163306

W. He, Y. Dong, and C. Sun, “Adaptive neural impedance control of a robotic manipulator with input saturation,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 46, no. 3, pp. 334–344, 2015. DOI: https://doi.org/10.1109/TSMC.2015.2429555

W. Jie, L. M. Cheol, K. Jaehyung, and K. H. Hee, “Fast fractional-order terminal sliding mode control with rbfnn based sliding perturbation observer for 7-dof robot manipulator,” IEEE Access, vol. 9, pp. 67 117–67 128, 2021. DOI: https://doi.org/10.1109/ACCESS.2021.3075697

W. Jie, Z. Yudong, B. Yulong, H. H. Kim, and M. C. Lee, “Trajectory tracking control using fractional-order terminal sliding mode control with sliding perturbation observer for a 7-dof robot manipulator,” IEEE/ASME Transactions on Mechatronics, vol. 25, no. 4, pp. 1886–1893, DOI: https://doi.org/10.1109/TMECH.2020.2992676

R. Kamalapurkar, H. Dinh, S. Bhasin, and W. E. Dixon, “Approximate optimal trajectory tracking for continuous-time nonlinear systems,” Automatica, vol. 51, pp. 40–48, 2015. DOI: https://doi.org/10.1016/j.automatica.2014.10.103

L. Kong, W. He, C. Yang, and C. Sun, “Robust neurooptimal control for a robot via adaptive dynamic programming,” IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 6, pp. 2584–2594, 2020. DOI: https://doi.org/10.1109/TNNLS.2020.3006850

F. L. Lewis, D. M. Dawson, and C. T. Abdallah, Robot manipulator control: theory and practice. CRC Press, 2003. DOI: https://doi.org/10.1201/9780203026953

F. L. Lewis, K. Liu, and A. Yesildirek, “Neural net robot controller with guaranteed tracking performance,” IEEE Transactions on Neural Networks, vol. 6, no. 3, pp. 703–715, 1995. DOI: https://doi.org/10.1109/72.377975

S. Ling, H. Wang, and P. X. Liu, “Adaptive fuzzy dynamic surface control of flexible-joint robot systems with input saturation,” IEEE/CAA Journal of Automatica Sinica, vol. 6, no. 1, pp. 97–107, 2019.

N. T. Luy, “Reinforecement learning-based optimal tracking control for wheeled mobile robot,” in 2012 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER). IEEE, 2012, pp. 371–376.

L. F. Modares H and J. ZP, “h∞ tracking control of completely unknown continuous-time systems via off-policy reinforcement learning,” IEEE Trans Neural Netw Learn Syst, vol. 26, no. 10, pp. 2550–62, 2015.

V.-C. Nguyen, A.-T. Vo, and H.-J. Kang, “A non-singular fast terminal sliding mode control based on third-order sliding mode observer for a class of second-order uncertain nonlinear systems and its application to robot manipulators,” IEEE Access, vol. 8, pp. 78 109–78 120, 2020.

L. Nguyen Tan, “Distributed optimal control for nonholonomic systems with input constraints and uncertain interconnections,” Nonlinear Dynamics, vol. 93, no. 2, pp. 801–817, 2018.

L. N. Tan, “Distributed h∞ optimal tracking control for strict-feedback nonlinear large-scale systems with disturbances and saturating actuators,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 50, no. 11, pp. 4719–4731, 2018.

L. N. Tan and T. C. Pham, “Optimal tracking control for pmsm with partially unknown dynamics, saturation voltages, torque, and voltage disturbances,” IEEE Transactions on Industrial Electronics, vol. 69, no. 4, pp. 3481–3491, 2021.

T. N. Truong, A. T. Vo, and H.-J. Kang, “A backstepping global fast terminal sliding mode control for trajectory tracking control of industrial robotic manipulators,” IEEE Access, vol. 9, pp. 31 921–31 931, 2021.

K. G. Vamvoudakis and F. L. Lewis, “Online actor–critic algorithm to solve the continuous-time infinite horizon optimal control problem,” Automatica, vol. 46, no. 5, pp. 878–888, 2010.

K. G. Vamvoudakis, M. F. Miranda, and J. P. Hespanha, “Asymptotically stable adaptive– optimal control algorithm with saturating actuators and relaxed persistence of excitation,” IEEE transactions on neural networks and learning systems, vol. 27, no. 11, pp. 2386–2398, 2015.

M. Van, M. Mavrovouniotis, and S. S. Ge, “An adaptive backstepping nonsingular fast terminal sliding mode control for robust fault tolerant control of robot manipulators,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 49, no. 7, pp. 1448–1458, 2018.

C. Yang, D. Huang, W. He, and L. Cheng, “Neural control of robot manipulators with trajectory tracking constraints and input saturation,” IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 9, pp. 4231–4242, 2020.

H. Zargarzadeh, T. Dierks, and S. Jagannathan, “Adaptive neural network-based optimal control of nonlinear continuous-time systems in strict-feedback form,” International Journal of Adaptive Control and Signal Processing, vol. 28, no. 3-5, pp. 305–324, 2014.

S. Zeghloul, M. A. Laribi, and J.-P. Gazeau, “Robotics and mechatronics,” in Proceedings of the 4th IFToMM International Symposium on Robotics and Mechatronics. Springer, 2015.

L. Zhang, Y. Wang, Y. Hou, and H. Li, “Fixed-time sliding mode control for uncertain robot manipulators,” IEEE Access, vol. 7, pp. 149 750–149 763, 2019.

S. Ling, H. Wang, and P. X. Liu, “Adaptive fuzzy dynamic surface control of flexible-joint robot systems with input saturation,” IEEE/CAA Journal of Automatica Sinica, vol. 6, no. 1, pp. 97–107, 2019. DOI: https://doi.org/10.1109/JAS.2019.1911330

N. T. Luy, “Reinforecement learning-based optimal tracking control for wheeled mobile robot,” in 2012 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER). IEEE, 2012, pp. 371–376. DOI: https://doi.org/10.1109/CYBER.2012.6392582

L. F. Modares H and J. ZP, “h∞ tracking control of completely unknown continuous-time systems via off-policy reinforcement learning,” IEEE Trans Neural Netw Learn Syst, vol. 26, no. 10, pp. 2550–62, 2015. DOI: https://doi.org/10.1109/TNNLS.2015.2441749

V.-C. Nguyen, A.-T. Vo, and H.-J. Kang, “A non-singular fast terminal sliding mode control based on third-order sliding mode observer for a class of second-order uncertain nonlinear systems and its application to robot manipulators,” IEEE Access, vol. 8, pp. 78 109–78 120, 2020. DOI: https://doi.org/10.1109/ACCESS.2020.2989613

L. Nguyen Tan, “Distributed optimal control for nonholonomic systems with input constraints and uncertain interconnections,” Nonlinear Dynamics, vol. 93, no. 2, pp. 801–817, 2018. DOI: https://doi.org/10.1007/s11071-018-4228-8

L. N. Tan, “Distributed h∞ optimal tracking control for strict-feedback nonlinear large-scale systems with disturbances and saturating actuators,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 50, no. 11, pp. 4719–4731, 2018. DOI: https://doi.org/10.1109/TSMC.2018.2861470

L. N. Tan and T. C. Pham, “Optimal tracking control for pmsm with partially unknown dynamics, saturation voltages, torque, and voltage disturbances,” IEEE Transactions on Industrial Electronics, vol. 69, no. 4, pp. 3481–3491, 2021. DOI: https://doi.org/10.1109/TIE.2021.3075892

T. N. Truong, A. T. Vo, and H.-J. Kang, “A backstepping global fast terminal sliding mode control for trajectory tracking control of industrial robotic manipulators,” IEEE Access, vol. 9, pp. 31 921–31 931, 2021. DOI: https://doi.org/10.1109/ACCESS.2021.3060115

K. G. Vamvoudakis and F. L. Lewis, “Online actor–critic algorithm to solve the continuous-time infinite horizon optimal control problem,” Automatica, vol. 46, no. 5, pp. 878–888, 2010. DOI: https://doi.org/10.1016/j.automatica.2010.02.018

K. G. Vamvoudakis, M. F. Miranda, and J. P. Hespanha, “Asymptotically stable adaptive– optimal control algorithm with saturating actuators and relaxed persistence of excitation,” IEEE transactions on neural networks and learning systems, vol. 27, no. 11, pp. 2386–2398, 2015. DOI: https://doi.org/10.1109/TNNLS.2015.2487972

M. Van, M. Mavrovouniotis, and S. S. Ge, “An adaptive backstepping nonsingular fast terminal sliding mode control for robust fault tolerant control of robot manipulators,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 49, no. 7, pp. 1448–1458, 2018. DOI: https://doi.org/10.1109/TSMC.2017.2782246

C. Yang, D. Huang, W. He, and L. Cheng, “Neural control of robot manipulators with trajectory tracking constraints and input saturation,” IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 9, pp. 4231–4242, 2020. DOI: https://doi.org/10.1109/TNNLS.2020.3017202

H. Zargarzadeh, T. Dierks, and S. Jagannathan, “Adaptive neural network-based optimal control of nonlinear continuous-time systems in strict-feedback form,” International Journal of Adaptive Control and Signal Processing, vol. 28, no. 3-5, pp. 305–324, 2014. DOI: https://doi.org/10.1002/acs.2432

S. Zeghloul, M. A. Laribi, and J.-P. Gazeau, “Robotics and mechatronics,” in Proceedings of the 4th IFToMM International Symposium on Robotics and Mechatronics. Springer, 2015. DOI: https://doi.org/10.1007/978-3-319-22368-1

L. Zhang, Y. Wang, Y. Hou, and H. Li, “Fixed-time sliding mode control for uncertain robot manipulators,” IEEE Access, vol. 7, pp. 149 750–149 763, 2019. DOI: https://doi.org/10.1109/ACCESS.2019.2946866

X. Zhao, B. Tao, L. Qian, and H. Ding, “Model-based actor-critic learning for optimal tracking control of robots with input saturation,” IEEE Transactions on Industrial Electronics, vol. 68, no. 6, pp. 5046–5056, 2020. DOI: https://doi.org/10.1109/TIE.2020.2992003

J. Zhou, E. Liu, X. Tian, and Z. Li, “Adaptive fuzzy backstepping control based on dynamic surface control for uncertain robotic manipulator,” IEEE Access, vol. 10, pp. 23 333–23 341, 2022. DOI: https://doi.org/10.1109/ACCESS.2022.3154779

Q. Zhou, S. Zhao, H. Li, R. Lu, and C. Wu, “Adaptive neural network tracking control for robotic manipulators with dead zone,” IEEE Transactions on Neural Networks and Learning Systems, vol. 30, no. 12, pp. 3611–3620, 2018. DOI: https://doi.org/10.1109/TNNLS.2018.2869375