2024
|
Sun, Haining; Tang, Xiaoqiang; Ge, Shuzhi Sam Compliant Tracking Control and Force Redistribution for a Portable Cable-Driven Robot IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, DOI: 10.1109/TMECH.2024.3457890. Abstract | BibTeX | Endnote @article{ISI:001329005900001,
title = {Compliant Tracking Control and Force Redistribution for a Portable Cable-Driven Robot},
author = {Haining Sun and Xiaoqiang Tang and Shuzhi Sam Ge},
doi = {10.1109/TMECH.2024.3457890},
times_cited = {0},
issn = {1083-4435},
year = {2024},
date = {2024-10-01},
journal = {IEEE-ASME TRANSACTIONS ON MECHATRONICS},
publisher = {IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC},
address = {445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA},
abstract = {This study introduces the development of a portable cable-driven robot with a compliant trajectory tracking method that incorporates real-time and continuous positive force redistribution. Based on the dynamic model of the robot, a rapid-convergence tracking controller exhibiting variable compliance to external disturbances is designed to ensure stable trajectory tracking. A real-time, continuous positive force redistribution process is seamlessly integrated into the tracking controller to ensure that cable forces remain positive and continuous throughout the motion. Low compliance allows the robot to resist external disturbances and maintain accurate trajectory tracking. Conversely, high compliance permits the robot to temporarily sacrifice some tracking accuracy for safety and then resume tracking once the disturbance subsides. Lyapunov stability analysis is utilized to validate the stability of the control system. Experiments are conducted on the designed robot to evaluate the practical applicability of the control method. Results demonstrate the feasibility of the force redistribution without compromising trajectory tracking performance.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
This study introduces the development of a portable cable-driven robot with a compliant trajectory tracking method that incorporates real-time and continuous positive force redistribution. Based on the dynamic model of the robot, a rapid-convergence tracking controller exhibiting variable compliance to external disturbances is designed to ensure stable trajectory tracking. A real-time, continuous positive force redistribution process is seamlessly integrated into the tracking controller to ensure that cable forces remain positive and continuous throughout the motion. Low compliance allows the robot to resist external disturbances and maintain accurate trajectory tracking. Conversely, high compliance permits the robot to temporarily sacrifice some tracking accuracy for safety and then resume tracking once the disturbance subsides. Lyapunov stability analysis is utilized to validate the stability of the control system. Experiments are conducted on the designed robot to evaluate the practical applicability of the control method. Results demonstrate the feasibility of the force redistribution without compromising trajectory tracking performance. - FNClarivate Analytics Web of Science
- VR1.0
- PTJ
- AUSun, HN
Tang, XQ
Ge, SS
- AFHaining Sun
Xiaoqiang Tang
Shuzhi Sam Ge
- TICompliant Tracking Control and Force Redistribution for a Portable Cable-Driven Robot
- SOIEEE-ASME TRANSACTIONS ON MECHATRONICS
- LAEnglish
- DTArticle
- DERobots; Force; Power Cables; Trajectory Tracking; Accuracy; Vectors; Trajectory; Robot Kinematics; Termination Of Employment; Sun; Cable-driven Robot; Force Redistribution; Motion Control
- IDPARALLEL ROBOTS; REDUNDANCY RESOLUTION; INTERNAL FORCE; MANIPULATORS
- ABThis study introduces the development of a portable cable-driven robot with a compliant trajectory tracking method that incorporates real-time and continuous positive force redistribution. Based on the dynamic model of the robot, a rapid-convergence tracking controller exhibiting variable compliance to external disturbances is designed to ensure stable trajectory tracking. A real-time, continuous positive force redistribution process is seamlessly integrated into the tracking controller to ensure that cable forces remain positive and continuous throughout the motion. Low compliance allows the robot to resist external disturbances and maintain accurate trajectory tracking. Conversely, high compliance permits the robot to temporarily sacrifice some tracking accuracy for safety and then resume tracking once the disturbance subsides. Lyapunov stability analysis is utilized to validate the stability of the control system. Experiments are conducted on the designed robot to evaluate the practical applicability of the control method. Results demonstrate the feasibility of the force redistribution without compromising trajectory tracking performance.
- C1[Sun, Haining; Tang, Xiaoqiang] Tsinghua Univ, Dept Mech Engn, State Key Lab Tribol, Beijing 100084, Peoples R China.
[Sun, Haining] Agcy Sci Technol & Res, Singapore Inst Mfg Technol, Singapore 138634, Singapore. [Ge, Shuzhi Sam] Natl Univ Singapore, Inst Funct Intelligent Mat, Dept Elect & Comp Engn, Singapore 117583, Singapore - C3Tsinghua University; Agency for Science Technology & Research (A*STAR); A*STAR - Singapore Institute of Manufacturing Technology (SIMTech); Institute for Functional Intelligent Materials (I-FIM); National University of Singapore
- RPTang, XQ (corresponding author), Tsinghua Univ, Dept Mech Engn, State Key Lab Tribol, Beijing 100084, Peoples R China; Ge, SS (corresponding author), Natl Univ Singapore, Inst Funct Intelligent Mat, Dept Elect & Comp Engn, Singapore 117583, Singapore
- FUNational Natural Science Foundation of China [51975307]; Ministry of Education, Singapore, under its Research Centre of Excellence [EDUNC-33-18-279-V12]
- FXThis work was supported in part by the National Natural Science Foundation of China under Grant 51975307 and in part by the Ministry of Education, Singapore, under its Research Centre of Excellence award to the Institute for Functional Intelligent Materials (I-FIM, under Grant EDUNC-33-18-279-V12).
- NR36
- TC0
- Z90
- U10
- U20
- PUIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- PIPISCATAWAY
- PA445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA
- SN1083-4435
- J9IEEE-ASME TRANS MECHATRON
- JIIEEE-ASME Trans. Mechatron.
- PDOCT 1
- PY2024
- DI10.1109/TMECH.2024.3457890
- PG12
- WCAutomation & Control Systems; Engineering, Manufacturing; Engineering, Electrical & Electronic; Engineering, Mechanical
- SCAutomation & Control Systems; Engineering
- GAI3A1F
- UTWOS:001329005900001
- ER
- EF
|
Ma, Jie; Han, Zhiji; Li, Mingge; Liu, Zhijie; He, Wei; Ge, Shuzhi Sam Conductive hydrogels-based self-sensing soft robot state perception and trajectory tracking JOURNAL OF FIELD ROBOTICS, 2024, DOI: 10.1002/rob.22420. Abstract | BibTeX | Endnote @article{ISI:001298976600001,
title = {Conductive hydrogels-based self-sensing soft robot state perception and trajectory tracking},
author = {Jie Ma and Zhiji Han and Mingge Li and Zhijie Liu and Wei He and Shuzhi Sam Ge},
doi = {10.1002/rob.22420},
times_cited = {0},
issn = {1556-4959},
year = {2024},
date = {2024-08-28},
journal = {JOURNAL OF FIELD ROBOTICS},
publisher = {WILEY},
address = {111 RIVER ST, HOBOKEN 07030-5774, NJ USA},
abstract = {Soft robots face significant challenges in proprioceptive sensing and precise control due to their highly deformable and compliant nature. This paper addresses these challenges by developing a conductive hydrogel sensor and integrating it into a soft robot for bending angle measurement and motion control. A quantitative mapping between the hydrogel resistance and the robot's bending gesture is formulated. Furthermore, a nonlinear differentiator is proposed to estimate the angular velocity for closed-loop control, eliminating the reliance on conventional sensors. Meanwhile, a controller is designed to track both structural and nonstructural trajectories. The proposed approach integrates advanced soft sensing materials and intelligent control algorithms, significantly improving the proprioception and motion accuracy of soft robots. This work bridges the gap between novel material design and practical control applications, opening up new possibilities for soft robots to perform delicate tasks in various fields. The experimental results demonstrate the effectiveness of the proposed sensing and control approach in achieving precise and robust motion control of the soft robot.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Soft robots face significant challenges in proprioceptive sensing and precise control due to their highly deformable and compliant nature. This paper addresses these challenges by developing a conductive hydrogel sensor and integrating it into a soft robot for bending angle measurement and motion control. A quantitative mapping between the hydrogel resistance and the robot's bending gesture is formulated. Furthermore, a nonlinear differentiator is proposed to estimate the angular velocity for closed-loop control, eliminating the reliance on conventional sensors. Meanwhile, a controller is designed to track both structural and nonstructural trajectories. The proposed approach integrates advanced soft sensing materials and intelligent control algorithms, significantly improving the proprioception and motion accuracy of soft robots. This work bridges the gap between novel material design and practical control applications, opening up new possibilities for soft robots to perform delicate tasks in various fields. The experimental results demonstrate the effectiveness of the proposed sensing and control approach in achieving precise and robust motion control of the soft robot. - FNClarivate Analytics Web of Science
- VR1.0
- PTJ
- AUMa, J
Han, ZJ
Li, MG
Liu, ZJ
He, W
Ge, SS
- AFJie Ma
Zhiji Han
Mingge Li
Zhijie Liu
Wei He
Shuzhi Sam Ge
- TIConductive hydrogels-based self-sensing soft robot state perception and trajectory tracking
- SOJOURNAL OF FIELD ROBOTICS
- LAEnglish
- DTArticle
- DERobotics; Sensing
- ABSoft robots face significant challenges in proprioceptive sensing and precise control due to their highly deformable and compliant nature. This paper addresses these challenges by developing a conductive hydrogel sensor and integrating it into a soft robot for bending angle measurement and motion control. A quantitative mapping between the hydrogel resistance and the robot's bending gesture is formulated. Furthermore, a nonlinear differentiator is proposed to estimate the angular velocity for closed-loop control, eliminating the reliance on conventional sensors. Meanwhile, a controller is designed to track both structural and nonstructural trajectories. The proposed approach integrates advanced soft sensing materials and intelligent control algorithms, significantly improving the proprioception and motion accuracy of soft robots. This work bridges the gap between novel material design and practical control applications, opening up new possibilities for soft robots to perform delicate tasks in various fields. The experimental results demonstrate the effectiveness of the proposed sensing and control approach in achieving precise and robust motion control of the soft robot.
- C3University of Science & Technology Beijing; University of Science & Technology Beijing; University of Science & Technology Beijing; National University of Singapore; National University of Singapore; Institute for Functional Intelligent Materials (I-FIM)
- RPLiu, ZJ (corresponding author), Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing 100083, Peoples R China
- FXNational Key Research and Development Program of China; National Natural Science Foundation of China, Grant/Award Numbers:62103039, 62073030; Joint Fund of Ministry of Education for Equipment Pre-Research; Ministry of Education, Singapore, Grant/Award Number: EDUNC-33-18-279-V12
- NR56
- TC0
- Z90
- U16
- U26
- PUWILEY
- PIHOBOKEN
- PA111 RIVER ST, HOBOKEN 07030-5774, NJ USA
- SN1556-4959
- J9J FIELD ROBOT
- JIJ. Field Robot.
- PDAUG 28
- PY2024
- DI10.1002/rob.22420
- PG15
- WCRobotics
- SCRobotics
- GAD8Y2Q
- UTWOS:001298976600001
- ER
- EF
|
Sun, Haining; Tang, Xiaoqiang; Ge, Shuzhi Sam Controllable Trade-Off Between Performance and Constrained Input for Vibration Suppression in Flexible Space Structures IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 60 (3), pp. 3390-3402, 2024, DOI: 10.1109/TAES.2024.3361434. Abstract | BibTeX | Endnote @article{ISI:001246582400007,
title = {Controllable Trade-Off Between Performance and Constrained Input for Vibration Suppression in Flexible Space Structures},
author = {Haining Sun and Xiaoqiang Tang and Shuzhi Sam Ge},
doi = {10.1109/TAES.2024.3361434},
times_cited = {0},
issn = {0018-9251},
year = {2024},
date = {2024-06-01},
journal = {IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS},
volume = {60},
number = {3},
pages = {3390-3402},
publisher = {IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC},
address = {445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA},
abstract = {Achieving vibration suppression of flexible space structures requires not only considering the vibration settling time, but also avoiding excessive control forces that might damage the structure. In this article, an active control scheme is designed to suppress undesired vibrations in flexible structures by exerting small cable forces. It employs a cable-driven parallel robot (CDPR) as an actuator. The prominent feature of the controller lies in its ability to achieve a controllable trade-off between the effectiveness of vibration suppression and control inputs. Moreover, the controller can effectively suppress vibrations even when cable forces are constrained. The underlying principle is that the vibration energy is consumed through the negative work done by the cable forces until the tip displacement is ultimately reduced to a small range. The stability of the controller is verified by the Lyapunov method. Simulations demonstrate the effectiveness of the proposed control scheme, while experimental validation on a prototype consisting of a six-meter-long flexible structure and a four-cable CDPR further supports these findings.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Achieving vibration suppression of flexible space structures requires not only considering the vibration settling time, but also avoiding excessive control forces that might damage the structure. In this article, an active control scheme is designed to suppress undesired vibrations in flexible structures by exerting small cable forces. It employs a cable-driven parallel robot (CDPR) as an actuator. The prominent feature of the controller lies in its ability to achieve a controllable trade-off between the effectiveness of vibration suppression and control inputs. Moreover, the controller can effectively suppress vibrations even when cable forces are constrained. The underlying principle is that the vibration energy is consumed through the negative work done by the cable forces until the tip displacement is ultimately reduced to a small range. The stability of the controller is verified by the Lyapunov method. Simulations demonstrate the effectiveness of the proposed control scheme, while experimental validation on a prototype consisting of a six-meter-long flexible structure and a four-cable CDPR further supports these findings. - FNClarivate Analytics Web of Science
- VR1.0
- PTJ
- AUSun, HN
Tang, XQ
Ge, SS
- AFHaining Sun
Xiaoqiang Tang
Shuzhi Sam Ge
- TIControllable Trade-Off Between Performance and Constrained Input for Vibration Suppression in Flexible Space Structures
- SOIEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS
- LAEnglish
- DTArticle
- DEVibrations; Flexible Structures; Force; Aerospace Electronics; Vibration Control; Damping; Finite Element Analysis
- IDTRACKING; ATTITUDE; ROBOTS; BEAM
- ABAchieving vibration suppression of flexible space structures requires not only considering the vibration settling time, but also avoiding excessive control forces that might damage the structure. In this article, an active control scheme is designed to suppress undesired vibrations in flexible structures by exerting small cable forces. It employs a cable-driven parallel robot (CDPR) as an actuator. The prominent feature of the controller lies in its ability to achieve a controllable trade-off between the effectiveness of vibration suppression and control inputs. Moreover, the controller can effectively suppress vibrations even when cable forces are constrained. The underlying principle is that the vibration energy is consumed through the negative work done by the cable forces until the tip displacement is ultimately reduced to a small range. The stability of the controller is verified by the Lyapunov method. Simulations demonstrate the effectiveness of the proposed control scheme, while experimental validation on a prototype consisting of a six-meter-long flexible structure and a four-cable CDPR further supports these findings.
- C3Agency for Science Technology & Research (A*STAR); A*STAR - Singapore Institute of Manufacturing Technology (SIMTech); Tsinghua University; National University of Singapore; Institute for Functional Intelligent Materials (I-FIM)
- RPTang, XQ (corresponding author), Tsinghua Univ, Dept Mech Engn, State Key Lab Tribol, Beijing 100084, Peoples R China
- FXNo Statement Available
- NR52
- TC0
- Z90
- U18
- U28
- PUIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- PIPISCATAWAY
- PA445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA
- SN0018-9251
- J9IEEE TRANS AEROSP ELECTRON SY
- JIIEEE Trans. Aerosp. Electron. Syst.
- PDJUN
- PY2024
- VL60
- BP3390
- EP3402
- DI10.1109/TAES.2024.3361434
- PG13
- WCEngineering, Aerospace; Engineering, Electrical & Electronic; Telecommunications
- SCEngineering; Telecommunications
- GAUF2O6
- UTWOS:001246582400007
- ER
- EF
|
Wang, Hao; Sun, Bin; Ge, Shuzhi Sam; Su, Jie; Jin, Ming Liang On non-von Neumann flexible neuromorphic vision sensors NPJ FLEXIBLE ELECTRONICS, 8 (1), 2024, DOI: 10.1038/s41528-024-00313-3. Abstract | BibTeX | Endnote @article{ISI:001215635300003,
title = {On non-von Neumann flexible neuromorphic vision sensors},
author = {Hao Wang and Bin Sun and Shuzhi Sam Ge and Jie Su and Ming Liang Jin},
doi = {10.1038/s41528-024-00313-3},
times_cited = {2},
year = {2024},
date = {2024-05-07},
journal = {NPJ FLEXIBLE ELECTRONICS},
volume = {8},
number = {1},
publisher = {NATURE PORTFOLIO},
address = {HEIDELBERGER PLATZ 3, BERLIN, 14197, GERMANY},
abstract = {The structure and mechanism of the human visual system contain rich treasures, and surprising effects can be achieved by simulating the human visual system. In this article, starting from the human visual system, we compare and discuss the discrepancies between the human visual system and traditional machine vision systems. Given the wide variety and large volume of visual information, the use of non-von Neumann structured, flexible neuromorphic vision sensors can effectively compensate for the limitations of traditional machine vision systems based on the von Neumann architecture. Firstly, this article addresses the emulation of retinal functionality and provides an overview of the principles and circuit implementation methods of non-von Neumann computing architectures. Secondly, in terms of mimicking the retinal surface structure, this article introduces the fabrication approach for flexible sensor arrays. Finally, this article analyzes the challenges currently faced by non-von Neumann flexible neuromorphic vision sensors and offers a perspective on their future development.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
The structure and mechanism of the human visual system contain rich treasures, and surprising effects can be achieved by simulating the human visual system. In this article, starting from the human visual system, we compare and discuss the discrepancies between the human visual system and traditional machine vision systems. Given the wide variety and large volume of visual information, the use of non-von Neumann structured, flexible neuromorphic vision sensors can effectively compensate for the limitations of traditional machine vision systems based on the von Neumann architecture. Firstly, this article addresses the emulation of retinal functionality and provides an overview of the principles and circuit implementation methods of non-von Neumann computing architectures. Secondly, in terms of mimicking the retinal surface structure, this article introduces the fabrication approach for flexible sensor arrays. Finally, this article analyzes the challenges currently faced by non-von Neumann flexible neuromorphic vision sensors and offers a perspective on their future development. - FNClarivate Analytics Web of Science
- VR1.0
- PTJ
- AUWang, H
Sun, B
Ge, SS
Su, J
Jin, ML
- AFHao Wang
Bin Sun
Shuzhi Sam Ge
Jie Su
Ming Liang Jin
- TIOn non-von Neumann flexible neuromorphic vision sensors
- SONPJ FLEXIBLE ELECTRONICS
- LAEnglish
- DTArticle
- IDNEURAL-NETWORK; GANGLION-CELLS; ANALOG; PHOTODETECTOR; ULTRAFAST; MEMORY; PHOTOTRANSISTORS; PLASTICITY; MONOLAYER; FRAMEWORK
- ABThe structure and mechanism of the human visual system contain rich treasures, and surprising effects can be achieved by simulating the human visual system. In this article, starting from the human visual system, we compare and discuss the discrepancies between the human visual system and traditional machine vision systems. Given the wide variety and large volume of visual information, the use of non-von Neumann structured, flexible neuromorphic vision sensors can effectively compensate for the limitations of traditional machine vision systems based on the von Neumann architecture. Firstly, this article addresses the emulation of retinal functionality and provides an overview of the principles and circuit implementation methods of non-von Neumann computing architectures. Secondly, in terms of mimicking the retinal surface structure, this article introduces the fabrication approach for flexible sensor arrays. Finally, this article analyzes the challenges currently faced by non-von Neumann flexible neuromorphic vision sensors and offers a perspective on their future development.
- C1[Wang, Hao; Jin, Ming Liang] Qingdao Univ, Inst Future, Sch Automat, Shandong Key Lab Ind Control Technol, Qingdao 266071, Peoples R China.
[Sun, Bin; Su, Jie] Qingdao Univ, Coll Elect & Informat Engn, Qingdao 266071, Peoples R China. [Ge, Shuzhi Sam] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117583, Singapore. [Ge, Shuzhi Sam] Natl Univ Singapore, Inst Funct Intelligent Mat, Singapore 117583, Singapore - C3Qingdao University; Qingdao University; National University of Singapore; National University of Singapore; Institute for Functional Intelligent Materials (I-FIM)
- RPJin, ML (corresponding author), Qingdao Univ, Inst Future, Sch Automat, Shandong Key Lab Ind Control Technol, Qingdao 266071, Peoples R China; Su, J (corresponding author), Qingdao Univ, Coll Elect & Informat Engn, Qingdao 266071, Peoples R China; Ge, SS (corresponding author), Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117583, Singapore; Ge, SS (corresponding author), Natl Univ Singapore, Inst Funct Intelligent Mat, Singapore 117583, Singapore
- FUNational Natural Science Foundation of China (National Science Foundation of China) [201909099]; Young Taishan Scholars Program of Shandong Province [EDUNC-33-18-279-V12]; Ministry of Education, Singapore [52003134, 12374088]; National Natural Science Foundation of China
- FXThe work was financially supported by the Young Taishan Scholars Program of Shandong Province (grant nos. 201909099) to M.L. Jin, Ministry of Education, Singapore, under its Research Centre of Excellence award to the Institute for Functional Intelligent Materials (I-FIM, project No. EDUNC-33-18-279-V12) to S.S. Ge, and National Natural Science Foundation of China (grant nos. 52003134 and 12374088) to M.L. Jin and J. Su.
- NR164
- TC2
- Z92
- U121
- U221
- PUNATURE PORTFOLIO
- PIBERLIN
- PAHEIDELBERGER PLATZ 3, BERLIN, 14197, GERMANY
- J9NPJ FLEX ELECTRON
- JInpj Flex. Electron.
- PDMAY 7
- PY2024
- VL8
- DI10.1038/s41528-024-00313-3
- PG26
- WCEngineering, Electrical & Electronic; Materials Science, Multidisciplinary
- SCEngineering; Materials Science
- GAPQ8H5
- UTWOS:001215635300003
- ER
- EF
|
Zhang, Yuxiang; Liang, Xiaoling; Li, Dongyu; Ge, Shuzhi Sam; Gao, Bingzhao; Chen, Hong; Lee, Tong Heng Adaptive Safe Reinforcement Learning With Full-State Constraints and Constrained Adaptation for Autonomous Vehicles 12 IEEE TRANSACTIONS ON CYBERNETICS, 54 (3), pp. 1907-1920, 2024, DOI: 10.1109/TCYB.2023.3283771. Abstract | BibTeX | Endnote @article{ISI:001203365100010,
title = {Adaptive Safe Reinforcement Learning With Full-State Constraints and Constrained Adaptation for Autonomous Vehicles},
author = {Yuxiang Zhang and Xiaoling Liang and Dongyu Li and Shuzhi Sam Ge and Bingzhao Gao and Hong Chen and Tong Heng Lee},
doi = {10.1109/TCYB.2023.3283771},
times_cited = {12},
issn = {2168-2267},
year = {2024},
date = {2024-03-01},
journal = {IEEE TRANSACTIONS ON CYBERNETICS},
volume = {54},
number = {3},
pages = {1907-1920},
publisher = {IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC},
address = {445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA},
abstract = {High-performance learning-based control for the typical safety-critical autonomous vehicles invariably requires that the full-state variables are constrained within the safety region even during the learning process. To solve this technically critical and challenging problem, this work proposes an adaptive safe reinforcement learning (RL) algorithm that invokes innovative safety-related RL methods with the consideration of constraining the full-state variables within the safety region with adaptation. These are developed toward assuring the attainment of the specified requirements on the full-state variables with two notable aspects. First, thus, an appropriately optimized backstepping technique and the asymmetric barrier Lyapunov function (BLF) methodology are used to establish the safe learning framework to ensure system full-state constraints requirements. More specifically, each subsystem's control and partial derivative of the value function are decomposed with asymmetric BLF-related items and an independent learning part. Then, the independent learning part is updated to solve the Hamilton-Jacobi-Bellman equation through an adaptive learning implementation to attain the desired performance in system control. Second, with further Lyapunov-based analysis, it is demonstrated that safety performance is effectively doubly assured via a methodology of a constrained adaptation algorithm during optimization (which incorporates the projection operator and can deal with the conflict between safety and optimization). Therefore, this algorithm optimizes system control and ensures that the full set of state variables involved is always constrained within the safety region during the whole learning process. Comparison simulations and ablation studies are carried out on motion control problems for autonomous vehicles, which have verified superior performance with smaller variance and better convergence performance under uncertain circumstances. The effectiveness of the safe performance of overall system control with the proposed method accordingly has been verified.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
High-performance learning-based control for the typical safety-critical autonomous vehicles invariably requires that the full-state variables are constrained within the safety region even during the learning process. To solve this technically critical and challenging problem, this work proposes an adaptive safe reinforcement learning (RL) algorithm that invokes innovative safety-related RL methods with the consideration of constraining the full-state variables within the safety region with adaptation. These are developed toward assuring the attainment of the specified requirements on the full-state variables with two notable aspects. First, thus, an appropriately optimized backstepping technique and the asymmetric barrier Lyapunov function (BLF) methodology are used to establish the safe learning framework to ensure system full-state constraints requirements. More specifically, each subsystem's control and partial derivative of the value function are decomposed with asymmetric BLF-related items and an independent learning part. Then, the independent learning part is updated to solve the Hamilton-Jacobi-Bellman equation through an adaptive learning implementation to attain the desired performance in system control. Second, with further Lyapunov-based analysis, it is demonstrated that safety performance is effectively doubly assured via a methodology of a constrained adaptation algorithm during optimization (which incorporates the projection operator and can deal with the conflict between safety and optimization). Therefore, this algorithm optimizes system control and ensures that the full set of state variables involved is always constrained within the safety region during the whole learning process. Comparison simulations and ablation studies are carried out on motion control problems for autonomous vehicles, which have verified superior performance with smaller variance and better convergence performance under uncertain circumstances. The effectiveness of the safe performance of overall system control with the proposed method accordingly has been verified. - FNClarivate Analytics Web of Science
- VR1.0
- PTJ
- AUZhang, YX
Liang, XL
Li, DY
Ge, SS
Gao, BZ
Chen, H
Lee, TH
- AFYuxiang Zhang
Xiaoling Liang
Dongyu Li
Shuzhi Sam Ge
Bingzhao Gao
Hong Chen
Tong Heng Lee
- TIAdaptive Safe Reinforcement Learning With Full-State Constraints and Constrained Adaptation for Autonomous Vehicles
- SOIEEE TRANSACTIONS ON CYBERNETICS
- LAEnglish
- DTArticle
- DEAdaptive Dynamic Programming (ADP); Autonomous Vehicles; Barrier Lyapunov Function (BLF); Safe Reinforcement Learning (RL)
- IDBARRIER LYAPUNOV FUNCTIONS; NONLINEAR-SYSTEMS; TRACKING CONTROL
- ABHigh-performance learning-based control for the typical safety-critical autonomous vehicles invariably requires that the full-state variables are constrained within the safety region even during the learning process. To solve this technically critical and challenging problem, this work proposes an adaptive safe reinforcement learning (RL) algorithm that invokes innovative safety-related RL methods with the consideration of constraining the full-state variables within the safety region with adaptation. These are developed toward assuring the attainment of the specified requirements on the full-state variables with two notable aspects. First, thus, an appropriately optimized backstepping technique and the asymmetric barrier Lyapunov function (BLF) methodology are used to establish the safe learning framework to ensure system full-state constraints requirements. More specifically, each subsystem's control and partial derivative of the value function are decomposed with asymmetric BLF-related items and an independent learning part. Then, the independent learning part is updated to solve the Hamilton-Jacobi-Bellman equation through an adaptive learning implementation to attain the desired performance in system control. Second, with further Lyapunov-based analysis, it is demonstrated that safety performance is effectively doubly assured via a methodology of a constrained adaptation algorithm during optimization (which incorporates the projection operator and can deal with the conflict between safety and optimization). Therefore, this algorithm optimizes system control and ensures that the full set of state variables involved is always constrained within the safety region during the whole learning process. Comparison simulations and ablation studies are carried out on motion control problems for autonomous vehicles, which have verified superior performance with smaller variance and better convergence performance under uncertain circumstances. The effectiveness of the safe performance of overall system control with the proposed method accordingly has been verified.
- C3National University of Singapore; Institute for Functional Intelligent Materials (I-FIM); National University of Singapore; National University of Singapore; Beihang University; Tongji University; Tongji University
- RPGe, SS (corresponding author), Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117583, Singapore; Ge, SS (corresponding author), Natl Univ Singapore, Inst Funct Intelligent Mat, Singapore 117583, Singapore; Gao, BZ (corresponding author), Tongji Univ, Clean Energy Automot Engn Ctr, Shanghai 201804, Peoples R China
- FXNo Statement Available
- NR48
- TC11
- Z912
- U128
- U271
- PUIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- PIPISCATAWAY
- PA445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA
- SN2168-2267
- J9IEE TRANS CYBERN
- JIIEEE T. Cybern.
- PDMAR
- PY2024
- VL54
- BP1907
- EP1920
- DI10.1109/TCYB.2023.3283771
- PG14
- WCAutomation & Control Systems; Computer Science, Artificial Intelligence; Computer Science, Cybernetics
- SCAutomation & Control Systems; Computer Science
- UTWOS:001203365100010
- ER
- EF
|