An identifier-actor-optimizer policy learning architecture for optimal control of continuous-time nonlinear systems
Crossref DOI link: https://doi.org/10.1007/s11433-019-1481-2
Published Online: 2020-03-19
Published Print: 2020-06
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Cheng, Lin
Wang, ZhenBo
Jiang, FangHua
Li, JunFeng
Text and Data Mining valid from 2020-03-19
Version of Record valid from 2020-03-19
Article History
Received: 15 October 2019
Accepted: 26 November 2019
First Online: 19 March 2020