An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
Crossref DOI link: https://doi.org/10.1007/s10994-018-5727-z
Published Online: 2018-07-03
Published Print: 2018-09
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Joseph, Ajin George
Bhatnagar, Shalabh
Text and Data Mining valid from 2018-07-03
Article History
Received: 4 November 2017
Accepted: 8 June 2018
First Online: 3 July 2018