Policy gradient in Lipschitz Markov Decision Processes
Crossref DOI link: https://doi.org/10.1007/s10994-015-5484-1
Published Online: 2015-03-03
Published Print: 2015-09
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Pirotta, Matteo
Restelli, Marcello
Bascetta, Luca
Text and Data Mining valid from 2015-03-03