A perspective on off-policy evaluation in reinforcement learning
Crossref DOI link: https://doi.org/10.1007/s11704-019-9901-7
Published Online: 2019-06-17
Published Print: 2019-10
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Li, Lihong
Text and Data Mining valid from 2019-06-17
Version of Record valid from 2019-06-17
Article History
Received: 4 April 2019
First Online: 17 June 2019