Convergence of Markov decision processes with constraints and state-action dependent discount factors
Crossref DOI link: https://doi.org/10.1007/s11425-017-9292-1
Published Online: 2019-02-15
Published Print: 2020-01
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Wu, Xiao
Guo, Xianping
Text and Data Mining valid from 2019-02-15
Article History
Received: 2 January 2017
Accepted: 27 March 2018
First Online: 15 February 2019