Offline minimax Q-function learning for undiscounted indefinite-horizon MDPs
Crossref DOI link: https://doi.org/10.1007/s10463-025-00924-1
Published Online: 2025-04-21
Published Print: 2025-08
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Li, Fengying
Li, Yuqiang
Wu, Xianyi
Bai, Wei
Text and Data Mining valid from 2025-04-21
Version of Record valid from 2025-04-21
Article History
Received: 5 September 2023
Revised: 6 November 2024
Accepted: 20 December 2024
First Online: 21 April 2025