Potential-based reward shaping for finite horizon online POMDP planning
Crossref DOI link: https://doi.org/10.1007/s10458-015-9292-6
Published Online: 2015-03-05
Published Print: 2016-05
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Authors: Eck, Adam; Soh, Leen-Kiat; Devlin, Sam; Kudenko, Daniel
Text and Data Mining valid from 2015-03-05