Exploration Bonuses Based on Upper Confidence Bounds for Sparse Reward Games
Crossref DOI link: https://doi.org/10.1007/978-3-319-71649-7_14
Published Online: 2017-12-22
Published Print: 2017
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Mizukami, Naoki
Suzuki, Jun
Kameko, Hirotaka
Tsuruoka, Yoshimasa
License valid from 2017-01-01