Bandit algorithms for policy learning: methods, implementation, and welfare-performance
Crossref DOI link: https://doi.org/10.1007/s42973-024-00165-6
Published Online: 2024-09-17
Kitagawa, Toru
Rowley, Jeff
Text and Data Mining valid from 2024-09-17
Version of Record valid from 2024-09-17
Article History
Received: 15 June 2024
Revised: 31 August 2024
Accepted: 31 August 2024
First Online: 17 September 2024