Crossmark

Bandit algorithms for policy learning: methods, implementation, and welfare-performance

Published Online: 2024-09-17

Published Print: 2024-07

Authors

Kitagawa, Toru

Rowley, Jeff

License Information

Text and Data Mining valid from 2024-07-01

Version of Record valid from 2024-07-01

More Information

Article History

Received: 15 June 2024

Revised: 31 August 2024

Accepted: 31 August 2024

First Online: 17 September 2024

Document is current