Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching
Crossref DOI link: https://doi.org/10.1007/s00245-024-10207-5
Published Online: 2024-12-16
Published Print: 2025-02
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Denkert, Robert
Pham, Huyên https://orcid.org/0000-0002-9758-3550
Warin, Xavier
Text and Data Mining valid from 2024-12-16
Version of Record valid from 2024-12-16
Article History
Accepted: 29 November 2024
First Online: 16 December 2024
Declarations
:
: The authors have no relevant or non-financial interests to disclose.