Crossmark

Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching

Published Online: 2024-12-16

Published Print: 2025-02

Authors

Denkert, Robert

Pham, Huyên https://orcid.org/0000-0002-9758-3550
Warin, Xavier
License Information

Text and Data Mining valid from 2024-12-16

Version of Record valid from 2024-12-16
More Information

Article History

Accepted: 29 November 2024

First Online: 16 December 2024

Declarations

:

: The authors have no relevant or non-financial interests to disclose.

Document is current