Regularization and Two Time Scales for Convergence of Reinforcement Learning
Crossref DOI link: https://doi.org/10.1007/s00245-025-10304-z
Published Online: 2025-08-31
Published Print: 2025-10
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Carvalho, Diogo S. https://orcid.org/0000-0003-3008-7322
Santos, Pedro A.
Melo, Francisco S.
Funding for this research was provided by:
Universidade de Lisboa
Text and Data Mining valid from 2025-08-31
Version of Record valid from 2025-08-31
Article History
Accepted: 5 August 2025
First Online: 31 August 2025
Declarations
:
: The authors have no relevant financial or non-financial interests to disclose.