Crossmark

Windows deep transformer Q-networks: an extended variance reduction architecture for partially observable reinforcement learning

Published Online: 2024-11-27

Published Print: 2025-01

Authors

Wang, Zijian

Wang, Bin https://orcid.org/0000-0002-0092-3951
Dou, Hongbo

Liu, Zhongyuan
License Information

Text and Data Mining valid from 2024-11-27

Version of Record valid from 2024-11-27
More Information

Article History

Accepted: 6 November 2024

First Online: 27 November 2024

Declarations

:

: The authors declare that there is no conflict of interest.

: Not applicable.

: Not applicable.

: Not applicable.

Document is current