Reward Function Design Method for Long Episode Pursuit Tasks Under Polar Coordinate in Multi-Agent Reinforcement Learning
Crossref DOI link: https://doi.org/10.1007/s12204-024-2713-4
Published Online: 2024-04-08
Published Print: 2024-08
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Dong, Yubo
Cui, Tao
Zhou, Yufan
Song, Xun
Zhu, Yue
Dong, Peng
Text and Data Mining valid from 2024-04-08
Version of Record valid from 2024-04-08
Article History
Received: 2 June 2023
Accepted: 10 October 2023
First Online: 8 April 2024
Ethics
: <b>Conflict of Interest</b> The authors declare that they have no conflict of interest.