Asynchronous hierarchical deep reinforcement learning with learnable reward shaping for distributed multi-UCAV air combat decision
Crossref DOI link: https://doi.org/10.1007/s11431-025-3130-x
Published Online: 2026-01-04
Published Print: 2026-01
Authors
Zheng, Yifan
Xin, Bin
Chen, Jie
Jiao, Keming
Zhao, Zhixin
Text and Data Mining valid from 2026-01-01
Version of Record valid from 2026-01-01
Article History
Received: 31 July 2025
Accepted: 5 November 2025
First Online: 4 January 2026