Policy Transfer Reinforcement Learning Method for Partially Observable Conditions
Zhongyu WANG, Xiaopeng XU, Dong WANG
Modern Defense Technology. 2024, (2): 63-71. DOI: 10.3969/j.issn.1009-086x.2024.02.007