Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Reinforcement Learning-Based Cooperative Trajectory Planning for Unmanned Combat Aerial Vehicles and Decoy UAVs
Haozhe QI, Mingfa ZHENG, Xiaorong HU, Nan YANG
Modern Defense Technology    2026, 54 (3): 71-81.   DOI: 10.3969/j.issn.1009-086x.2026.03.007
Abstract7)   HTML0)    PDF (2932KB)(4)       Save

Unmanned aerial vehicle (UAV) cooperative combat is crucial in modern warfare. The cooperative mode between unmanned combat aerial vehicles (UCAVs) and decoy UAVs has gained significant attention due to its tactical value. This paper proposes a cooperative trajectory planning method based on the proximal policy optimization (PPO) algorithm for UCAV and decoy UAV strike missions against key enemy targets. We construct a Markov decision process (MDP) model incorporating dynamic threat assessment, integrating UAV kinematics and battlefield constraints, and design the state/action spaces and a hierarchical reward function. Simulation results demonstrate that the proposed method effectively guides UCAVs and decoys to achieve efficient cooperation in complex environments, significantly increasing mission success rates while reducing interception risks from enemy air defense systems. This provides theoretical and technical support for intelligent path planning in UAV cooperative operations.

Table and Figures | Reference | Related Articles | Metrics