基于深度强化学习的蜂窝连接无人机轨迹优化方法及仿真-AET-电子技术应用

基于深度强化学习的蜂窝连接无人机轨迹优化方法及仿真

电子技术应用

李胜昌1，张振峡1，王乐和1，吴佳华2，于果2，孟祥禄2

1.中国兵器工业计算机应用技术研究所；2.大连理工大学

摘要： 蜂窝连接无人机（cellular-connected UAV）在5G网络中展现出巨大潜力。针对其在执行通信任务时需保持稳定地面基站连接的问题，研究了无人机轨迹优化。目标是在给定区域内，使无人机从起点到终点的飞行过程中联合最小化任务完成时间与通信中断时间，并最大化通信吞吐量。由于该问题具有非凸性，采用基于多步学习的竞争性双深度Q网络（Dueling Double Deep Q Network， D3QN）算法求解，通过无人机与地面基站的交互学习实现自适应轨迹优化。仿真结果表明，与直飞策略相比，该方法将任务完成时间缩短了28%，通信中断时长降低了42%，系统平均吞吐量提升了35%，在任务效率、通信稳定性和系统吞吐量等方面均实现显著优化。

关键词： 无人机蜂窝通信深度强化学习轨迹优化竞争性双深度Q网络

中图分类号：TN929.53 文献标志码：A DOI: 10.16157/j.issn.0258-7998.257391
中文引用格式： 李胜昌，张振峡，王乐和，等. 基于深度强化学习的蜂窝连接无人机轨迹优化方法及仿真[J]. 电子技术应用，2026，52(4)：29-37.
英文引用格式： Li Shengchang，Zhang Zhenxia，Wang Lehe，et al. A deep reinforcement learning-based trajectory optimization method and simulation for cellular-connected UAVs[J]. Application of Electronic Technique，2026，52(4)：29-37.

A deep reinforcement learning-based trajectory optimization method and simulation for cellular-connected UAVs

Li Shengchang1，Zhang Zhenxia1，Wang Lehe1，Wu Jiahua2，Yu Guo2，Meng Xianglu2

1.Computer Application Technology Research Institute of China North Industries Group Corporation Limited；2.Dalian University of Technology

Abstract： Cellular-connected unmanned aerial vehicles (UAVs) show great potential in 5G networks. To address the challenge of maintaining stable connections with ground base stations during communication tasks, this paper investigates an UAV trajectory optimization problem. The objective is to jointly minimize the task completion time and communication outage duration while maximizing communication throughput as the UAV travels from the starting point to the destination within a given area. Considering the non-convex nature of the problem, a multi-step learning-based Dueling Double Deep Q Network (D3QN) algorithm is adopted to achieve adaptive trajectory optimization through interactive learning between the UAV and ground base stations. Simulation results show that, compared with the direct flight strategy, this method reduces the task completion time by 28%, cuts down the communication outage duration by 42%, and increases the average system throughput by 35%, achieving significant improvements in task efficiency, communication stability and system throughput.

Key words : unmanned aerial vehicle；cellular communications；deep reinforcement learning；trajectory optimization；dueling double deep Q-network

引言

近年来，无人机（Unmanned Aerial Vehicle, UAV）凭借其高机动性与灵活部署能力，在诸多领域迅速发展[1][2]。在无人机的各类应用场景中，保障其飞行安全与提升任务完成时效至关重要，而这两者均依赖于稳定且高质量的空地通信链路。为此，蜂窝连接无人机（cellular-connected UAV）作为无人机与地面基站之间的重要无线连接方式，受到广泛关注[3]。与通信距离受限的传统WiFi方案相比，蜂窝连接不受短距约束[4]，可实现真正意义上的远程控制；相较于覆盖范围广的卫星通信，蜂窝连接无人机亦具有显著成本优势。尤其是在第五代移动通信技术[4]的支撑下，蜂窝连接无人机有望显著提升空地链路的速率与可靠性。近年来，蜂窝连接无人机已在搜救、空中监测、交通管控、航拍以及包裹投递等领域获得成功应用。

本文详细内容请下载：

https://www.chinaaet.com/resource/share/2000007034

作者信息：

李胜昌1，张振峡1，王乐和1，吴佳华2，于果2，孟祥禄2

（1.中国兵器工业计算机应用技术研究所，北京 100083；2.大连理工大学，辽宁大连 116000）

原创声明：此内容为AET网站原创，未经授权禁止转载。

相关内容