This is an outdated version published on 2026-01-30.

Autonomous Carrier Landing Control Strategy for VTOL UAVs Based on Deep Deterministic Policy Gradient Reinforcement Learning

Authors

Yiming Liu, School of Aerospace Engineering, Georgia Institute of Technology
Daniel R. Hoffman, Department of Electrical and Computer Engineering, University of California
Jianwei Wang, Department of Mechanical and Aerospace Engineering, The Hong Kong University of Science and Technology

DOI:

https://doi.org/10.9999/ijair.v1i1.5

Abstract

Autonomous shipboard recovery of vertical take-off and landing (VTOL) unmanned aerial vehicles (UAVs) is characterized by tight terminal constraints, rapidly varying wind disturbances, and deck motion induced by sea states. These factors lead to significant model uncertainty and render purely model-based designs brittle when the operating envelope broadens.
This paper develops an autonomous carrier-landing control strategy based on Deep Deterministic Policy Gradient (DDPG) for continuous control. Carrier recovery is formulated as a constrained Markov decision process (CMDP) using a deck-relative state representation and an action space consistent with common inner-loop attitude/thrust architectures. To improve training stability and reduce unsafe behaviors, we propose (i) a structured reward with explicit terminal touchdown constraints, (ii) constraint-aware termination and curriculum scheduling across approach phases, and (iii) domain randomization over aerodynamics, actuator dynamics, sensing latency/noise, wind gusts, and deck motion.
Comprehensive simulation studies demonstrate that the learned policy achieves higher landing success rates and lower touchdown dispersion than tuned PID guidance–control baselines under a wide range of perturbations. We further report ablations on reward terms and randomization ranges, and discuss practical considerations for sim-to-real transfer.
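
As an illustration of items (i) and (iii) above, the following Python sketch shows one way a structured reward with terminal touchdown checks and a per-episode domain-randomization sampler might be written. It is not the authors' implementation. The state fields, the touchdown limits, the reward weights, and the randomization ranges are all hypothetical values chosen for the example, since the abstract does not specify them.

```python
import math
import random
from dataclasses import dataclass


@dataclass
class DeckRelativeState:
    """UAV state expressed relative to the deck touchdown point (hypothetical layout)."""
    dx: float     # lateral offsets from the touchdown point (m)
    dy: float
    dz: float     # height above the deck (m)
    vx: float     # deck-relative velocities (m/s)
    vy: float
    vz: float     # negative when descending
    roll: float   # attitude relative to the deck (rad)
    pitch: float


# Hypothetical touchdown limits, for illustration only.
MAX_LATERAL_ERR = 0.5          # m
MAX_LATERAL_SPEED = 0.5        # m/s
MAX_SINK_RATE = 1.0            # m/s
MAX_TILT = math.radians(5.0)   # rad


def touchdown_ok(s: DeckRelativeState) -> bool:
    """Return True if every terminal touchdown constraint is satisfied."""
    return (
        math.hypot(s.dx, s.dy) <= MAX_LATERAL_ERR
        and math.hypot(s.vx, s.vy) <= MAX_LATERAL_SPEED
        and -s.vz <= MAX_SINK_RATE
        and abs(s.roll) <= MAX_TILT
        and abs(s.pitch) <= MAX_TILT
    )


def structured_reward(s: DeckRelativeState, touched_down: bool):
    """Dense shaping during approach plus a terminal bonus or penalty.

    Returns (reward, episode_done). The weights are assumed, not from the paper.
    """
    shaping = -0.1 * math.hypot(s.dx, s.dy) - 0.05 * abs(s.dz)
    if not touched_down:
        return shaping, False
    terminal = 100.0 if touchdown_ok(s) else -100.0
    return shaping + terminal, True


def sample_episode_params(rng: random.Random) -> dict:
    """Draw one set of randomized simulation parameters (assumed ranges)."""
    return {
        "mass_scale": rng.uniform(0.9, 1.1),           # aerodynamic/inertial mismatch
        "actuator_time_const_s": rng.uniform(0.02, 0.08),
        "sensor_latency_s": rng.uniform(0.0, 0.05),
        "sensor_noise_std": rng.uniform(0.0, 0.05),
        "gust_speed_mps": rng.uniform(0.0, 8.0),
        "deck_heave_amp_m": rng.uniform(0.0, 1.5),
        "deck_heave_period_s": rng.uniform(6.0, 12.0),
    }


if __name__ == "__main__":
    rng = random.Random(0)
    params = sample_episode_params(rng)
    state = DeckRelativeState(0.1, 0.0, 0.0, 0.0, 0.0, -0.5, 0.0, 0.0)
    print(params)
    print(structured_reward(state, touched_down=True))
```

In a training loop, a sampler of this kind would typically be called once at each episode reset, and the terminal branch of the reward would double as the constraint-aware termination signal described in item (ii).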

Published

2026-01-30 — Updated on 2026-01-30

How to Cite

Liu, Y., Hoffman, D. R., & Wang, J. (2026). Autonomous Carrier Landing Control Strategy for VTOL UAVs Based on Deep Deterministic Policy Gradient Reinforcement Learning. International Journal of Artificial Intelligence Research, 1(1). https://doi.org/10.9999/ijair.v1i1.5

Issue

Vol. 1 No. 1 (2026): International Journal of Artificial Intelligence Research

Section

Articles

License

This article is published under the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
