Research on LSTM-driven UAV path planning

Dianyi Zhou; Xi Du; Shiyi Liu; Qingyu Su; Hongyang Guo

doi:10.1117/12.3049651

8 November 2024 Research on LSTM-driven UAV path planning

Dianyi Zhou, Xi Du, Shiyi Liu, Qingyu Su, Hongyang Guo

Author Affiliations +

Proceedings Volume 13416, Fourth International Conference on Advanced Algorithms and Neural Networks (AANN 2024); 134163R (2024) https://doi.org/10.1117/12.3049651
Event: 2024 4th International Conference on Advanced Algorithms and Neural Networks, 2024, Qingdao, China

Abstract

In decision problems, single step decision making refers to making a decision at each time step based only on the current state, without considering the long-term state or future effects. This approach is suitable for those scenarios with immediate feedback and operational impact, but can be challenging when facing complex and long-term dependent environments. We will explore the advantages and disadvantages of single step decision making and how this strategy can be used to optimize the decision process in practice. This innovative algorithm integrates the memory capabilities of recurrent neural networks (RNNs) into deep reinforcement learning frameworks. Unlike traditional Deep Q Network (DQN) setups, where feedforward neural networks are typically used for the the RPP-LSTM employs an LSTM network as the Q-value network. This integration allows the Q network to retain memory of previous environmental states and actions, thereby addressing the myopic nature of decision-making prevalent in methods. By leveraging LSTM's ability to capture and utilize temporal dependencies, the RPP-LSTM algorithm enhances the UAV's path planning capability by considering a broader context of environmental changes and past decisions. This approach is particularly beneficial in dynamic environments where the immediate decision based solely on current state information may not be optimal. The LSTM-equipped Q-value network can effectively learn and adapt to varying environmental conditions, leading in tasks. Furthermore, the incorporates a stratified punishment and reward mechanism designed to optimize the rationality of UAV path planning. This function encourages the UAV to make decisions that not only achieve immediate goals but also contribute to long-term planning objectives, ensuring strategic adaptability in complex scenarios. Simulation results demonstrate the superiority of the RPP-LSTM algorithm over traditional approaches relying on feedforward neural networks (FNNs). It exhibits enhanced adaptability to complex environments and achieves superior performance in terms of both robustness and accuracy in real-time UAV path planning scenarios. This integration of LSTM with deep reinforcement learning represents a significant advancement towards more intelligent and effective autonomous UAV operations in dynamic and challenging environments.

(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.

Citation Download Citation

Dianyi Zhou, Xi Du, Shiyi Liu, Qingyu Su, and Hongyang Guo "Research on LSTM-driven UAV path planning", Proc. SPIE 13416, Fourth International Conference on Advanced Algorithms and Neural Networks (AANN 2024), 134163R (8 November 2024); https://doi.org/10.1117/12.3049651

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

;

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
8 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Unmanned aerial vehicles

Distance measurement

Decision making

Education and training

Neural networks

Evolutionary algorithms

Ultrasonics

Show All Keywords

Keywords/Phrases

Search In:

Publication Years