Liu, D., Cui, J., Zhang, J., Yang, C. and Hanzo, L., 2021. Deep Reinforcement Learning Aided Packet-Routing For Aeronautical Ad-Hoc Networks Formed by Passenger Planes. IEEE Transactions on Vehicular Technology, 70 (5), 5166-5171.
Full text available as:
|
PDF
AANET_DRL_routing.pdf - Accepted Version Available under License Creative Commons Attribution Non-commercial. 2MB | |
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material. |
Abstract
Data packet routing in aeronautical ad-hoc networks (AANETs) is challenging due to their high-dynamic topology. In this paper, we invoke deep reinforcement learning for routing in AANETs aiming at minimizing the end-to-end (E2E) delay. Specifically, a deep Q-network (DQN) is conceived for capturing the relationship between the optimal routing decision and the local geographic information observed by the forwarding node. The DQN is trained in an offline manner based on historical flight data and then stored by each airplane for assisting their routing decisions during flight. To boost the learning efficiency and the online adaptability of the proposed DQN-routing, we further exploit the knowledge concerning the system’s dynamics by using a deep value network (DVN) conceived with a feedback mechanism. Our simulation results show that both DQN-routing and DVN-routing achieve lower E2E delay than the benchmark protocol, and DVN-routing performs similarly to the optimal routing that relies on perfect global information.
Item Type: | Article |
---|---|
ISSN: | 0018-9545 |
Additional Information: | © 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works |
Uncontrolled Keywords: | AANET; routing; deep reinforcement learning |
Group: | Faculty of Science & Technology |
ID Code: | 35262 |
Deposited By: | Symplectic RT2 |
Deposited On: | 12 Mar 2021 09:26 |
Last Modified: | 14 Mar 2022 14:26 |
Downloads
Downloads per month over past year
Repository Staff Only - |