Skip to main navigation Skip to search Skip to main content

Adaptive Control of MIMO UAVs for Prioritized Harvesting via Hierarchical Reinforcement Learning

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This work addresses the control and coordination of a fleet of MIMO-capable UAVs for efficiently harvesting prioritized traffic from a random distribution of heterogeneous MIMO-capable users. The objective is to maximize the infinite horizon average reward per UAV subject to mobility and average power constraints. Consequently, this problem is formulated as a Dynamic Pickup and Delivery Problem (DPDP) and solved via a Hierarchical Reinforcement Learning (HRL) framework. In the upper tier of the policy hierarchy, a double deep Q-network adaptively partitions the DPDP into a series of static Pickup and Delivery Problems (PDPs) of varying timescales by dynamically caching and releasing batches of user requests. Subsequently, the lower tier employs a mixed integer programming construction, modeled as a multiple Traveling Salesman Problem (mTSP) with capacity and resource constraints, wherein the goal is to optimize user association and scheduling of the released batch of user requests via graphical branch-and-bound, 3D UAV service positions via zero-forcing beam-forming and two-stage exhaustive grid search, and 3D UAV trajectories using learning based competitive swarm optimization. Simulations prove that the HRL solution outperforms static UAV deployments (63%), adaptive Voronoi decompositions (33%), and state-of-the-art iterative fleet automation algorithms (69%), vis-á-vis user quality-of-service and UAV power efficiency.

Original languageEnglish (US)
Title of host publicationConference Record of the 58th Asilomar Conference on Signals, Systems and Computers, ACSSC 2024
EditorsMichael B. Matthews
PublisherIEEE Computer Society
Pages7-11
Number of pages5
ISBN (Electronic)9798350354058
DOIs
StatePublished - 2024
Event58th Asilomar Conference on Signals, Systems and Computers, ACSSC 2024 - Hybrid, Pacific Grove, United States
Duration: Oct 27 2024Oct 30 2024

Publication series

NameConference Record - Asilomar Conference on Signals, Systems and Computers
ISSN (Print)1058-6393

Conference

Conference58th Asilomar Conference on Signals, Systems and Computers, ACSSC 2024
Country/TerritoryUnited States
CityHybrid, Pacific Grove
Period10/27/2410/30/24

ASJC Scopus subject areas

  • Signal Processing
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Adaptive Control of MIMO UAVs for Prioritized Harvesting via Hierarchical Reinforcement Learning'. Together they form a unique fingerprint.

Cite this