Learning to rest: A Q-learning approach to flying base station trajectory design with landing spots

Bayerlein, Harald; Gangula, Rajeev; Gesbert, David
ASILOMAR 2018, 52nd Asilomar Conference on Signals, Systems and Computers, 28-31 October 2018, Pacific Grove, USA

Abstract--We consider the problem of trajectory optimization for an autonomous UAV-mounted base station that provides communication services to ground users with the aid of landing spots (LSs). Recently, the concept of LSs was introduced to alleviate the problem of short mission durations arising from the limited on-board battery budget of the UAV, which severely limits network performance. In this work, using Q-learning, a
model-free reinforcement learning (RL) technique, we train a neural network (NN) to make movement decisions for the UAV that maximize the data collected from the ground users while minimizing power consumption by exploiting the landing spots. We show that the system intelligently integrates landing spots into the trajectory to extend flying time and is able to learn the topology of the network over several flying epochs without any
explicit information about the environment.

DOI
Type:
Conférence
City:
Pacific Grove
Date:
2018-10-28
Department:
Systèmes de Communication
Eurecom Ref:
5761
Copyright:
© 2018 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK : https://www.eurecom.fr/publication/5761