Trajectory optimization for autonomous flying base station via reinforcement learning