Q-Learning-based setting of cell individual offset for handover of flying base stations