Model-aided deep reinforcement learning for sample-efficient UAV trajectory design in IoT networks