On using deep reinforcement learning to balance power consumption and latency in 5G NR