Nona: A stochastic congestion-aware job scheduler for real-time inference queries