N. W. D. Evans, J. S. Mason
Proc. EUSIPCO, volume 1, pages 539-542, 2002
Abstract: This paper addresses the problem of noise estimation in the context of speech processing. A recently proposed quantile-based approach to noise estimation has the merit of not relying on the explicit detection of speech, non-speech boundaries. Here this approach is extended to both time and frequency. The resultant time-frequency quantile-based noise esimation is shown to give superior ASR perfomrnace. Results on the Aurora 2 Distributed Speech Reocgnition Database show an average relative preformacne improcement over the ETSI front-end baseline of 35%. The merits of the new system include: the relatively few parameters to optimise, the independence of absolute signal levels and minimal latency, all of which assist in real-time implementations.