Acoustic context recognition using local binary pattern codebooks

Battaglino, Daniele; Lepauloux, Ludovick; Pilati, Laurent; Evans, Nicholas
WASPAA 2015, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 18-21 October 2015, New Paltz, NY, USA

Automatic context recognition enables mobile devices to adapt their configuration to different environments and situations. This paper investigates the use of acoustic cues as a means of recognising context. The majority of existing approaches exploit Mel-scaled cepstral coefficients (MFCCs) developed for the analysis of speech signals. The hypothesis in this paper is that new features are needed in order to capture complex acoustic structure. The paper introduces the use of local binary pattern (LBP) analysis which is used to complement MFCCs with acoustic texture information. The second
contribution relates to a bag-of-features extension which clusters LBPs into a small number of codewords. Both approaches outperform the current state of the art and the latter is particularly appealing for embedded applications in which computational efficiency is paramount.

New Paltz
Sécurité numérique
Eurecom Ref:
© 2015 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.