Graduate School and Research Center in Digital Sciences


Eurecom - Digital Security 
CIFRE doctoral student (2014 - 2017)


Acoustic scene classification: contributions to fundamental and applied research


Acoustic context information may be used by microphone-equipped devices to adapt their behaviour or configuration to a particular scenario. Recognising such scenarios from the acoustic context is the goal of acoustic scene classification (ASC). The choice of audio sensors over alternatives (e.g. motion or light sensors) is a natural one: almost all mobile and smart devices are equipped with at least one microphone.


Almost all previous solutions to ASC rely on feature extraction approaches designed specifically for speech and music genre recognition and are thus not necessarily optimal for ASC. Further limitations of existing solutions relate to the requirements for real-time and low-footprint implementations. These requirements must be met if ASC algorithms are to be deployed on low-power, always-listening devices.


The work reported in this thesis aims to address these limitations and hence to reduce the gap between academic and industrial research in terms of methods, protocols and metrics. Accordingly, this thesis presents the ASC problem from a dual perspective. It comprises contributions to fundamental research, which concern standard protocols and methods, and contributions to applied research, which concern the adaptation of current methods to 'real-world' applications.