Monomicrophone blind audio source separation using EM-Kalman filters and short+long term AR modeling

Bensaid, Siouar;Schutz, Antony; Slock, Dirk T M
Asilomar 2009, 43rd Asilomar Conference on Signals Systems and Computers, November 1-4, 2009, Asilomar, USA

Blind sources separation (BSS) arises in a variety of fields in speech processing such as speech enhancement, speakers diarization and identification. Generally, methods for BSS consider several observations of the same recording. Single microphone analysis is the worst underdetermined case, but, it's also the more realistic one. In our approach, the autoregressive structure (short term prediction) and the periodic signature (long term prediction) of voiced speech signal are jointly modeled. The filters parameters are extracted using a combined version of the EM-Algorithm with the Rauch-Tung-Striebel optimal smoother while the fixed-lag Kalman smoother algorithm is used for the initialization.


DOI
Type:
Conférence
City:
Asilomar
Date:
2009-11-01
Department:
Systèmes de Communication
Eurecom Ref:
2878
Copyright:
© 2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK : https://www.eurecom.fr/publication/2878