TU Dortmund Department of Computer Science LS XII Pattern Recognition GroupPublications → Publication Details

Reverberation-Robust Online Multi-Speaker Tracking by using a Microphone Array and CASA Processing


Axel Plinge AND Marius H. Hennecke AND Gernot A. Fink
Proc. 13th International Workshop on Acoustic Signal Enhancement (IWAENC), Aachen, Germany, 2012.

Online tracking of speakers is an important task for applications in smart environments such as camera control, meeting annotation and speech separation. Challenges for an audio-only system are small-room reverberation, noise, the unknown number of speakers, and gaps occurring in natural speech. Combining models from neurobiology and cognitive psychology with many-channel signal processing and pattern recognition techniques, a hybrid method was developed. By employing online CASA processing to signals from a microphone array, the real-time capable method is able to track an arbitrary number of concurrent moving speakers in highly reverberant environments.

 [bib] [pdf] [data] [demo]