TU Dortmund Department of Computer Science LS XII Pattern Recognition GroupPublications → Publication Details

Detecting Hands in Video Images Using Scale Invariant Local Descriptors.


J. Richarz, T. Pl{\"o}tz and G. A. Fink
Proc. IASTED Int. Conf. on Visualization, Imaging and Image Processing (VIIP 2007), pages 259-264, Palma de Mallorca, Spain, 2007.

In this paper, we describe our approach on hand detection in cluttered images using scale invariant features. We claim that, while modelling hands as a whole is bound to fail because of their strongly articulated nature, treating them as a collection of weakly connected characteristic regions seems promising. Different approaches to finding and robustly modelling such regions - or local object descriptors - invariantly to scale and orientation of the object in question have been proposed. As an example, we demonstrate our approach using the well-known scale-invariant feature transform (SIFT), combined with a region-based postprocessing to eliminate false positives. We present detailed results on a large set of images from a realistic interaction scenario with a smart room.

 [bib] [pdf]