TU Dortmund → Department of Computer Science → LS XII → Pattern Recognition Group → Publications → Publication Details
B. Schauerte, J. Richarz and G. A. Fink IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, Taipei, Taiwan, 2010.
When persons interact, non-verbal cues are used to direct the attention of persons towards objects of interest. Achieving joint attention this way is an important aspect of natural communication. Most importantly, it allows to couple verbal descriptions with the visual appearance of objects, if the referred-to object is non-verbally indicated. In this contribution, we present a system that utilizes bottom-up saliency and pointing gestures to efficiently identify pointed-at objects. Furthermore, the system focuses the visual attention by steering a pan-tilt-zoom camera towards the object of interest and thus provides a suitable model-view for SIFT-based recognition and learning. We demonstrate the practical applicability of the proposed system through experimental evaluation in different environments with multiple pointers and objects.