Dynamically Adaptive Tracking of Gestures and Facial Expressions*

D. Metaxas1, G. Tsechpenakis1, Z. Li1, Y. Huang1, and A. Kanaujia1

1 Center for Computational Biomedicine, Imaging and Modeling (CBIM), Computer Science Department, Rutgers University, 110 Frelinghuysen Rd, Piscataway, NJ 08854

Abstract. We present a dynamic data-driven framework for tracking gestures and facial expressions from monocular sequences. Our system uses two cameras, one for the face view and one for the body view, so that each view is processed at its own scale. In the gesture tracking module, we track the hands and the head, obtaining as output the blobs (ellipses) of the ROIs, and we detect the shoulder positions using straight lines. For the facial expressions, we first extract the 2D facial features by fusing a KLT tracker with a modified Active Shape Model, and then obtain the 3D face mask by fitting a generic model to the extracted 2D features. The main advantages of our system are (i) its adaptivity, i.e., it is robust to external conditions, e.g., lighting, and independent of the examined individual, and (ii) its computational efficiency, providing results both off- and online at rates higher than 20 fps.

* This research has been funded by an NSF-ITR/NGS-0313134 grant and an NSF-ITR-[ASE+ECS]-0428231 Collaborative Project grant to the first author.

LNCS 3993, pp. 554-561.
