OSA's Digital Library

Applied Optics

Applied Optics

APPLICATIONS-CENTERED RESEARCH IN OPTICS

  • Vol. 43, Iss. 2 — Jan. 10, 2004
  • pp: 366–378

Three-dimensional tracking of multiple skin-colored regions by a moving stereoscopic system

Antonis A. Argyros and Manolis I. A. Lourakis  »View Author Affiliations


Applied Optics, Vol. 43, Issue 2, pp. 366-378 (2004)
http://dx.doi.org/10.1364/AO.43.000366


View Full Text Article

Enhanced HTML    Acrobat PDF (1037 KB)





Browse Journals / Lookup Meetings

Browse by Journal and Year


   


Lookup Conference Papers

Close Browse Journals / Lookup Meetings

Article Tools

Share
Citations

Abstract

A system that performs three-dimensional (3D) tracking of multiple skin-colored regions (SCRs) in images acquired by a calibrated, possibly moving stereoscopic rig is described. The system consists of a collection of techniques that permit the modeling and detection of SCRs, the determination of their temporal association in monocular image sequences, the establishment of their correspondence between stereo images, and the extraction of their 3D positions in a world-centered coordinate system. The development of these techniques has been motivated by the need for robust, near-real-time tracking performance. SCRs are detected by use of a Bayesian classifier that is trained with the aid of a novel technique. More specifically, the classifier is bootstrapped with a small set of training data. Then, as new images are being processed, an iterative training procedure is employed to refine the classifier. Furthermore, a technique is proposed to enable the classifier to cope with changes in illumination. Tracking of SCRs in time as well as matching of SCRs in the images of the employed stereo rig is performed through computationally inexpensive and robust techniques. One of the main characteristics of the skin-colored region tracker (SCRT) instrument is its ability to report the 3D positions of SCRs in a world-centered coordinate system by employing a possibly moving stereo rig with independently verging CCD cameras. The system operates on images of dimensions 640 × 480 pixels at a rate of 13 Hz on a conventional Pentium 4 processor at 1.8 GHz. Representative experimental results from the application of the SCRT to image sequences are also provided.

© 2004 Optical Society of America

OCIS Codes
(150.6910) Machine vision : Three-dimensional sensing
(330.0330) Vision, color, and visual optics : Vision, color, and visual optics

History
Original Manuscript: May 15, 2003
Revised Manuscript: September 19, 2003
Published: January 10, 2004

Citation
Antonis A. Argyros and Manolis I. A. Lourakis, "Three-dimensional tracking of multiple skin-colored regions by a moving stereoscopic system," Appl. Opt. 43, 366-378 (2004)
http://www.opticsinfobase.org/ao/abstract.cfm?URI=ao-43-2-366


Sort:  Author  |  Year  |  Journal  |  Reset  

References

  1. K. Meyer, H. L. Applewhite, F. A. Biocca, “A survey of position trackers,” Presence 1, 173–200 (1992).
  2. D. M. Gavrila, “The visual analysis of human movement: a survey,” Comput. Vis. Image Underst. 73, 82–98 (1999). [CrossRef]
  3. Q. Delamarre, O. Faugeras, “3D articulated models and multi-view tracking with physical forces,” Comput. (Vis. Image Underst. 81, 328–357 (2001). [CrossRef]
  4. T. S. Jebara, A. Pentland, “Parameterized structure from motion for 3D adaptive feedback tracking of faces,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1997), pp. 144–150. [CrossRef]
  5. S. H. Kim, N. K. Kim, S. C. Ahn, H. G. Kim, “Object oriented face detection using range and color information,” in Proceedings of the 3rd IEEE International Conference on Automatic Face and Gesture Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1998), pp. 76–81.
  6. M. J. Jones, J. M. Rehg, “Statistical color models with application to skin detection,” in IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1999), pp. 274–280.
  7. D. Saxe, R. Foulds, “Toward robust skin identification in video images,” 2nd International Conference on Automatic Face and Gesture Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1996), pp. 379–384.
  8. D. Chai, K. N. Ngan, “Locating the facial region of a head-and-shoulders color image,” in Proceedings of the 3rd IEEE International Conference on Automatic Face and Gesture Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1998), pp. 124–129. [CrossRef]
  9. M. H. Yang, N. Ahuja, “Detecting Human Faces in Color Images,” in Proceedings of the IEEE International Conference on Image Processing (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1998), pp. 127–130.
  10. J. C. Terrillon, M. N. Shirazi, H. Fukamachi, S. Akamatsu, “Comparative performance of different skin chrominance models and chrominance spaces for the automatic detection of human faces in color images,” in Proceedings of IEEE International Conference on Face and Gesture Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 2000), pp. 54–61. [CrossRef]
  11. J. Cai, A. Goshtasby, “Detecting human faces in color images,” Image Vis. Comput. 18, 63–75 (1999). [CrossRef]
  12. S. McKenna, Y. Raja, S. Gong, “Tracking color objects using adaptive mixture models,” Image Vis. Comput. 17, 225–231 (1999). [CrossRef]
  13. Y. Raja, S. McKenna, G. Gong, “Tracking and segmenting people in varying lighting conditions using color,” in Proceedings of the 3rd IEEE International Conference on Automatic Face and Gesture Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1998), pp. 228–233. [CrossRef]
  14. T. S. Jebara, A. Pentland, “Parametrized structure from motion for 3D adaptive feedback tracking of faces,” in Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1997), pp. 144–150. [CrossRef]
  15. T. S. Jebara, K. Russel, A. Pentland, “Mixture of eigenfeatures for real-time structure from texture,” in Proceedings of the Sixth International Conference on Computer Vision (Narosa, Bombay, 1998), pp. 128–135.
  16. M. H. Yang, N. Ahuja, Face Detection and Gesture Recognition for Human-Computer Interaction (Kluwer Academic, Dordrecht, The Netherlands, 2001). [CrossRef]
  17. M. H. Yang, D. J. Kriegman, N. Ahuja, “Detecting faces in images: a survey,” IEEE Trans. Pattern Anal. Mach. Intell. 24, 34–58 (2002). [CrossRef]
  18. R. E. Kalman, “A new approach to linear filtering and prediction problems,” Trans. ASME Ser. D. 82, 35–45 (1960). [CrossRef]
  19. D. Comaniciu, V. Ramesh, P. Meer, “Real-time tracking of non-rigid objects using mean shift,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 2000), pp. 142–151.
  20. O. Javed, M. Shah, “Tracking and object classification for automated surveillance,” in European Conference on Computer Vision (Springer-Verlag, Berlin, 2002), pp. 343–357.
  21. N. T. Siebel, S. Maybank, “Fusion of multiple tracking algorithms for robust people tracking,” in European Conference on Computer Vision (Springer-Verlag, Berlin, 2002), pp. 373–387.
  22. M. Spengler, B. Schiele, “Multi-object tracking based on a modular knowledge hierarchy,” in International Conference on Computer Vision Systems (Springer-VerlagHeidelberg, 2003), pp. 376–385. [CrossRef]
  23. C. Stauffer, W. Grimson, “Adaptive background mixture models for real-time tracking,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 1999), pp. 246–252.
  24. J. Triesch, C. von der Malsburg, “Democratic integration: self-organized integration of adaptive cues,” Neural Comput. 13, 2049–2074 (2001). [CrossRef] [PubMed]
  25. R. Fablet, M. J. Black, “Automatic detection and tracking of human motion with a view-based representation,” in European Conference on Computer Vision (Springer-Verlag, Berlin, 2002), pp. 476–491.
  26. M. Isard, A. Blake, “Icondensation: unifying low-level and high-level tracking in a stochastic framework,” in European Conference on Computer Vision (Springer-Verlag, Berlin, 1998), pp. 893–908.
  27. J. Vermaak, P. Perez, M. Gangnet, A. Blake, “Towards improved observation models for visual tracking: selective adaptation,” in European Conference on Computer Vision (Springer-Verlag, Berlin, 2002), pp. 645–660.
  28. C. Hue, J.-P. Le Cadre, P. Pérez, “Sequential Monte Carlo methods for multiple target tracking and data fusion,” IEEE Trans. Signal Process. 50, 309–325 (2002). [CrossRef]
  29. M. Isard, J. MacCormick, “Bramble: a Bayesian multiple-blob tracker,” in Proceedings of the International Conference on Computer Vision ICCV (IEEE Computer Society, Los Alamitos, Calif., 2001).
  30. E. Koller-Meier, F. Ade, “Tracking multiple objects using the condensation algorithm,” J. Robot. Auton. Syst. 34(2–3), 93–105 (2001). [CrossRef]
  31. P. Perez, C. Hue, J. Vermaak, M. Gangnet, “Color-based probabilistic tracking,” Proceedings of the European Conference on Computer Vision (Springer-Verlag, Berlin, 2002), pp. 661–675.
  32. Y. Li, A. Hilton, J. Illingworth, “A relaxation algorithm for real-time multiple view 3D-tracking,” Image Vis. Comput. 20, 841–859 (2002). [CrossRef]
  33. T. Inaguma, K. Oomura, H. Saji, H. Nakatani, “Efficient Search Technique for Hand Gesture Tracking in Three Dimensions”, in International Workshop on Biologically Motivated Computer Vision (Springer-Verlag, Berlin, 2000), pp. 594–601. [CrossRef]
  34. R. Hartley, P. Sturm, “Triangulation,” Compu. Vis. Image Underst. 68, 146–157 (1997). [CrossRef]
  35. O. Faugeras, Q.-T. Luong, T. Papadopoulo, The Geometry of Multiple Images (MIT Press, Cambridge, Mass., 2001).
  36. S. O. Orphanoudakis, A. A. Argyros, M. Vincze, “Towards a cognitive vision methodology: understanding and interpreting activities of experts,” ERCIM News, No. 53 (ERCIM EEIG, Sophia-Antipolis, France, 2003); http://www.ercim.org .
  37. K. Jack, Video Demystified: A Handbook for the Digital Engineer (HighText, Solana Beach, Calif., 1993).
  38. D. A. Forsyth, J. Ponce, Computer Vision: A Modern Approach (Prentice-Hall, Englewood Cliffs, N.J., 2003).
  39. J. F. Canny, “A computational approach to edge detection,” IEEE Trans. Pattern Anal. Mach. Intell. 8, 769–798 (1986).
  40. L. Robert, C. Zeller, O. D. Faugeras, M. Hebert, “Applications of non-metric vision to some visually guided robotic tasks,” in Visual Navigation: From Biological Systems to Unmanned Ground Vehicles, Y. Aloimonos, ed. (Erlbaum, Hillsdale, N.J., 1997), Chap. 5, pp. 89–134.
  41. Z. Zhang, “Determining the epipolar geometry and its uncertainty: a review,” Int. J. Comput. Vision 27, 161–195 (1998). [CrossRef]
  42. H. Hirschmüller, “Improvements in real-time correlation-based stereo vision,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (Institute of Electrical and Electronics Engineers, Piscataway, N.J., 2001), pp. 141–148.
  43. R. Goldman, “Intersection of two lines in three-space,” in Graphics Gems, A. S. Glassner, ed. (Academic, San Diego, Calif., 1990), Vol. 1, p. 304.
  44. M. I. A. Lourakis, A. A. Argyros, “Efficient 3D camera matchmoving using markerless, segmentation-free plane tracking,” Technical Report ICS/FORTH-TR-324 (Institute of Computer Science, Foundation for Research and Technology—Hellas, Heraklion, Greece, Sept.2003).

Cited By

Alert me when this paper is cited

OSA is able to provide readers links to articles that cite this paper by participating in CrossRef's Cited-By Linking service. CrossRef includes content from more than 3000 publishers and societies. In addition to listing OSA journal articles that cite this paper, citing articles from other participating publishers will also be listed.


« Previous Article  |  Next Article »

OSA is a member of CrossRef.

CrossCheck Deposited