OSA's Digital Library

Journal of the Optical Society of America A

Journal of the Optical Society of America A

| OPTICS, IMAGE SCIENCE, AND VISION

  • Editor: Franco Gori
  • Vol. 29, Iss. 6 — Jun. 1, 2012
  • pp: 928–935

Efficient object detection and tracking in video sequences

Fadi Dornaika and Fadi Chakik  »View Author Affiliations


JOSA A, Vol. 29, Issue 6, pp. 928-935 (2012)
http://dx.doi.org/10.1364/JOSAA.29.000928


View Full Text Article

Enhanced HTML    Acrobat PDF (863 KB)





Browse Journals / Lookup Meetings

Browse by Journal and Year


   


Lookup Conference Papers

Close Browse Journals / Lookup Meetings

Article Tools

Share
Citations

Abstract

One of the most important problems in computer vision is the computation of the two-dimensional projective transformation (homography) that maps features of planar objects in different images and videos. This computation is required by many applications such as image mosaicking, image registration, and augmented reality. The real-time performance imposes constraints on the methods used. In this paper, we address the real-time detection and tracking of planar objects in a video sequence where the object of interest is given by a reference image template. Most existing approaches for homography estimation are based on two steps: feature extraction (first step) followed by a combinatorial optimization method (second step) to match features between the reference template and the scene frame. This paper has two main contributions. First, we detect both planar and nonplanar objects via efficient object feature classification in the input images, which is applied prior to performing the matching step. Second, for the tracking part (planar objects), we propose a fast method for the computation of the homography that is based on the transferred object features and their associated local raw brightness. The advantage of the proposed schemes is a fast matching as well as fast and robust object registration that is given by either a homography or three-dimensional pose.

© 2012 Optical Society of America

OCIS Codes
(100.0100) Image processing : Image processing
(150.0150) Machine vision : Machine vision
(150.1135) Machine vision : Algorithms
(100.4994) Image processing : Pattern recognition, image transforms
(100.4999) Image processing : Pattern recognition, target tracking

ToC Category:
Machine Vision

History
Original Manuscript: December 21, 2011
Manuscript Accepted: February 13, 2012
Published: May 21, 2012

Citation
Fadi Dornaika and Fadi Chakik, "Efficient object detection and tracking in video sequences," J. Opt. Soc. Am. A 29, 928-935 (2012)
http://www.opticsinfobase.org/josaa/abstract.cfm?URI=josaa-29-6-928


Sort:  Author  |  Year  |  Journal  |  Reset  

References

  1. S. Prince, K. Xu, and A. Cheok, “Augmented reality camera tracking with homographies,” IEEE Comput. Graph. Appl. 22, 39–45 (2002). [CrossRef]
  2. M. Ozuysal, P. Fua, and V. Lepetit, “Fast keypoint recognition in ten lines of code,” in IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 2007), pp. 1–8.
  3. P. Viola and M. Jones, “Robust real-time object detection,” Int. J. Comput. Vis. 57, 137–154 (2004). [CrossRef]
  4. P. David, D. Dementhon, R. Duraiswami, and H. Samet, “Softposit: simultaneous pose and correspondence determination,” Int. J. Comput. Vis. 59, 259–284 (2004). [CrossRef]
  5. B. Boufama and D. Oconnell, “Identification and matching of planes in a pair of uncalibrated images,” Int. J. Pattern Recogn. Artif. Intell. 17, 1127–1143 (2003). [CrossRef]
  6. E. Tola, P. Fua, and V. Lepetit, “A fast local descriptor for dense matching,” in IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 2008), pp. 1–8.
  7. T. Zhang and C. Tomasi, “Fast, robust, and consistent camera motion estimation,” in IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 1999), pp. 164–170.
  8. B. Zitova and J. Flusser, “Image registration methods: a survey,” Image Vis. Comput. 21, 977–1000 (2003). [CrossRef]
  9. J. R. Cozar, N. Guil, and E. Zapata, “Planar object detection under scaled orthographic projection,” Pattern Recogn. Lett. 23, 719–729 (2002). [CrossRef]
  10. A. Yilmaz, O. Javed, and M. Shah, “Object tracking: a survey,” ACM Comput. Surv. 38, 1–45 (2006). [CrossRef]
  11. P. Dollar, B. Babenko, S. Belongie, P. Perona, and Z. Tu, “Multiple component learning for object detection,” in European Conference on Computer Vision (Springer-Verlag, 2008), pp. 211–224.
  12. A. Heidari and P. Aarabi, “Real-time object tracking on iPhone,” in Proceedings of the International Symposium on Visual Computing, Vol. 6938 of Lecture Notes in Computer Science (Springer-Verlag, 2011), pp. 768–777.
  13. M. Brown and D. G. Lowe, “Automatic panoramic image stitching using invariant features,” Int. J. Comput. Vis. 74, 59–73 (2007). [CrossRef]
  14. Y. Matsushita, E. Ofek, W. Ge, X. Tang, and H. Shum, “Full frame video stabilization with motion inpainting,” IEEE Trans. Pattern Anal. Mach. Intell. 28, 1150–1163 (2006). [CrossRef]
  15. R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision (Cambridge University, 2000).
  16. P. Perez and N. Garcia, “Robust and accurate registration of images with unknown relative orientation and exposure,” in IEEE International Conference on Image Processing (IEEE, 2005), Vol. III, pp. 1104–1107.
  17. F. Dornaika and J. Elder, “Image registration for foveated omnidirectional sensing,” in Proceedings of the European Conference on Computer Vision, Vol. 2353 of Lecture Notes in Computer Science (Springer-Verlag, 2002), pp. 606–620.
  18. B. Pires and P. Aguiar, “Featureless global alignment of multiple images,” in IEEE International Conference on Image Processing (IEEE, 2005), Vol. I, pp. 57–60.
  19. F. Dornaika, “Registering conventional images with low resolution panoramic images,” in Proceedings of the International Conference on Computer Vision Systems (Applied Computer Science Group, 2007), pp. 1–10.
  20. R. Guerreiro and P. Aguiar, “Global motion estimation: feature-based, featureless, or both?!” in International Conference on Image Analysis and Recognition (Springer-Verlag, 2006), pp. 721–730.
  21. J. M. Buenaposada and L. Baumela, “Real-time tracking and estimation of plane pose,” in IEEE International Conference on Pattern Recognition (IEEE, 2002), pp. 697–700.
  22. Y. Altunbasak, R. Merserau, and A. Patti, “A fast parametric motion estimation algorithm with illumination and lens distortion correction,” IEEE Trans. Image Process. 12, 395–408 (2003). [CrossRef]
  23. J. Kannala, M. Salo, and J. Heikkila, “Algorithms for computing a planar homography from conics in correspondence,” in British Machine Vision Conference (BMVA, 2006), pp. 77–86.
  24. O. Faugeras and Q. Luong, The Geometry of Multiple Images (MIT, 2001).
  25. D. Lowe, “Distinctive image features from scale invariant keypoints,” Int. J. Comput. Vis. 60, 91–110 (2004). [CrossRef]
  26. H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool, “SURF: Speeded Up Robust Features,” Comput. Vis. Image Underst. 110, 346–359 (2008). [CrossRef]
  27. M. Agrawal, K. Konolige, and M. Blas, “CenSurE: center surround extremas for realtime feature detection and matching,” in European Conference on Computer Vision (Springer-Verlag, 2008), pp. 102–115.
  28. C. Cortes and V. Vapnick, “Support-vector networks,” Mach. Learn. 20, 273–297 (1995). [CrossRef]
  29. C. M. Bishop, Pattern Recognition and Machine Learning (Springer, 2006).
  30. B. Scholkopf, J. Platt, J. Shawe-Taylor, A. Smola, and R. Williamson, “Estimating the support of a high-dimensional distribution,” Tech. Rep. (Microsoft Research, 2000).
  31. M. de Berg, O. Cheong, M. Kreveld, and M. Overmars, Computational Geometry: Algorithms and Applications (Springer, 2008).
  32. M. A. Fischler and R. C. Bolles, “Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography,” Commun. ACM 24, 381–395 (1981). [CrossRef]
  33. W. H. Press, S. A. Teukolsky, W. T. Wetterling, and B. P. Flannery, Numerical Recipes, The Art of Scientific Computing (Cambridge University, 1992).

Cited By

Alert me when this paper is cited

OSA is able to provide readers links to articles that cite this paper by participating in CrossRef's Cited-By Linking service. CrossRef includes content from more than 3000 publishers and societies. In addition to listing OSA journal articles that cite this paper, citing articles from other participating publishers will also be listed.


« Previous Article  |  Next Article »

OSA is a member of CrossRef.

CrossCheck Deposited