OSA's Digital Library

Applied Optics

Applied Optics

APPLICATIONS-CENTERED RESEARCH IN OPTICS

  • Editor: Joseph N. Mait
  • Vol. 49, Iss. 10 — Apr. 1, 2010
  • pp: B1–B8

Adaptive hierarchical architecture for visual recognition

Fok H. C. Tivive, Abdesselam Bouzerdoum, Son Lam Phung, and Khan M. Iftekharuddin  »View Author Affiliations


Applied Optics, Vol. 49, Issue 10, pp. B1-B8 (2010)
http://dx.doi.org/10.1364/AO.49.0000B1


View Full Text Article

Enhanced HTML    Acrobat PDF (679 KB)





Browse Journals / Lookup Meetings

Browse by Journal and Year


   


Lookup Conference Papers

Close Browse Journals / Lookup Meetings

Article Tools

Share
Citations

Abstract

We propose a new hierarchical architecture for visual pattern classification. The new architecture consists of a set of fixed, directional filters and a set of adaptive filters arranged in a cascade structure. The fixed filters are used to extract primitive features such as orientations and edges that are present in a wide range of objects, whereas the adaptive filters can be trained to find complex features that are specific to a given object. Both types of filter are based on the biological mechanism of shunting inhibition. The proposed architecture is applied to two problems: pedestrian detection and car detection. Evaluation results on benchmark data sets demonstrate that the proposed architecture outperforms several existing ones.

© 2010 Optical Society of America

OCIS Codes
(100.2000) Image processing : Digital image processing
(100.5010) Image processing : Pattern recognition
(100.3008) Image processing : Image recognition, algorithms and filters

History
Original Manuscript: October 2, 2009
Revised Manuscript: December 18, 2009
Manuscript Accepted: December 26, 2009
Published: January 28, 2010

Virtual Issues
Vol. 5, Iss. 8 Virtual Journal for Biomedical Optics

Citation
Fok H. C. Tivive, Abdesselam Bouzerdoum, Son Lam Phung, and Khan M. Iftekharuddin, "Adaptive hierarchical architecture for visual recognition," Appl. Opt. 49, B1-B8 (2010)
http://www.opticsinfobase.org/ao/abstract.cfm?URI=ao-49-10-B1


Sort:  Author  |  Year  |  Journal  |  Reset  

References

  1. K. Fukushima, “Neocognitron: a hierarchical neural network capable of visual pattern recognition,” Neural Networks 1, 119-130 (1988). [CrossRef]
  2. D. H. Hubel and T. N. Wiesel, “Receptive fields, binocular interaction and functional architecture in the cat's visual cortex,” J. Physiol. (London) 160, 106-154 (1962).
  3. Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel, “Backpropagation applied to handwritten zip code recognition,” Neural Comput. 1, 541-551 (1989). [CrossRef]
  4. S. L. Phung and A. Bouzerdoum, “A pyramidal neural network for visual pattern recognition,” IEEE Trans. Neural Netw. 18, 329-343 (2007). [CrossRef]
  5. M. Riesenhuber and T. Poggio, “Hierarchical models of object recognition in cortex,” Nat. Neurosci. 2, 1019-1025 (1999). [CrossRef]
  6. G. Kreiman, C. Koch, and I. Fried, “Category-specific visual responses of single neurons in the human medial temporal lobe,” Nat. Neurosci. 3, 946-953 (2000). [CrossRef]
  7. E. B. Goldstein, Sensation and Perception (Wadsworth, 2007).
  8. L. J. Borg-Graham, C. Monier, and Y. Fregnac, “Visual input evokes transient and strong shunting inhibition in visual cortical neurons,” Nature 393, 369-373 (1998).
  9. T. Hammadou and A. Bouzerdoum, “Novel image enhancement technique using shunting inhibitory cellular neural networks,” IEEE Trans. Consumer Electron. 47, 934-940 (2001). [CrossRef]
  10. M. Riedmiller, “Advanced supervised learning in multilayer perceptrons--from backpropagation to adaptive learning algorithms,” Comput. Standards Interfaces 16, 265-275 (1994).
  11. S. E. Fahlman, “An empirical study of learning speed in backpropagation networks,” Computer Science Technical Report CMU-CS-88-162 (Carnegie Mellon University, 1988).
  12. R. Collins, A. Lipton, H. Fujiyoshi, and T. Kanade, “Algorithms for cooperative multisensor surveillance,” Proc. IEEE 89, 1456-1477 (2001). [CrossRef]
  13. C. Papageorgiou and T. Poggio, “Trainable pedestrian detection,” in Proceedings of the International Conference on Image Processing (IEEE, 1999), Vol. 4, pp. 35-39.
  14. I. Haritaoglu and M. Flickner, “Attentive billboards,” in Proceedings of the International Conference on Image Analysis and Processing (Italian Group of Researchers in Pattern Recognition, 2001), pp. 162-167.
  15. S. Munder and D. M. Gavrila, “An experimental study on pedestrian classification,” IEEE Trans. Pattern Anal. Mach. Intell. 28, 1863-1868 (2006). [CrossRef]
  16. F. H. C. Tivive and A. Bouzerdoum, “A shunting inhibitory convolutional neural network for gender classification,” in Proceedings of the International Conference on Pattern Recognition (International Association of Pattern Recognition, 2006), pp. 421-424.
  17. C. Wohler and J. K. Anlauf, “An adaptable time-delay neural-network algorithm for image sequence analysis,” IEEE Trans. Neural Netw. 10, 1531-1536 (1999). [CrossRef]
  18. S. Agarwal, A. Awan, and D. Roth, “Learning to detect objects in images via a sparse, part-based representation,” IEEE Trans. Pattern Anal. Mach. Intell. 26, 1475-1490 (2004).
  19. J. Fang and G. Qiu, “Car/non-car classification in an informative sample subspace,” in Proceedings of the International Conference on Pattern Recognition (International Association of Pattern Recognition, 2006), vol. 2, pp. 962-965.
  20. Z. Zhu, Y. Zhao, and H. Lu, “Sequential architecture for efficient car detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 2007), pp. 1-8.
  21. K-K. Sung and T. Poggio, “Example-based learning for view-based human face detection,” IEEE Trans. Pattern Anal. Mach. Intell. 20, 39-51 (1998).
  22. C. Garcia and M. Delakis, “Convolutional face finder: a neural architecture for fast and robust face detection,” IEEE Trans. Pattern Anal. Mach. Intell. 26, 1408-1423(2004). [CrossRef]
  23. J. Mutch and D. G. Lowe, “Multiclass object recognition with sparse, localized features,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 2006), Vol. 1, pp. 11-18.
  24. M. Fritz, B. Leibe, B. Caputo, and B. Schiele, “Integrating representative and discriminative models for object category detection,” in Proceedings of the IEEE International Conference on Computer Vision (IEEE, 2005), Vol. 2, pp. 1363-1370.

Cited By

Alert me when this paper is cited

OSA is able to provide readers links to articles that cite this paper by participating in CrossRef's Cited-By Linking service. CrossRef includes content from more than 3000 publishers and societies. In addition to listing OSA journal articles that cite this paper, citing articles from other participating publishers will also be listed.

Figures

Fig. 1 Fig. 2 Fig. 3
 
Fig. 4 Fig. 5
 

« Previous Article  |  Next Article »

OSA is a member of CrossRef.

CrossCheck Deposited