OSA's Digital Library

Journal of the Optical Society of America A

Journal of the Optical Society of America A


  • Vol. 20, Iss. 7 — Jul. 1, 2003
  • pp: 1271–1282

Independent spectral representations of images for recognition

Xiuwen Liu and Lei Cheng  »View Author Affiliations

JOSA A, Vol. 20, Issue 7, pp. 1271-1282 (2003)

View Full Text Article

Enhanced HTML    Acrobat PDF (1853 KB)

Browse Journals / Lookup Meetings

Browse by Journal and Year


Lookup Conference Papers

Close Browse Journals / Lookup Meetings

Article Tools



In recent years, studies have shown that independent components of local windows of natural images resemble the receptive fields of cells in the early stages of the mammalian visual pathway. However, the role of the independence in visual recognition is not well understood. We argue that the independence resolves the curse of dimensionality by reducing the complexity of probability models to the linear order of the dimension. In addition, we show empirically that the complexity reduction does not degrade the recognition performance on all the data sets that we have used with an independent spectral representation. In this representation, an input image is first decomposed into independent channels given by the estimated independent components from training images, and each channel’s response is then summarized by using its histogram as an estimate of the underlying probability model along that dimension. We demonstrate the sufficiency of the proposed representation for image characterization by synthesizing textures and objects through sampling and for recognition by applying it to large data sets. Our comparisons show that the independent spectral representation often gives improved recognition performance.

© 2003 Optical Society of America

OCIS Codes
(100.2960) Image processing : Image analysis
(100.5010) Image processing : Pattern recognition
(330.4060) Vision, color, and visual optics : Vision modeling
(330.6110) Vision, color, and visual optics : Spatial filtering

Original Manuscript: September 30, 2002
Revised Manuscript: January 10, 2003
Manuscript Accepted: January 10, 2003
Published: July 1, 2003

Xiuwen Liu and Lei Cheng, "Independent spectral representations of images for recognition," J. Opt. Soc. Am. A 20, 1271-1282 (2003)

Sort:  Author  |  Year  |  Journal  |  Reset  


  1. D. H. Ballard, An Introduction to Natural Computation (MIT Press, Cambridge, Mass., 1997).
  2. D. Marr, Vision (Freeman, New York, 1982).
  3. D. C. Knill, W. Richards, eds., Perception As Bayesian Inference (Cambridge U. Press, Cambridge, UK, 1996).
  4. H. von Helmholtz, Treatise on Physiological Optics (Dover, New York, 1867).
  5. R. Bellman, Adaptive Control Processes: A Guided Tour (Princeton U. Press, Princeton, N.J., 1961).
  6. H. Hotelling, “Analysis of a complex of statistical variables in principal components,” J. Educ. Psychol. 24, 417–441, 498–520 (1933). [CrossRef]
  7. M. M. Loève, Probability Theory (Van Nostrand, Princeton, N. J., 1955).
  8. L. Sirovich, M. Kirby, “Low-dimensional procedure for the characterization of human faces,” J. Opt. Soc. Am. A 4, 519–524 (1987). [CrossRef] [PubMed]
  9. P. Comon, “Independent component analysis: a new concept?” Signal Process. 36, 287–314 (1994). [CrossRef]
  10. P. J. Huber, “Projection pursuit,” Ann. Statistics 13, 435–475 (1985). [CrossRef]
  11. H. B. Barlow, “Unsupervised learning,” Neural Comput. 1, 295–311 (1989). [CrossRef]
  12. D. J. Field, “Relations between the statistics of natural images and the response properties of cortical cells,” J. Opt. Soc. Am. A 4, 2379–2394 (1987). [CrossRef] [PubMed]
  13. D. J. Field, “What is the goal of sensory coding?” Neural Comput. 6, 559–601 (1994). [CrossRef]
  14. B. A. Olshausen, D. J. Field, “Emergence of simple-cell receptive field properties by learning a sparse code for natural images,” Nature 381, 607–609 (1996). [CrossRef] [PubMed]
  15. A. J. Bell, T. J. Sejnowski, “The ‘independent components’ of natural scenes are edge filters,” Vision Res. 37, 3327–3338 (1997). [CrossRef]
  16. E. P. Simoncelli, B. A. Olshausen, “Natural image statistics and neural representation,” Annu. Rev. Neurosci. 24, 1193–1216 (2001). [CrossRef] [PubMed]
  17. A. Srivastava, A. Lee, E. P. Simoncelli, S. C. Zhu, “On advances in statistical modeling of natural images,” J. Math. Imaging Vision 18, 17–33 (2003). [CrossRef]
  18. N. Vasconcelos, G. Carneiro, “What is the role of independence for visual recognition?” in Proceedings of the 7th European Conference on Computer Vision (Springer, Berlin, 2002), Vol. 1, pp. 297–311.
  19. F. W. Campbell, J. G. Robson, “Application of Fourier analysis to the visibility of gratings,” J. Physiol. (London) 197, 551–566 (1968).
  20. R. L. De Valois, K. K. De Valois, Spatial Vision (Oxford U. Press, New York, 1988).
  21. A. Hyvärinen, “Survey on independent component analysis,” Neural Comput. Surv. 2, 194–128 (1999).
  22. A. Hyvärinen, “Fast and robust fixed-point algorithm for independent component analysis,” IEEE Trans. Neural Netw. 10, 626–634 (1999). [CrossRef]
  23. R. O. Duda, P. E. Hart, D. G. Stork, Pattern Classification, 2nd ed. (Wiley, New York, 2000).
  24. X. Liu, D. L. Wang, “A spectral histogram model for texton modeling and texture discrimination,” Vision Res. 42, 2617–2634 (2002). [CrossRef] [PubMed]
  25. D. A. Socolinsky, A. Selinger, “A comparative analysis of face recognition performance with visible and thermal infrared imagery,” in Proceedings of the International Conference on Pattern Recognition (IEEE Computer Society Press, Los Alamitos, Calif., 2002), Vol. 4, pp. 217–222.
  26. J. R. Bergen, E. H. Adelson, “Early vision and texture perception,” Nature 333, 363–367 (1988). [CrossRef] [PubMed]
  27. C. Chubb, J. Econopouly, M. S. Landy, “Histogram contrast analysis and the visual segregation of IID textures,” J. Opt. Soc. Am. A 11, 2350–2374 (1994). [CrossRef]
  28. D. J. Heeger, J. R. Bergen, “Pyramid-based texture analysis/synthesis,” in Proceedings of SIGGRAPHS (Addison-Wesley, Boston, Mass., 1995), pp. 229–238.
  29. S. C. Zhu, Y. N. Wu, D. Mumford, “Minimax entropy principle and its application to texture modeling,” Neural Comput. 9, 1627–1660 (1997). [CrossRef]
  30. S. C. Zhu, X. Liu, Y. N. Wu, “Exploring texture ensembles by efficient Markov chain Monte Carlo,” IEEE Trans. Pattern Anal. Mach. Intell. 22, 554–569 (2000). [CrossRef]
  31. X. Liu, A. Srivastava, “3D object recognition using perceptual components,” in Proceedings of the International Joint Conference on Neural Networks (IEEE Computer Society Press, Los Alamitos, Calif., 2001), Vol. 1, pp. 553–558.
  32. X. Liu, D. L. Wang, “Appearance-based recognition using perceptual components,” in Proceedings of the International Joint Conference on Neural Networks (IEEE Computer Society Press, Los Alamitos, Calif., 2001), Vol. 3, pp. 1943–1948.
  33. X. Liu, D. L. Wang, A. Srivastava, “Image segmentation using local spectral histograms,” in Proceedings of the International Conference on Image Processing (IEEE Press, Piscataway, N.J., 2001), Vol. 1, pp. 70–73.
  34. J. Hertz, A. Krogh, R. G. Palmer, Introduction to the Theory of Neural Computation (Addison-Wesley, Reading, Mass., 1991).
  35. T. Randen, J. H. Husoy, “Filtering for texture classification: a comparative study,” IEEE Trans. Pattern Recog. Mach. Intell. 21, 291–310 (1999). [CrossRef]
  36. Images in the ORL data set are available at http://www.uk.research.att.com/facedatabase.html .
  37. J. Zhang, Y. Yan, M. Lades, “Face recognition: eigenface, elastic matching, and neural nets,” Proc. IEEE 85, 1423–1435 (1997). [CrossRef]
  38. A. Srivastava, X. Liu, “Statistical hypothesis pruning for identifying faces from infrared images,” Image Vision Comput. (to be published).
  39. S. T. Roweis, L. K. Saul, “Nonlinear dimensionality reduction by locally linear embedding,” Science 290, 2323–2326 (2000). [CrossRef] [PubMed]
  40. A. Hyvärinen, P. O. Hoyer, “Emergence of phase and shift invariant features by decomposition of natural images into independent feature subspaces,” Neural Comput. 12, 1705–1720 (2000). [CrossRef]
  41. A. Hyvärinen, P. O. Hoyer, “A two-layer sparse coding model learns simple and complex cell receptive fields and topology from natural images,” Vision Res. 41, 2413–2423 (2001). [CrossRef]
  42. M. J. Wainwright, E. Simoncelli, A. S. Willsky, “Random cascades on wavelet trees and their use in analyzing and modeling natural images,” Appl. Comput. Harmonic Anal. 11, 89–123 (2001). [CrossRef]
  43. P. Domingos, M. Pazzani, “On the optimality of the simple Bayesian classifier under zero-one loss,” Machine Learning 29, 103–130 (1997). [CrossRef]

Cited By

Alert me when this paper is cited

OSA is able to provide readers links to articles that cite this paper by participating in CrossRef's Cited-By Linking service. CrossRef includes content from more than 3000 publishers and societies. In addition to listing OSA journal articles that cite this paper, citing articles from other participating publishers will also be listed.

« Previous Article  |  Next Article »

OSA is a member of CrossRef.

CrossCheck Deposited