OSA's Digital Library

Journal of the Optical Society of America A

Journal of the Optical Society of America A


  • Vol. 7, Iss. 6 — Jun. 1, 1990
  • pp: 1113–1123

Human discrimination of fractal images

David C. Knill, David Field, and Daniel Kerstent  »View Author Affiliations

JOSA A, Vol. 7, Issue 6, pp. 1113-1123 (1990)

View Full Text Article

Enhanced HTML    Acrobat PDF (1930 KB)

Browse Journals / Lookup Meetings

Browse by Journal and Year


Lookup Conference Papers

Close Browse Journals / Lookup Meetings

Article Tools



In order to transmit information in images efficiently, the visual system should be tuned to the statistical structure of the ensemble of images that it sees. Several authors have suggested that the ensemble of natural images exhibits fractal behavior and, therefore, has a power spectrum that drops off proportionally to 1/fβ(2 < β < 4). In this paper we investigate the question of which value of the exponent β describes the power spectrum of the ensemble of images to which the visual system is optimally tuned. An experiment in which subjects were asked to discriminate randomly generated noise textures based on their spectral drop-off was used. Whereas the discrimination-threshold function of an ideal observer was flat for different spectral drop-offs, human observers showed a broad peak in sensitivity for 2.8 < β < 3.6. The results are consistent with, but do not provide direct evidence for, the theory that the visual system is tuned to an ensemble of images with Markov statistics.

© 1990 Optical Society of America

Original Manuscript: November 7, 1988
Manuscript Accepted: January 20, 1990
Published: June 1, 1990

David C. Knill, David Field, and Daniel Kerstent, "Human discrimination of fractal images," J. Opt. Soc. Am. A 7, 1113-1123 (1990)

Sort:  Author  |  Year  |  Journal  |  Reset  


  1. F. Attneave, “Informational aspects of visual perception,” Psychol. Rev. 61, 183–193 (1954). [CrossRef] [PubMed]
  2. H. B. Barlow, “Sensory mechanisms, the reduction of redundancy and intelligence,” NPL Symposium on the Mechanization of Thought Processes, No. 10 (H. M. Stationary Office, London, 1959), pp. 535–539.
  3. H. B. Barlow, “The coding of sensory messages,” in Current Problems in Animal Behavior, W. H. Thorpe, O. L. Zangwill, eds. (Cambridge U. Press, Cambridge, 1961), pp. 331–360.
  4. H. B. Barlow, “The Ferrier lecture: critical limiting factors in the design of the eye and visual cortex,” Proc. R. Soc. London Ser. B 212, 1–34 (1981). [CrossRef]
  5. M. V. Srinivasan, S. B. Laughlin, A. Dubs, “Predictive coding: a fresh view of inhibition in the retina,” Proc. R. Soc. London Ser. B 216, 427–459 (1982). [CrossRef]
  6. T. Bossomaier, A. W. Synder, “Why spatial frequency processing in the visual cortex?” Vision Res. 26, 1307–1309 (1986). [CrossRef] [PubMed]
  7. D. J. Field, “Relations between the statistics of natural images and the response properties of cortical cells,” J. Opt. Soc. Am. A 4, 2379–2394 (1987). [CrossRef] [PubMed]
  8. R. Linsker, “Self-organization in a perceptual network,” IEEE Trans. Comput. 21, 105–117 (1988).
  9. D. M. Kammen, A. L. Yuille, “Spontaneous symmetry-breaking energy functions and the emergence of orientation selective cortical cells,” Biol. Cybern. 59, 23–31 (1988). [CrossRef] [PubMed]
  10. H. B. Barlow, P. Z. Foldiak, “Adaptation and decorrelation in the cortex,” in The Computing Neuron, R. C. Miall, R. M. Durbin, G. J. Mitchison, eds. (Addison-Wesley, Reading, Mass., 1989).
  11. C. E. Shannon, W. Weaver, The Mathematical Theory of Communication (U. Illinois Press, Champaign, Ill, 1949).
  12. Stationarity implies that the statistics of a random field are invariant over translations of the coordinate system on which it is defined. Isotropy implies that they are also invariant over rotations of the coordinate system. One result of these two assumptions is that the autocorrelation function can be expressed as a function of Euclidean distance. The assumption of stationarity is intuitively attractive, as it can result from viewing scenes from a range of positions, so that shifted versions of any given image are equally likely. A similar argument, however, cannot be made for the isotropy assumption, as we generally view scenes with our heads perpendicular to the ground. In a study related to the question of isotropy, Switkes et al.13 found more power at horizontal and vertical orientations in images of both natural and synthetic scenes. The assumption does, however, simplify our investigation by allowing us to look at the correlational structure of images as a function only of distance between points.
  13. E. Switkes, M. J. Mayer, J. A. Sloan, “Spatial frequency analysis of the visual environment: anisotropy and the carpentered environment hypothesis,” Vision Res. 18, 1393–1399 (1978). [CrossRef] [PubMed]
  14. The autocorrelation function of a random field I is given byRI((xi,yi),(xj,yj))=E[I(xi,yi)I(xj,yj)],where E[·] is the expectation operator. Since we are assuming that I is stationary and isotropic, we can write this as a function of the Euclidean distance between points:RI(Δr)=E[I(x,y)I(x+Δrcosθ,y+Δrsinθ)],where r= (Δx2+ Δy2)1/2and θ is the angle between the points. The second-order statistical structure of a stationary ensemble of images, given by the autocorrelation function in space, is given by the power spectrum in the frequency domain. The power spectrum is the Fourier transform of the autocorrelation functionPI(fx,fy)=∫0∞∫0∞RI(x,y)exp[i2π(fx+fy)]dxdy.The power at a given frequency is twice the variance of the corresponding Fourier coefficients (real and imaginary) of images in an ensemble. The real and imaginary parts of the Fourier coefficient at a given frequency are uncorrelated and have equal variance. The power at fx = fy = 0 is the squared mean of the ensemble. For an isotropic ensemble, the power spectrum may be written as a function of radial spatial frequency, PI(fr), where fr= (fx2 + fy2)1/2.
  15. B. Julesz, “Spatial frequency channels in one-, two- and three-dimensional vision: variations on an auditory theme by Bekesy,” in Vision Coding and Adaptability, C. S. Harris, ed. (Erlbaum, Hillside, N.J., 1980).
  16. B. B. Mandelbrot, Fractals: Form, Chance, and Dimension (Freeman, San Francisco, Calif., 1977).
  17. B. B. Mandelbrot, The Fractal Geometry of Nature (Freeman, San Francisco, Calif., 1982).
  18. A. P. Pentland, “Fractal-based description of natural scenes,” IEEE Trans. Pattern Anal. Mech. Intell. PAMI-6, 661–673 (1984). [CrossRef]
  19. The power spectrum of an ensemble with an exponential auto-correlation function clearly shows the effect of the scale constant k. For an isotropic ensemble, the spectrum is given byPI(fr)∝1(k+fr3/2)2.At frequencies much lower than k, the spectrum approximates white noise; that is, points in the image separated by a distance much greater than 1/k are effectively uncorrelated. For frequencies much greater than k, the spectrum falls off according to the power law 1/fr3. The images of this ensemble exhibit qualitatively different statistical behavior at different scales.
  20. B. Julesz, “Visual pattern discrimination,” IRE Trans. Inf. Theory IT-8, 84–92 (1962). [CrossRef]
  21. B. Julesz, E. Gilbert, L. Shepp, H. Frisch, “Inability of humans to discriminate between visual textures that agree in second-order statistics revisited,” Perception 2, 391–405 (1973). [CrossRef]
  22. B. Julesz, J. Bergen, “Textons, the fundamental elements in preattentive vision and perception of texture,” Bell Syst. Tech. J. 62, 619–1645 (1983).
  23. W. K. Pratt, O. D. Faugeras, A. Gagalowicz, “Visual discrimination of stochastic texture fields,” IEEE Trans. Syst. Man Cybern. SMC-8, 796–804 (1978). [CrossRef]
  24. R. A. Rensink, On the Visual Discrimination of Self-Similar Random Textures, Department of Computer Science Tech. Rep. 86-16 (University of British Columbia, Vancouver, B.C., Canada, 1986).
  25. Rensink generated line textures by using one-dimensional power spectra. Peak performance was found to be at a spectral drop-off of β1D= 3 for these textures. The equivalent two-dimensional spectral drop-off is given by β2D= β1D + 1 = 4.26
  26. R. F. Voss, “Random fractal forgeries,” in Fundamental Algorithms for Computer Science, R. A. Earnshaw, ed. (Springer-Verlag, Berlin, 1985), pp. 805–829. [CrossRef]
  27. H. B. Barlow, “The efficiency of detecting changes of density in random dot patterns,” Vision Res. 18, 637–650 (1978). [CrossRef] [PubMed]
  28. H. B. Barlow, “Measurements of the quantum efficiency of discrimination in human scotopic vision,” J. Physiol. 160, 169–188 (1962). [PubMed]
  29. A. B. Watson, H. B. Barlow, J. G. Robson, “What does the eye see best?” Nature (London) 302, 419–422 (1983). [CrossRef]
  30. A. E. Burgess, R. F. Wagner, R. J. Jennings, H. B. Barlow, “Efficiency of human visual signal discrimination,” Science 214, 93–94 (1981). [CrossRef] [PubMed]
  31. D. Kersten, “Spatial summation in visual noise,” Vision Res. 24, 1977–1990 (1984). [CrossRef] [PubMed]
  32. We corrected for the nonlinearity by raising the entries of the lookup table to an exponent of 0.375 (1/2.67) and rescaling them to give a range of 256 gray levels. The equation used for correcting the lookup-table entries wasl[i]=i0.375*250.00.625,0≤i<256,where l[i] is the ith entry in the lookup table.
  33. The resulting image statistics had toroidal symmetry, reflecting the symmetry of the fast Fourier-transform algorithm.
  34. A. B. Watson, D. G. Pelli, “quest: a Bayesian adaptive psychometric method,” Percept. Psychophys. 33, 113–120 (1983). [CrossRef] [PubMed]
  35. W. A. Weibull, “A statistical distribution function of wide applicability,” J. Appl. Mech. 18, 292–297 (1951).
  36. D. Kersten, “Statistical efficiency for the detection of visual noise,” Vision Res. 24, 1977–1990 (1984);Vision Res. 27, 1029–1040 (1987). [CrossRef]
  37. H. J. Larson, B. O. Shubert, Probabilistic Models in Engineering Sciences, Vol. 1: Random Variables and Stochastic Processes (Wiley, New York, 1979).
  38. D. Kersten, “Predictability and redundancy of natural images,” J. Opt. Soc. Am. A 4, 2395–2400 (1987). [CrossRef] [PubMed]
  39. H. O. Peitgen, P. H. Richter, The Beauty of Fractals (Springers-Verlag, Berlin, 1986). [CrossRef]

Cited By

Alert me when this paper is cited

OSA is able to provide readers links to articles that cite this paper by participating in CrossRef's Cited-By Linking service. CrossRef includes content from more than 3000 publishers and societies. In addition to listing OSA journal articles that cite this paper, citing articles from other participating publishers will also be listed.


Fig. 1 Fig. 2 Fig. 3
Fig. 4

« Previous Article  |  Next Article »

OSA is a member of CrossRef.

CrossCheck Deposited