1. Introduction
Back in 1873, Ernst Abbe formulated the theory of imaging, stating that the maximal resolution recoverable in a perfect optical imaging system is determined by the numerical aperture of the lenses involved [
1
E. Hecht, Optics (Addison-Wesley, 1998).
]. Some decades later, it became clear that the true limit on imaging arises from the optical wavelength
λ and the best recoverable resolution is
λ/2. This is because the propagation of EM waves in bulk media behaves as a low-pass filter, for distances much larger than the wavelength, rendering spatial frequencies larger than 1/
λ evanescent [
2
M. Saleh and B. Teich, Fundamentals of Photonics (Wiley, New York, 1991).
]. Therefore, such spatial frequencies decay rapidly, on a distance scale of several wavelengths. Hence, the observation of sub-wavelength features is essentially impossible using conventional imaging methods. Throughout the years, there have been many attempts to bypass the
λ/2 limit on imaging. One approach is the Near-field Scanning Optical Microscope (NSOM): a very narrow tip, which samples the electromagnetic field at a single point at very close proximity (“near field”) to the sub-wavelength specimen, is scanned across the object. The NSOM has nowadays become a frequently used commercial product, but it does have some major disadvantages: (I) the tip must be placed in the near-field, hence it cannot be used, for example, to look into living cells, and (II) acquiring an image requires scanning the specimen point-by-point [
3
E. A. Ash and G. Nicholls, “Super-resolution aperture scanning microscope,” Nature
237, 510–512 (1972). [CrossRef] [PubMed]
–
5
E. Betzig, J. K. Trautman, T. D. Harris, J. S. Weiner, and R. L. Kostelak, “Breaking the diffraction barrier: optical microscopy on a nanometric scale,” Science
251, 1468–1470 (1991). [CrossRef] [PubMed]
], which implies that real-time imaging is impossible. These are severe limitations particularly when studying objects that vary in time (e.g., living objects like bacteria). In the last few years, several genuine ideas have been proposed to allow for more effective sub-wavelength imaging. One approach is based on probing the information with sub-wavelength holes made from thin film of plasmonic metals [
6
T. W. Ebbesen, H. G. Lezec, H. F. Ghaemi, T. Thio, and P. A. Wolf, “Extraordinary optical transmission through subwavelength hole arrays,” Nature
391, 667–669 (1998). [CrossRef]
]. A more recent idea relies on constructing super-oscillatory wavepackets to sample at sub-wavelength resolution [
7
F. M. Huang and N. I. Zheludev, “Super-resolution without evanescent waves,” Nano Lett.
9, 1249–1254 (2009). [CrossRef] [PubMed]
]. However, both of these methods still require scanning, either in the near-field [
6
T. W. Ebbesen, H. G. Lezec, H. F. Ghaemi, T. Thio, and P. A. Wolf, “Extraordinary optical transmission through subwavelength hole arrays,” Nature
391, 667–669 (1998). [CrossRef]
] or in the plane where the super-oscillations are generated [
7
F. M. Huang and N. I. Zheludev, “Super-resolution without evanescent waves,” Nano Lett.
9, 1249–1254 (2009). [CrossRef] [PubMed]
]. Another intriguing avenue is constructing an imaging system made of negative-index materials. The early version of such a system is the “superlens”, where the sub-wavelength object is imaged 1:1 to another plane [
8
J. B. Pendry, “Negative refraction makes a perfect lens,” Phys. Rev. Lett.
85, 3966–3969 (2000). [CrossRef] [PubMed]
,
9
N. Fang, H. Lee, C. Sun, and X. Zhang, “Sub-diffraction-limited optical imaging with a silver superlens,” Science
308, 534–537 (2005). [CrossRef] [PubMed]
]. Hence, this superlens cannot yield any magnification of the object features. The more advanced version is the “hyperlens”, where the sub-wavelength information is magnified such that its smallest feature is larger than
λ/2, thereby transforming all its evanescent waves into propagating waves - which can subsequently be imaged with an ordinary microscope [
10
Z. Jacob, L. V. Alexeyev, and E. Narimanov, “Optical hyperlens: far-field imaging beyond the diffraction limit,” Opt. Express
14, 8247–8256 (2006). [CrossRef] [PubMed]
–
13
I. I. Smolyaninov, Y. J. Hung, and C. C. Davis, “Magnifying superlens in the visible frequency range,” Science
315, 1699–1701 (2007). [CrossRef] [PubMed]
]. Both the superlens and the hyperlens, albeit offering much promise, also have various shortcomings: the heavy loss involved in all current optical negative-index materials, and the stringent requirement to fabricate metamaterial structures at nanometer precision, to name only a few. All of these are nontrivial issues, posing serious challenges before such negative-index structures can become viable technology. Other ideas for sub-wavelength imaging rely on distributing smaller-than-wavelength fluorescing items on the object and repeating the experiments multiple times. In this way, the ensemble-average fluorescent light together with prior knowledge on the size and shape of the items facilitate acquiring sub-wavelength information on the object [
14
A. Yildiz, J. N. Forkey, S. A. McKinney, T. Ha, Y. E. Goldman, and P. R. Selvin, “Myosin v walks hand-over-hand: Single fluorophore imaging with 1.5nm localization,” Science
300, 2061–2065 (2003). [CrossRef] [PubMed]
,
15
S. W. Hell, R. Schmidt, and A. Egner, “Diffraction-unlimited three-dimensional optical nanoscopy with opposing lenses,” Nat. Photon.
3, 381–387 (2009). [CrossRef]
]. However, these methods are once again not real-time, and in addition in many cases attaching external items is undesirable, especially when dealing with biological specimen. Altogether, in spite of the major progress recently accomplished with sub-wavelength optical imaging (for a recent review see [
16
N. I. Zheludev, “What diffraction limit?” Nat. Mater.
7, 420–422 (2008). [CrossRef] [PubMed]
]), having a far-field method that could do real-time imaging with sub-wavelength resolution is still a long-standing goal.
In parallel to the attempts to obtain sub-wavelength imaging through “hardware”, there have been several attempts to achieve this goal through theoretical tools, such as bandwidth extrapolation and related techniques [
17
J. W. Goodman, Introduction to Fourier optics(Englewood, CO: Roberts & Co. Publishers, 2005), 3rd ed.
]. The key ideas in all of these methods are (quoting from Ref. [
17
J. W. Goodman, Introduction to Fourier optics(Englewood, CO: Roberts & Co. Publishers, 2005), 3rd ed.
]) (1) the far-field of a spatially-bounded 2D image is described by an analytic function, and (2) if an analytic function is known exactly in an arbitrarily small (but finite) region of the far field, then the entire function can be found uniquely by means of analytic continuation. These concepts and the extrapolation methods arising from them theoretically allow the recovery of sub-wavelength information [
18
A. Papoulis, “A new algorithm in spectral analysis and band-limited extrapolation,” IEEE Trans. Circuits Syst.
22, 735–742 (1975). [CrossRef]
,
19
R. W. Gerchberg, “Super-resolution through error energy reduction,” J. Mod. Opt.
21, 709–720 (1974).
]. However, all of these algorithms are known to be extremely sensitive to noise in the measured data. As summarized by Goodman’s 2005 book [
17
J. W. Goodman, Introduction to Fourier optics(Englewood, CO: Roberts & Co. Publishers, 2005), 3rd ed.
], “all methods for extrapolating bandwidth beyond the diffraction limit are known to be extremely sensitive to both noise in the measured data and the accuracy of the assumed a priori knowledge” and “it is generally agreed that the Rayleigh diffraction limit represents a practical frontier that cannot be overcome with a conventional imaging system.”
Here, we show theoretically and provide an experimental proof of concept that sub-wavelength information can be recovered robustly from the far-field of an optical image, overcoming the loss of information embedded in decaying evanescent waves. The only requirement is that the image is known to be sparse, a specific but very general and wide-spread property of signals which occur almost everywhere in nature. Our approach is based on theoretical tools from the emerging field of Compressed Sensing (CS), which is being used to reduce sampling rates in information processing [
20
E. J. Candes, J. Romberg, and T. Tao, “Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information,” IEEE Trans. Inf. Theory
52, 489–509 (2006). [CrossRef]
–
25
M. Mishali and Y. C. Eldar, “Blind multi-band signal reconstruction: Compressed sensing for analog signals,” IEEE Trans. Signal Process.
57, 993–1009 (2009). [CrossRef]
]. Recently, CS has been suggested in the context of optics [
26
A. Ashok, P. K. Baheti, and M. A. Neifeld, “Compressive imaging system design using task-specific information,” Appl. Opt.
47, 4457–4471 (2008). [CrossRef] [PubMed]
] and was actually used for ghost imaging [
27
O. Katz, Y. Bromberg, and Y. Silberberg, “Ghost imaging via compressed sensing,” in “Frontiers in Optics (FiO),” (2009).
]. Our purpose is different: to recover the information contained in spatial frequencies that were cut off by diffraction limit, which acts as a low pass filter. We reformulate sub-wavelength imaging as a sparse sampling problem. We provide several examples demonstrating reconstruction of 1D and 2D images of sub-wavelength resolution, and discuss the interrelation between the three parameters controlling the system: sparsity, resolution, and the optical wavelength. In addition, we extend the standard basis-pursuit algorithm [
20
E. J. Candes, J. Romberg, and T. Tao, “Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information,” IEEE Trans. Inf. Theory
52, 489–509 (2006). [CrossRef]
,
22
E. J. Candes and M. B. Wakin, “An introduction to compressive sampling,” IEEE Signal Process. Mag.
25, 21–30 (2008). [CrossRef]
,
28
Z. Ben-Haim, Y. C. Eldar, and M. Elad, “Near-oracle performance of basis pursuit under random noise,” IEEE Trans. Signal Process. (submitted).
] commonly used in CS for sparse signal recovery, in order to enable reconstruction of optical images including non-uniform phase, which is an essential attribute in optical image recovery. As we show in the Theory Appendix, basis-pursuit alone is not able to recover features close in space that have opposite phase. Therefore, this novel feature is crucial to the extraction of physically relevant data from the image, through the phase of the electromagnetic field. We then provide an experimental proof-of-principle of our approach, by demonstrating image recovery at a spatial resolution greatly exceeding the finest resolution defined by a spatial filter. Finally, we discuss the general concept and broad applicability of our technique, and its possible extension to other, non-optical, microscopes, such as Atomic Force Microscopes, Scanning Tunneling Microscopes, Magnetic Microscopes, and more.
2. Theoretical considerations
Consider the time-harmonic EM field at some initial plane z=0:
where ω is the optical frequency of the wave. The spatial function f (x,y) can be expanded as a function of plane waves , where kx
, ky
are the transverse wave numbers, related to the spatial frequencies by kx
=2πνx
, ky
=2πνy
etc. The propagation of the field at all planes z>0, in a homogeneous isotropic and linear medium, can be described through
where
is the optical coherent transfer function, with
. Here
k=
ω/
c=2
π/
λ is the wave number, and
c and
λ are the speed of light and the wavelength in the medium, respectively. The limits of the integral in
Eq. (2) are determined by the numerical aperture of the system. However, even if the system in principle has infinite width (as in free space), the transfer function always acts as a low-pass filter. Since for sufficiently large spatial frequencies
kz
becomes imaginary, such waves decay exponentially with propagation. Thus, for propagation distances
z much larger than the wavelength,
has a cutoff at
k
2=
k
2
x
+
k
2
y
, and all waves with spatial frequencies beyond the cutoff are evanescent. This is the reason why sub-wavelength information imprinted on EM fields cannot be observed by conventional imaging techniques.
To explain our technique, consider an EM wave that has propagated a distance z much larger than the wavelength λ. Since the transfer function acts as a low-pass filter, all spatial frequencies larger than 1/λ are lost. We will now show that the information detected in the far-field can be used to recover the sub-wavelength features, in a robust fashion, provided they are sparse in an appropriate basis.
Let us first explain our approach on intuitive grounds. The idea can be elegantly illustrated by first pinpointing why other extrapolation methods fail: they are not robust to noise in the measured data. As a simple example, consider Taylor expansion as a means for analytic extension from some region in the far-field, close to the cutoff spatial frequency. Taylor expansion fails when the value of some term in the expansion (some higher derivative) is comparable to the noise in the measured data. Other, more advanced, extrapolation methods [
18
A. Papoulis, “A new algorithm in spectral analysis and band-limited extrapolation,” IEEE Trans. Circuits Syst.
22, 735–742 (1975). [CrossRef]
,
19
R. W. Gerchberg, “Super-resolution through error energy reduction,” J. Mod. Opt.
21, 709–720 (1974).
] fail for similar reasons [
17
J. W. Goodman, Introduction to Fourier optics(Englewood, CO: Roberts & Co. Publishers, 2005), 3rd ed.
]. An illustrative comparison between the performance of the technique following [
18
A. Papoulis, “A new algorithm in spectral analysis and band-limited extrapolation,” IEEE Trans. Circuits Syst.
22, 735–742 (1975). [CrossRef]
] and our CS-based approach is shown in the Theory Appendix. The comparison was carried out on our experimental data - through which our technique gives excellent reconstruction, as shown in
Figs. 4 and
5 below. As demonstrated in that Appendix, the extrapolation method of [
18
A. Papoulis, “A new algorithm in spectral analysis and band-limited extrapolation,” IEEE Trans. Circuits Syst.
22, 735–742 (1975). [CrossRef]
], performed on the same experimentally measured data, fails to reconstruct the information. A comparison was also carried out with Taylor expansion, and the result is even worse. Let us now discuss why these methods fail. All extrapolation methods rely on projecting the measured data on some set of orthogonal functions (a basis) spanning the space of solutions. The noise in most physical systems is uncorrelated, hence it is distributed uniformly on the basis functions. The extrapolation methods fail when the value of some projection on the basis functions is comparable to (or lower than) the noise in the measured data, which obviously introduces large errors. Over the years, various ideas for rectifying this problem have been studied [
18
A. Papoulis, “A new algorithm in spectral analysis and band-limited extrapolation,” IEEE Trans. Circuits Syst.
22, 735–742 (1975). [CrossRef]
,
19
R. W. Gerchberg, “Super-resolution through error energy reduction,” J. Mod. Opt.
21, 709–720 (1974).
], all relying on additional a priori knowledge on the information. But this introduces another problem: the extrapolation methods now become very sensitive to the a priori assumptions, where small inaccuracies can introduce large errors. This is why generally extrapolation methods have failed in optical sub-wavelength imaging, exactly as stated in [
17
J. W. Goodman, Introduction to Fourier optics(Englewood, CO: Roberts & Co. Publishers, 2005), 3rd ed.
].
Let us now explain the intuition underlying CS, and the reason why it succeeds where other extrapolation methods have always failed. CS relies on a single assumption: that the information is sparse, in some basis spanning the space of solutions. If the information is sparse, it is possible to find a proper basis, where we could identify a sharp separation into two sub-spaces: a sub-space where the projections of the measured data is much larger than the noise, and another sub-space where the projections are very small, and can be set to zero without losing much information. If we could somehow identify these basis functions, which of course depend on the actual data, we could use only the sub-space where the projections are large, and completely ignore the other sub-space. Such method will not suffer from noise, because we do not use the sub-space where the projections are small and susceptible to noise. The CS technique does exactly that: it automatically identifies the first sub-space, and ignores the second. To do that, CS uses prior knowledge that the information is sparse (and just that), which implies that the information can be represented in a very compact way in some (mathematical) basis spanning only a (preferably small) sub-space of all possible solutions. Then, since the uncorrelated noise is distributed uniformly on all basis functions, a large fraction of the noise lies in unoccupied basis functions (the second sub-space which we ignore). This is the main idea behind CS and its robustness to noise. We use this idea in reverse logic, to recover sparse high-bandwidth information that was low-pass filtered, in a highly robust fashion. As explained below and in the Theory Appendix, CS has a tradeoff between two parameters: sparsity - the fraction of degrees of freedom occupied by the sparse information, and the desired signal extrapolation ratio - the ratio between all degrees of freedom (known + missing) in the system, and the known (measured) degrees of freedom. In our case of sub-wavelength features, sparsity is the ratio between the non-zero features and the total field of view, whereas the signal extrapolation ratio is the ratio between the full bandwidth of the sub-wavelength information and the measured spatial bandwidth determined by the numerical aperture of the system.
On more mathematical grounds, the ability to reconstruct sparse signals from a limited number of measurements has become feasible with the emergence of a new signal processing technique called Compressed Sensing (CS) [
20
E. J. Candes, J. Romberg, and T. Tao, “Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information,” IEEE Trans. Inf. Theory
52, 489–509 (2006). [CrossRef]
–
23
D. L. Donoho, “Compressed sensing,” IEEE Trans. Inf. Theory
52, 1289–1306 (2006). [CrossRef]
,
25
M. Mishali and Y. C. Eldar, “Blind multi-band signal reconstruction: Compressed sensing for analog signals,” IEEE Trans. Signal Process.
57, 993–1009 (2009). [CrossRef]
,
28
Z. Ben-Haim, Y. C. Eldar, and M. Elad, “Near-oracle performance of basis pursuit under random noise,” IEEE Trans. Signal Process. (submitted).
]. This method challenges the traditional limits on the signal reconstruction and measurement process. The underlying logic behind this approach is that sparsely represented signals hold a very limited number of degrees of freedom, since only a small fraction of their coefficients in a particular mathematical basis representation are non-zero. Hence, sparsity is extremely powerful and useful prior information, enabling considerable reduction in the number of measurements required to reconstruct the signal. In what follows, we demonstrate the results of the CS technique applied to reconstructing sub-wavelength optical information from the measured far-field of a signal, which is the optical analogue to the Fourier transform of the signal after passing through a low-pass filter. Theoretical background on CS and the new recovery techniques we develop here are provided in the Theory Appendix.
The problem of reconstructing sub-wavelength images is equivalent to that of recovering a signal from its low spatial frequencies only. Clearly this is impossible without additional information. Here we exploit the knowledge that the signal is sparse (and nothing else!) to resolve the fine sub-wavelength features. To see how we benefit from sparsity, note that sparse signals can be represented very compactly in a given basis, meaning that only a small fraction of their projections on the basis functions are non-zero. This feature significantly restricts the number of degrees of freedom the signal possesses. More specifically, each non-zero coefficient holds exactly two degrees of freedom: one for the amplitude of the projection and the other for the choice of the basis function. If we knew in advance which functions are chosen, then the degrees of freedom would be reduced in half. Given that the relative fraction of occupied basis functions is β (<1), we only need to determine β samples of the signal in an alternative basis expansion. However, we must choose the measurement basis wisely such that the combined matrix describing the signal and measurement bases is (left-) invertible, to ensure the existence of a solution. This follows from standard linear algebra considerations and is well known.
We now turn to the more interesting setting, in which we know that the signal is sparse, but we do not know the location of the basis elements comprising the signal. In this case, the degrees of freedom are doubled. We therefore expect that at least a fraction of 2β measurements of the total number of possible measurements are required. However, since the chosen basis functions are unknown, it is now less clear how to choose the measurement basis and how to recover the signal. An essential result of CS is that
we need to choose a measurement basis such that is satisfies the requirement of invertibility
, obtained in the case in which the locations are known,
for every possible set of locations
. This mathematical condition is quite difficult to verify in practice; however, it can be shown that a sufficient condition is that the measurement basis is uncorrelated with the signal basis. To understand this requirement intuitively, suppose first that the signal basis is orthonormal, and we choose as a measurement basis the signal basis itself. In this case, the majority of the measurements will yield zero, and contain no information about the true signal. We would have to acquire almost all of the projections to make sure we have not lost any information. Instead, we would like to choose the measurement basis such that that a measurement of any projection in this particular basis contains information about the signal. This can be achieved by
requiring that each measurement basis function has low correlation with each signal basis function
. A highly uncorrelated pair of bases obeys a specific mathematical condition. This important theorem, similar to the uncertainty principle in quantum mechanics, prevents a signal from being sparse in both bases, and ensures that, if the signal is sparse in one of the bases, it will be very spread in the other. Therefore almost each projection will yield a non-zero informative measurement. Classical examples of maximally uncorrelated bases are the spatial and Fourier domains: A highly sparse signal, e.g. a single Dirac delta function is Fourier-transformed into a spread function that covers the entire spectrum. In our sub-wavelength optical setting, the measurement basis is fixed as the low spatial frequencies in the Fourier domain. According to the discussion above, measuring these will be sufficient to recover the signal if it is sparse in a real-space basis that is uncorrelated with the low-pass Fourier basis. This is in particular the case if the signal is highly localized in real-space.
In the next section we will address how the signal can be recovered in practice from measurements that are all contained within the low-pass filter window. If the correlation between the measurement and signal bases is low enough, then one can prove that a combinatorial search over all sets of basis functions will recover the true underlying signal. However, clearly this approach exhibits high complexity. Instead, a variety of different recovery algorithms have been proposed that run in polynomial time, and are aimed at seeking a sparse signal that is consistent with the given measurements. One of the most common techniques is the basis-pursuit method [
29
S. S. Chen, D. L. Donoho, and M. A. Saunders, “Atomic decomposition by basis pursuit,” SIAM J. Sci. Comput.
20, 33–61 (1998). [CrossRef]
], which amounts to solving an
l
1 optimization problem involving minimization of a
l
1 norm, and can be implemented quickly and efficiently. A key result of CS is that at the expense of slightly increasing the number of measurements, or in turn, a more demanding condition on the signal sparsity, these polynomial-time algorithms can recover the true sparse signal
in a robust fashion
, provided that the measurement and signal bases have sufficiently low correlation.
Consequently, noise in the measurement (which is always present in any physical system) can be tolerated by a slight increase in the required sparsity of the object
. In the context of optical imaging, an important feature is the ability to detect signals with nonuniform phase. As we illustrate in the Theory Appendix, the standard basis-pursuit approach is not able to resolve fine details with different phase. Therefore, we extend this technique to account for nonuniform phase by adding an iterative nonlocal thresholding step. This new method is described in detail in the Appendix, and is referred to as Non Local Hard Thresholding (NLHT).
Our technique is demonstrated theoretically in
Fig. 1(a), showing the ability to recover phase and amplitude information that is (
β/2)-times smaller than the wavelength, in a robust fashion. The sub-wavelength information is represented by a one-dimensional optical image with alternating phase. Such a signal, with alternating phase, can be reconstructed via CS techniques, but not by the standard basis-pursuit algorithm when the non-zero information is close in space. In order to recover signals containing arbitrary phases, we develop the NLHT algorithm described in the Theory Appendix. The data in
Fig. 1 represents, e.g., a sequence of 1D items with different amplitudes (grey levels) and phases [
Fig. 1(a)]. The spatial frequency spectrum of this image is shown in
Fig. 1(b), where the red lines mark the cutoff boundaries of the low-pass filter |
H(
kx
)| at
kx
=±2
π/
λ. In conventional optical imaging systems, the contents at all frequencies beyond the cutoff are lost [
Fig. 1(d)]. Hence, the observed optical image is strongly deteriorated [
Fig. 1(c)]. For example, the loss of information beyond the cutoff renders the two peaks around
x/
λ≈6 indistinguishable, in the observed image. Using CS (our NLHT algorithm), we are able to achieve perfect recovery of both the image [
Fig. 1(e)] and its spatial spectrum [
Fig. 1(f)].
To demonstrate the robustness of NLHT, we add noise to the system; evidently, the reconstruction is robust and the noise has a very small effect on the recovered image. In order to meet physical relevant conditions, we use uniformly-distributed noise in Fourier space, amounting to 1% of the image power. Since the sparsity of this particular image is
β=0.03 in real space, the recoverable spatial frequencies (as we explain below) are in the range
kx
=±2
π/(2
βλ), greatly exceeding the initial low pass window.
Clearly, the results displayed in Fig. 1 demonstrate that indeed CS methods facilitate robust and accurate recovery of sub-wavelength optical amplitude and phase information
. Comparing the robustness to noise of CS techniques to other bandwidth extrapolation methods yields overwhelming results. See the discussion in the Theory Appendix. As shown there, CS techniques are robust to noise even at low SNR, where other bandwidth extrapolation methods completely fail even in the presence of weak noise, as discussed in [
17
J. W. Goodman, Introduction to Fourier optics(Englewood, CO: Roberts & Co. Publishers, 2005), 3rd ed.
] and references therein.
As demonstrated in
Fig. 1 theoretically, CS can facilitate the recovery of sub-wavelength information, based on a-priori knowledge that the information is sparse. The key idea is to exploit sparsity. We point out that there are other modern information processing techniques that are also sparsity-based, such as Finite Rate of Innovation (FRI) [
30
M. Vetterli, P. Marziliano, and T. Blu, “Sampling signals with finite rate of innovation,” IEEE Trans. Sig. Proc.
50, 1417–1428 (2002). [CrossRef]
], which was also applied for calculating fine spectral lines in nuclear magnetic resonance (under the name FDM) [
31
V. A. Mandelshtam, “FDM: the Filter Diagonalization Method for data processing in NMR experiments,” Prog. Nucl. Mag. Res. Sp.
38, 159–196 (2001). [CrossRef]
]. In principle, any method supporting sparsity can be used. However, as we show in the Theory Appendix, some of these approaches may be highly sensitive to noise in some settings. In the Appendix, we briefly compare between our CS-based method and FRI, showing that in the presence of noise CS often performs much better. In addition, as we also demonstrate there, our method tends to be more robust at recovering information with closely-spaced phase variations, while standard CS and FRI techniques have difficulties resolving such details. Our point here is to bring forth the advantage of exploiting sparsity in sub-wavelength imaging, and to suggest one possible method that in our examples appears to be robust and provide superior recovery in comparison with other techniques, to the extent that it can be used for imaging of sub-wavelength information. However, certainly, this calls for a more detailed and careful comparison with other techniques, and possibly even coming up with new sparsity-based ideas that are tailored to optical imaging. Such algorithms can address the specific issues related to optics (e.g., noise that could be partially correlated, etc.). This is beyond the scope of the current work, but an interesting and important direction for future pursuit.

Fig. 1. Theoretical reconstruction of one-dimensional sub-wavelength information (amplitude and phase). (a) The original function, which we want to reconstruct. (b) The Fourier (plane-wave) spectrum of the original information shown in (a). The vertical red lines indicate the width of the low-pass filter, which for sub-wavelength information is 2/λ. (c) The distorted image obtained by an inverse Fourier transform on the filtered spectrum; the features are highly blurred. (d) The low-pass-filtered spectrum; a large fraction of the frequency contents is lost. (e,f) Reconstructed image (e) and its spectrum (f) using CS-methods based on the sparsity of the original information. The function is reconstructed perfectly in both real space and Fourier space, including the phase information. Our algorithm is robust against noise. (g,h) Adding 1% noise to the filtered spectrum (not shown here), we are still able to reconstruct the original information at high quality in both real space (g) and Fourier space (h). Amplitude and intensity are given in arbitrary units (a.u.), because the system does not depend on the light intensity.
The ideas presented in
Fig. 1 can be extended to reconstructing two-dimensional sub-wavelength features.
Figure 1 depicts an example containing 2D sub-wavelength amplitude information. However, the 2D case is physically more challenging, because the scalar relation of
Eq. (2) requires a modification to describe inevitable polarization effects. That is, EM waves containing sub-wavelength 2D optical images cannot be linearly polarized. This implies that using CS for 2D sub-wavelength imaging should contain vectorial mapping between real space and the plane-wave spectrum [a unit vector should be added in the integral of
Eq. (2)]. Nonetheless, extending the CS techniques described here to 2D sub-wavelength images will require some further sophistication, but it is not a major obstacle in any way. In this sense,
Fig. 1 describes a scalar version of the physical reality, simply to demonstrate the ability to recover 2D sub-wavelength images.
3. Experimental proof-of-principle
In what follows we provide proof-of-principle experiments, demonstrating image recovery at a spatial resolution greatly exceeding the finest resolution defined by a spatial filter. The experimental setting (
Fig. 3) is the simplest optical imaging system: the so-called 4-f system, with an adjustable slit placed at the common focal plane of the lenses, where it acts as a 1D low-pass spatial filter. It is important to note that our setup does not contain sub-wavelength objects, but rather paraxial objects where the features are much larger than the wavelength. The aperture of the adjustable slit defines the highest resolution in the image recovered optically at the output plane (image plane). As such, our system contains exactly the same physical impact of low-pass filtering as naturally done by |
H(
kx
,
ky
)| in free space, only that the cutoff spatial frequency in our experiment is controlled by the aperture of our filter, whereas for the transfer function in free space the cutoff is set by the wavelength. Our adjustable filter facilitates precise control over the resolution of the imaging system, since, in contrast to the fixed and symmetric filter window of the optical transfer function in free space, the window size and position can be tuned in our experimental setup. As we explain in details in the Theory and the Experimental Appendices, the data for the reconstruction via-CS can be taken in the Fourier space, or in the (spatially-filtered) image plane, and/or at any plane between the Fourier plane and the image plane. Of course, taking the data at multiple planes constitutes over-sampling, and increases the performance of our CS reconstruction.
We first demonstrate the recovery of a generic amplitude-only picture: 3 stripes at uneven spacing [
Fig. 4(a)]. The input information is generated by passing a broad Gaussian beam (“plane wave”) through 3 transparent elongated rectangles drawn on an opaque slide (acting as 3 rectangular slits). We emphasize that
Fig. 4(a) is the actual input information: it is photographed right after (1mm) the input plane. As such, the horizontal cross-section [
Fig. 4(c)] contains 3 almost-perfect square-waves with sharp edges, in contrast to the best optically-recoverable output image generated in our system when the slit is completely open, which has wiggles on each square-wave. These wiggles occur because of the finite aperture of the lenses, which act as a low-pass filter even with opened slit. [This effect is known in information processing as the Gibbs effect]. Our input information passes through the first lens, which generates the Fourier transform of the information, at the focal plane.
Figure 4b, showing that plane, depicts the full spatial spectrum of the input information. We then adjust the spatial filter (slit) to cut off a large fraction of the spectrum - leaving practically only the central lobe [
Fig. 4(e)]. The output image recovered via direct optical imaging (additional Fourier-transform by the second lens) is now only a single, very broad, intensity peak containing practically only low-frequency information [see
Fig. 4(d) for the experimental image and
Fig. 4(f) for the horizontal cross-section]. Comparing
Fig. 4(d) to
Fig. 4(a), and
Fig. 4(f) to
Fig. 4(c), demonstrates nicely the impact of low-pass filtering on an optical picture, due to the loss of information caused by the filter.
Fig. 2. Theoretical reconstruction of two-dimensional sub-wavelength information. (a,b) The original information consists of an arrangement of circles, forming the Star of David (a), and its respective Fourier transform (b). (c,d) After some propagation distance, all spatial frequencies above 1/λ are lost (d), so that the actual observed image is strongly blurred and the fine features cannot be resolved anymore (c). (e,f) Applying our CS algorithm reveals the underlying sub-wavelength structure in the real space (e), since the Fourier spectrum is fully restored (f).
Fig. 3. Experimental setup for the proof-of-concept experiments. The laser beam is collimated using lenses L1 and L2, before the sample is illuminated. The signal is then Fourier transformed using lens L3, low-pass filtered by the slit and again Fourier transformed into the real plane by lens L4. Another lens L5 performs an additional Fourier transform, which is recorded by a camera. In order to measure the phase distribution, a probe beam is super-imposed (using the beam splitter BS) on the signal in order to create interference fringes. In an alternative setup, the information can be directly taken in the real plane, so that the camera is positioned directly behind lens L4. .
We now employ our CS techniques, on the measured Fourier spectrum acquired after the low-pass filter (which has cut off a large fraction of the spectrum). The simplest CS technique (called basis pursuit; see Theory Appendix) facilitates the reconstruction of 3 accurate stripes, with the appropriate amplitudes and spacing [
Fig. 4(g)], thereby circumventing the loss of information caused by the low-pass filter. Importantly, the cross-section shown in
Fig. 4(k) reveals that the wiggles are absent. This shows that actually our CS technique performs better than any optical direct-imaging system, by removing the wiggles caused by the finite apertures of the lenses. In this vein, the CS-reconstructed spectrum is almost identical to the original (uncut) spectrum [
Fig. 4(h)].
Moving on to an optical image containing phase information, we perform measurements with the structure depicted in
Fig. 5(a): two closely-spaced in-phase stripes and a single stripe further away with an opposite phase. The spectrum of this image is shown in
Fig. 5(b), the cross section in
Fig. 5(c). The low-pass spatial filter is set to pass just the central region of the spectrum [
Fig. 5(e)]. Consequently, the image obtained via direct optical imaging (4f system) has two very broad peaks [
Fig. 5(d),
5(f)]. Note that, because the input phase information is basically zero and π, any low-pass filtered image always has at least two peaks. We then use CS to recover the image, including both amplitude and phase. To do that, we employ the NLHT algorithm (see Theory Appendix). The CS-recovered image, depicted in
Figs. 5(g) and
5(k), is in excellent agreement with the input image, in all of its features. Likewise, its CS-recovered spectrum is very similar to the spectrum of the original image [
Fig. 5(h)].
Figures 1–
5 demonstrate the ability to recover optical information at a resolution greatly exceeding the maximum resolution (defined by a low-pass filter in Fourier space), that can be recovered by direct optical imaging. Our CS techniques compensate for the loss of information by taking advantage of the sparsity of the input information. It is therefore instructive to estimate the highest resolution recoverable via CS, given the sparsity of the input information β, and the width of the pass-band of the low-pass filter Δ
k. In principle, in a noise-free scenario, the CS techniques could act by extending the pass-band up to Δ
k/(2
β). [As explained in the Theory Appendix, CS techniques cannot yield an improvement of 1/
β, because there is always a penalty of factor 2 for finding the proper basis]. This would amount to extending the pass-band of the transfer function of free-space
H(
kx
,
ky
), from Δ
k=4
π/
λ to Δ
k=4
π/(2
βλ). In the particular example of
Fig. 1 (
β=0.03), the recoverable feature can be as small as
λ/16. In optical microscopy of sparse objects such as living bacteria (where
β can be 0.01 and smaller), the resolution is even much higher. Apart from sparsity, another physical limitation is noise, which can never be eliminated. As demonstrated in
Fig. 1, CS techniques are rather robust to noise, although noise does reduce their performance. However, the detriment effects of noise can be minimized using over-sampling to increase the precision of the measurements. Using a beam-splitter in the optical system, one could measure simultaneously both the Fourier spectrum and the output image (both after low-pass filtering), and in principle - measure the field distribution in any plane between those. Hence, even though noise will still affect the results somewhat, its detriment effects could be minimized. Finally, the system analyzed in this article assumes coherent illumination (as used in many modern sub-wavelength imaging techniques [
8
J. B. Pendry, “Negative refraction makes a perfect lens,” Phys. Rev. Lett.
85, 3966–3969 (2000). [CrossRef] [PubMed]
–
12
Z. Liu, H. Lee, Y. Xiong, C. Sun, and X. Zhang, “Far-field optical hyperlens magnifying sub-diffraction-limited objects,” Science
315, 1686 (2007). [CrossRef] [PubMed]
]). However, CS techniques are general and can be extended also to imaging with incoherent light.
Fig. 4. Experimental proof-of-concept: reconstruction of amplitude information. (a,b,c) The original information consisting of three vertical stripes (a), its Fourier spectrum (b), and a horizontal cross-section of the amplitude, taken through the real-space information (c). (d,e,f) Using the optical slit, the signal is low-pass filtered at the vertical red lines, yielding a highly blurred image (d). The Fourier spectrum now contains now only the lowest frequencies (e), which cause the mergence of the three stripes (in real-space) into one, as seen in the horizontal cross section (f). (g,h,k) Reconstruction using CS methods yields a high quality recovered information (g) and its respective Fourier spectrum (h). The strong correspondence between original and recovery is clearly visible in the horizontal cross section (k).
The key ingredient of all CS techniques is sparsity of the input information. In fact,
β=0.5 poses a stringent fundamental condition on all CS techniques; that is, without sparsity, CS cannot provide any improvement. It is therefore important to note that the vast majority of natural objects, as well as artificial objects, are sparse. Notwithstanding that, the information does not necessarily have to be sparse in real space: it can be sparse in any mathematical basis that is sufficiently incoherent with the Fourier basis. Moreover, one can use a mask with random phase (speckles) in the near field right after the object, which projects more information from the original signal into the low-frequency range, thereby increasing the amount of measurable data [
32
M. Mishali and Y. C. Eldar, “From theory to practice: Sub-nyquist sampling of sparse wideband analog signals,” arXiv [0902.4291v1] (2009).
]. An excellent example for naturally-sparse information is the interior of a living bacterium, which occupies only a small fraction of the area of the cross sections, being therefore highly sparse. Another example of sparse objects, this time from the man-made world, is liquid crystals consisting of giant molecules with lengths slightly below the visible wavelength. In both of these examples, CS can provide a major improvement of ”looking beyond the resolution limit”. Of course, there are objects that are not sparse, for example, electronic chips. However, it is clear that sparse objects are not esoteric, but are rather common in very many systems, especially in biological specimen.
Finally, we emphasize that our approach can be applied to every optical microscope as a simple computerized image processing tool, delivering results in almost real time with practically no additional hardware
. Our technique is very general, and can be extended also to other, non-optical, microscopes, such as atomic force microscope, scanning-tunneling microscope, magnetic microscopes, and other imaging systems. The main idea presented here holds the promise to revolutionize the world of microscopy with just minor adjustments to current technology: sparse sub-wavelength images could be recovered by making efficient use of their available degrees of freedom.

Fig. 5.
Experimental proof-of-concept: reconstruction of amplitude
+
phase information
. An important feature of our proposed algorithm is the ability to recover both amplitude and phase, which is essential for pictorial information carried upon electromagnetic waves. (a,b,c) The original information consisting of three vertical stripes (a), its Fourier spectrum (b), and a horizontal cross-section of the amplitude, taken through the real-space information, revealing that the two stripes on the right are π-phase shifted with respect to the stripe on the left (c). (d,e,f) Using the optical slit, the signal is low-pass filtered at the vertical red lines, yielding a highly blurred image consisting of two distinct lobes (d). The Fourier spectrum now contains now only the lowest frequencies (e), which cause the mergence of the two stripes on the right, as seen in the horizontal cross section (f). (g,h,k) Reconstruction using CS methods yields a high quality recovered information (g) and its respective Fourier spectrum (h). The strong correspondence between original and recovery is clearly visible in the horizontal cross section (k).