1. Introduction
Passive polarimetric imaging has emerged in recent years as a useful sensing modality for detection and recognition [
1
J. S. Tyo, D. L. Goldstein, D. B. Chenault, and J. A. Shaw, “Review of passive imaging polarimetry for remote sensing applications,” Appl. Opt.
45(22), 5453–5469 (2006). [CrossRef] [PubMed]
]. The added information provided by polarization can aid in clutter reduction and object characterization. Imagery from microgrid polarimeters is obtained by incorporating a mosaic of pixel-wise micropolarizers on a focal plane array (FPA). Such imagers are akin to Bayer pattern color cameras, except the mosaic pattern is for polarization state, rather than color. With a typical microgrid polarimetric sensor, four polarimetric state images are acquired using a 2 × 2 pattern of polarizers. In this manner, each polarization image is obtained by subsampling the full FPA image by 2 in each dimension. Thus, the effective pixel pitch for each channel is twice that of the native FPA (1/2 the sampling frequency). Because of the desire for a wide field of view and other factors, many imaging systems are equipped with optics that allow some undersampling for a given FPA. By employing the microgrid with the same optics and FPA, undersampling and resulting aliasing artifacts are increased [
2
B. M. Ratliff, C. F. LaCasse, and J. S. Tyo, “Interpolation strategies for reducing IFOV artifacts in microgrid polarimeter imagery,” Opt. Express
17(11), 9112–9125 (2009). [CrossRef] [PubMed]
,
3
J. S. Tyo, C. F. LaCasse, and B. M. Ratliff, “Total elimination of sampling errors in polarization imagery obtained with integrated microgrid polarimeters,” Opt. Lett.
34(20), 3187–3189 (2009). [CrossRef] [PubMed]
].
Multi-frame super-resolution (SR) enhancement methods have been widely demonstrated to improve the effective sampling rate of undersampled imagers [
4
S. C. Park, M. K. Park, and M. G. Kang, “Super-resolution image reconstruction: a technical overview,” IEEE Signal Process. Mag.
20(3), 21–36 (2003). [CrossRef]
]. By using multiple low-resolution (LR) frames from video, these SR algorithms exploit sub-pixel motion between frames to effectively enhance the sampling rate for the native FPA. By doing so, these SR methods effectively trade temporal resolution for spatial resolution. In addition to enhancing the sampling rate and reducing aliasing, the SR algorithms typically perform restoration to reduce noise and blur. Generally speaking, iterative image reconstruction based SR methods are among the best performing algorithms, but come with a relatively high computational cost [
4
S. C. Park, M. K. Park, and M. G. Kang, “Super-resolution image reconstruction: a technical overview,” IEEE Signal Process. Mag.
20(3), 21–36 (2003). [CrossRef]
]. The simplest SR algorithms, in a computational sense, are the interpolation-restoration approaches [
4
S. C. Park, M. K. Park, and M. G. Kang, “Super-resolution image reconstruction: a technical overview,” IEEE Signal Process. Mag.
20(3), 21–36 (2003). [CrossRef]
]. Here we shall consider these two classes of SR algorithms as applied to polarimetric imagery. Because of their ability to help overcome undersampling, we believe multi-frame SR algorithms are a powerful match for microgrid polarimeters. To the best of our knowledge, the only publications to date exploring this connection were the result of preliminary work led by two of the current coauthors [
5
B. M. Ratliff, J. S. Tyo, W. T. Black, and C. F. LaCasse, “Exploiting motion-based redundancy to enhance microgrid polarimeter imagery,” Proc. SPIE
7461, 74610K (2009). [CrossRef]
,
6
D. A. Lemaster, “Resolution enhancement by image fusion for microgrid polarization imagers,” in IEEE Aerospace Conference, Big Sky, MT (2010).
]. We believe the current paper represents the first comprehensive treatment of the subject. In [
5
B. M. Ratliff, J. S. Tyo, W. T. Black, and C. F. LaCasse, “Exploiting motion-based redundancy to enhance microgrid polarimeter imagery,” Proc. SPIE
7461, 74610K (2009). [CrossRef]
], Ratliff
et al used the iterative multi-frame SR algorithm in [
7
R. C. Hardie, K. J. Barnard, J. G. Bognar, E. E. Armstrong, and E. A. Watson, “High resolution image reconstruction from a sequence of rotated and translated frames and its application to an infrared imaging system,” Opt. Eng.
37(1), 247–260 (1998). [CrossRef]
] applied to the four demosaiced polarimetric channels independently. LeMaster explored a simple and fast multi-frame SR algorithm in [
6
D. A. Lemaster, “Resolution enhancement by image fusion for microgrid polarization imagers,” in IEEE Aerospace Conference, Big Sky, MT (2010).
] that registered and interlaced samples from multiple frames, also using independent channel processing. The work in [
5
B. M. Ratliff, J. S. Tyo, W. T. Black, and C. F. LaCasse, “Exploiting motion-based redundancy to enhance microgrid polarimeter imagery,” Proc. SPIE
7461, 74610K (2009). [CrossRef]
,
6
D. A. Lemaster, “Resolution enhancement by image fusion for microgrid polarization imagers,” in IEEE Aerospace Conference, Big Sky, MT (2010).
] demonstrated that SR can be used to enhance the resolution of polarimetric data from a microgrid imager, but neither fully exploited the redundant information available in the microgrid array.
Here we present two new multi-channel multi-frame SR algorithms specifically designed for processing imagery from microgrid polarimeters. The new SR methods are designed to provide improved performance by exploiting correlation between the polarimetric channels and by using a detailed observation model that includes an accurate point spread function (PSF). One of the two new SR algorithms presented here uses a form of regularized least squares (RLS) related to that in [
7
R. C. Hardie, K. J. Barnard, J. G. Bognar, E. E. Armstrong, and E. A. Watson, “High resolution image reconstruction from a sequence of rotated and translated frames and its application to an infrared imaging system,” Opt. Eng.
37(1), 247–260 (1998). [CrossRef]
] and has an iterative solution. The proposed RLS algorithm is novel in that here we
jointly estimate all of the polarimetric images and use a new regularization function that exploits correlation among the polarimetric channels. The other proposed algorithm is a new type of adaptive wiener filter (AWF) SR method [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
] designed for microgrid image data. The microgrid AWF method leads to a faster implementation that is more readily suitable to real-time processing with appropriate hardware than the RLS method. We study the performance of the two new microgrid SR methods using imagery from a long-wave infrared (LWIR) microgrid polarimeter. We also present results using simulated microgrid data derived from a visible polarimetric imaging system. These simulated data are used to allow for a new quantitative performance analysis. Our results demonstrate that there can be significant benefits to using SR with microgrid polarimeters. We also show how multichannel SR processing that exploits the correlation between the polarimetric channels yields improved performance over independent channel processing.
The remainder of this paper is organized as follows. In Section 2, background on polarimetric imagery is briefly reviewed and the observation models are presented. The new microgrid RLS SR algorithm is presented in Section 3 and the new microgrid AWF SR method is presented in Section 4. Experimental results are presented in Section 5 to illustrate the efficacy of the proposed methods. Finally, we offer conclusions in Section 6.
2. Observation model
The sensor we are focusing on is equipped with four polarization analyzers oriented at angles
θ
1 = 0
°,
θ
2 = 45
°,
θ
3 = 90
°, and
θ
4 = 135
°. The microgrid configuration for acquiring these four polarimetric images is shown in
Fig. 1. As shown, the microgrid image can be deinterlaced to produce the four individual polarimetric images. The choice of angles is designed to make the computation of Stokes vector images convenient [
1
J. S. Tyo, D. L. Goldstein, D. B. Chenault, and J. A. Shaw, “Review of passive imaging polarimetry for remote sensing applications,” Appl. Opt.
45(22), 5453–5469 (2006). [CrossRef] [PubMed]
,
9
R. M. A. Azzam and N. M. Bashara, Ellipsometry and Polarized Light (North-Holland, 1977).
]. In general, let the ideally-sampled high-resolution (HR) polarimetric image, with
P channels and
N pixels per channel, be represented in lexicographical notation as
p = [
p
1,
p
2,...,
pNP
]
T
. Breaking this down into individual polarimetric channels, we get
, where
p
θi
= [
pθi
,1,
pθi
,
2,...,
pθi,N
]
T
. For such a four-angle sensor, the first three Stokes images [
1
J. S. Tyo, D. L. Goldstein, D. B. Chenault, and J. A. Shaw, “Review of passive imaging polarimetry for remote sensing applications,” Appl. Opt.
45(22), 5453–5469 (2006). [CrossRef] [PubMed]
] can be estimated as
The
s
0 term is the total light intensity, and
s
1 and
s
2 present linear polarization content. Note that since we have four polarimetric states being measured and three Stokes images relating to linear polarization, there is some redundancy. That is, ideally we should have
s
0,
j
=
p
0,
j
+
p
90,
j
=
p
45,
j
+
p
135,
j
. Later we shall seek to exploit this redundancy in our SR algorithms. Another useful statistic, related to the Stokes images, is the degree of linear polarization (DoLP) [
1
J. S. Tyo, D. L. Goldstein, D. B. Chenault, and J. A. Shaw, “Review of passive imaging polarimetry for remote sensing applications,” Appl. Opt.
45(22), 5453–5469 (2006). [CrossRef] [PubMed]
] given by
Fig. 1 Microgrid pattern for the polarimetric imager considered here. Four polarization images are acquired using a 2 × 2 repeating pattern of micropolarizers on the FPA.
For multi-frame SR, let the number of observed LR microgrid frames be K. Let the LR observed pixels from frame k be expressed in lexicographical notation as g(k), where k = 1, 2,...,K. Breaking down each observed frame by polarimetric channel we get
, where g
θi
(k) = [gθi
,1 (k), gθi
,2 (k),..., gθi
,
M
(k)]
T
. Let the complete set of all observed pixels in lexicographical notation be specified by g = [g
T
(1), g
T
(2),...,g
T
(K)]
T
= [g
1, g
2,..., gKMP
]
T
.
A block diagram illustrating the observation model relating
p to the observed frame
g(
k) is presented in
Fig. 2. The ideal polarimatric data in
p is assumed to be sampled above the Nyquist rate. The motion between frames is parameterized with the vector
s(
k). Next, the model includes a linear shift invariant blurring using the discrete PSF for the optical system. This is followed by down-sampling by a factor of
L. Finally, the down-sampled images are mosaiced into the pattern in
Fig. 1 and noise is introduced to obtain
g(
k). This observation model can be mathematically expressed such that each observed pixel is a weighted sum of the ideal pixels in
p plus noise. The particular weights depend on the system PSF, the motion between frames, and the microgrid layout. In particular, the LR pixels can be expressed in terms of ideal HR pixels as follows
where
hθ
,i,j
(
k) is the contribution of
pθ
,j
to
gθ
,i
(
k) and
nθ
,i
(
k) is the noise term for
gθ
,i
(
k). This model can be express compactly for all frames jointly as
where
H is a
KMP ×
NP matrix and
n is a
KMP × 1 vector of noise samples.
Fig. 2 Block diagram illustrating the observation model relating p to the observed frames g(k), for k = 1,2,...,K.
The observation model in
Eqs. (3) and
(4) will be used for the RLS SR algorithm. However, to aid in the development of the fast AWF algorithm here, we consider an alternate observation model based on that in [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
]. The alternate observation model is shown in
Fig. 3. Here the motion model is commuted with the PSF and integrated into what becomes a nonuniform sampling operator. This alternative observation model is equivalent to the model in
Fig. 2 for translational motion. It is also equivalent for rotational motion when the PSF is circularly symmetric [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
]. The power of this model is that it leads to the computationally simple interpolation-restoration SR algorithms [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
]. Based on this model, nonuniform interpolation can be used to recover noisy, but uniformly spaced, samples of
f (
x,y,θi
) from
g. Then, restoration is used to estimate the corresponding samples of
p(
x,y,
θi
).
Fig. 3 Alternative observation model valid for translational motion (and rotational motion when the PSF is circularly symmetric). Here the motion model is commuted with the PSF and integrated into what becomes a nonuniform sampling operator.
Let us now consider the modeling of the system PSF. We model the optical transfer function (OTF) with three components
where
u and
v are the horizontal and vertical spatial frequencies in cycles per millimeter,
H
dif(
u, v) is from the diffraction-limited optics,
H
abr(
u, v) models optical aberrations, and detector integration is included as
H
det(
u, v). The OTF from diffraction-limited optics with a circular pupil function is given by [
10
J. Goodman, Introduction to Fourier Optics (McGraw-Hill, 1968).
]
where
and
, and 𝒩 is the
F/# of the optics. Since a purely diffraction-limited optical system is perhaps overly idealistic, we also include the following aberration OTF [
11
R. R. Shannon, “Aberrations and their effect on images,” Proc. SPIE
531, 27–37 (1985).
]
where
WRMS
is the root mean square wavefront error [
11
R. R. Shannon, “Aberrations and their effect on images,” Proc. SPIE
531, 27–37 (1985).
]. The detector component of the OTF is found as the Fourier transform of a mask with the geometry of the active area of an individual detector in the FPA. Finally, the system PSF is the inverse Fourier transform of the overall OTF in
Eq. (5).
The microgrid polarimetric imager used here has the following specifications. It has a spectral bandwidth of
λ = 7.8 – 9.8
μm. We use a wavelength of
λ = 9
μm for our PSF model. The system uses
F/2 optics and has a full FPA pixel pitch of 25
μm with approximately 100% fill factor rectangular detectors. Note that for this microgrid imager, the pixel pitch for pixels of like polarization is actually 50
μm. Thus, the effective active area for the like-polarization sub-array detectors is modeled as spanning half the 50
μm like-polarization pixel pitch in each dimension. This equates to a 25% fill factor (taking into account the skipping of detectors in the FPA as shown in
Fig. 1). For abberation modeling, we use
WRMS
= 1/14 [
11
R. R. Shannon, “Aberrations and their effect on images,” Proc. SPIE
531, 27–37 (1985).
]. Cross sections of the relevant 2-D modulation transfer functions (MTFs) are shown in
Fig. 4(a) for this system. The continuous PSF is shown in
Fig. 4(b). The discrete impulse invariant PSF, for use in the observation model operating on the HR image, can be found by sampling the continuous PSF with a sampling period of 50
/L
μm.
Fig. 4 (a) Cross sections of theoretical 2-D MTF and its components for the microgrid polarimetric imager used here. (b) Theoretical continuous PSF for L = 4.
Note that the Nyquist or folding frequency (i.e., 1/2 sampling frequency) for a single channel is 10 cycles/mm, as shown in green in
Fig. 4(a). This is well below the 55.56 cycles/mm cut-off frequency of the optics. Any scene frequency content above the Nyquist frequency is “folded” into lower frequencies, creating aliasing artifacts. Thus, this system is likely to exhibit significant aliasing without SR. Another point to note from the overall MTF is that it has a low pass blurring effect. However, the blurring cannot be combatted with a linear shift-invariant filter without first (or jointly) addressing the undersampling issue. Otherwise, the aliasing artifacts will simply be amplified. Based on the overall system MTF shown in
Fig. 4(a), we believe that
L = 4 is a practical choice for SR processing for this sensor. The effective sampling frequency is then 80 cycles/mm, with a Nyquist frequency of 40 cycles/mm as shown in yellow in
Fig. 4(a). In addition to aliasing reduction, we also hope to restore the imagery from the blurring effects of the overall MTF.
3. RLS SR for microgrid polarimeters
Based on the observation model depicted in
Fig. 2 and detailed in
Eq. (3), we propose an RLS estimator for
p. The output of the RLS algorithm is given by
The cost function to be minimized is given by
The first term in
Eq. (9) is the main term that is minimized when
p projected through observation model matches the observed data
g. Since this term alone may be insufficient to provide a unique solution, the second and third terms act as a regularization function. The second term acts as a smoothness constraint on the Stokes images. For this smoothness term we use a discrete Laplacian operator given by
Finally, the third term in
Eq. (9) exploits the redundancy in the four polarimetric measurements. This term is minimized when
p
0,
j
+
p
90,
j
=
p
45,
j
+
p
135,
j
. For an ideal noise-free sensor, with no cross-talk between polarization channels, this equality should be true [
1
J. S. Tyo, D. L. Goldstein, D. B. Chenault, and J. A. Shaw, “Review of passive imaging polarimetry for remote sensing applications,” Appl. Opt.
45(22), 5453–5469 (2006). [CrossRef] [PubMed]
] and this cost function term would go to zero. However, in practice, noise and non-ideal sensor characteristics make this a small but non-zero term. Note that each term in
Eq. (9) has a corresponding scaling parameter. The value of these constants govern the relative impact of each term on the cost function, and ultimately the final estimate. Specifically, the first term is scaled by
, which would typically be related to the expected noise variance in the observation model. With high noise level, we would expect a larger error in the first cost function term, even for the true image as the estimate. Thus, employing a larger
reduces the cost associated with this expected higher error. The second term has scaling parameters of
, for
m = 0, 1, 2. These reflect our
a priori assumptions about the variance of the Laplacian of the Stokes images as defined using
Eq. (10). Finally, the third term is scaled by
, which reflects our expectation for the variance of the difference between
p
0,
j
+
p
90,
j
and
p
45,
j
+
p
135,
j
. Note that it is only the relative values of these scaling constants that matter for the algorithm.
While this RLS approach is based on that in [
7
R. C. Hardie, K. J. Barnard, J. G. Bognar, E. E. Armstrong, and E. A. Watson, “High resolution image reconstruction from a sequence of rotated and translated frames and its application to an infrared imaging system,” Opt. Eng.
37(1), 247–260 (1998). [CrossRef]
], what makes
Eq. (9) novel are the regularization terms. The RLS algorithm in [
7
R. C. Hardie, K. J. Barnard, J. G. Bognar, E. E. Armstrong, and E. A. Watson, “High resolution image reconstruction from a sequence of rotated and translated frames and its application to an infrared imaging system,” Opt. Eng.
37(1), 247–260 (1998). [CrossRef]
] uses only one regularization term, a smoothness term for the single channel data. Here we have multichannel data with physical constraints, like the redundancy exploited in the third term in
Eq. (9). Note also, that the smoothness terms are applied here to the Stokes images, and not the original polarimetric channels. This is done for several reasons. First, the Stokes images may be a final product for such an imaging system. By having smoothness terms applied here, we better ensure that this product will be free from high frequency artifacts. Furthermore, {
s
1,
j
} and {
s
2,
j
} are known to generally have relatively low energy (i.e., low variance). Thus, applying a term favoring small high-frequency energy in these channels is sensible. The Stokes image {
s
0,
j
} is the total intensity image. This should contain most of the energy. One may choose to use the smoothness constraint on this term minimally by employing a high
σs
0
. The parameters
σs
1
and
σs
2
may be smaller, because of the expected lower energy in these images. Finally, because the Stokes terms interrelate the different polarization channels, it allows us to exploit their distinct spatial sampling positions better than if we applied a smoothness constraint on each channel independently. The approach of applying smoothness regularization to the Stokes images in polarimetric data may be viewed as akin to using luminance and chrominance coordinates in color processing [
12
T. Gotoh and M. Okutomi, “Direct super-resolution and registration using raw CFA images,” in IEEE Conference on Computer Vision and Pattern Recognition , vol. 2, pp. 600–607 (Los Alamitos, CA, 2004).
–
14
L. Condat, “A generic variational approach for demosaicking from an arbitrary color filter array,” in Proceedings of IEEE ICIP , pp. 1625–1628 (2009).
]. It is interesting to note that the RLS estimate can be viewed as a maximum
a posteriori (MAP) estimator in the case of independent and identically distributed Gaussian noise [
15
R. C. Hardie, K. J. Barnard, and E. E. Armstrong, “Joint MAP registration and high resolution image estimation using a sequence of undersampled images,” IEEE Trans. Image Process.
6(12), 1621–1633 (1997). [CrossRef] [PubMed]
]. In the MAP framework, the quadratic nature of the regularization terms in
Eq. (9) can be seen as resulting from a Gaussian prior pdf.
To minimize the cost function in
Eq. (9), one may use a gradient descent or conjugate gradient algorithm similar to that presented in [
7
R. C. Hardie, K. J. Barnard, J. G. Bognar, E. E. Armstrong, and E. A. Watson, “High resolution image reconstruction from a sequence of rotated and translated frames and its application to an infrared imaging system,” Opt. Eng.
37(1), 247–260 (1998). [CrossRef]
]. Here we develop the somewhat simpler gradient descent approach. We initialize the RLS estimation process by registering all
K frames to a common reference. A convenient choice of reference is the most recent observed frame. Note that we need to register the frames to subpixel accuracy. We have found that we get better registration performance by deinterlacing the channels, applying the registration algorithm in [
7
R. C. Hardie, K. J. Barnard, J. G. Bognar, E. E. Armstrong, and E. A. Watson, “High resolution image reconstruction from a sequence of rotated and translated frames and its application to an infrared imaging system,” Opt. Eng.
37(1), 247–260 (1998). [CrossRef]
], and then averaging the registration parameters obtained from the four channels. With this and the PSF from Section 2, we have a complete observation model. Next, we take the reference image and use single frame interpolation to obtain an initial estimate of
p, denoted
p̂
0. To improve this estimate using gradient descent, we need the gradient of
Eq. (9) with respect to
p, denoted ∇
p
C(
p). This gradient is composed of the partial derivative of the cost function with respect to each of the HR samples in
p. This can be expressed as
where
The partial derivatives making up the gradient can be computed from
Eq. (9), yielding
where the observation model gradient term is given by
and the model error component is
The Stokes gradient term is given by
Finally, the redundancy gradient term is
Given the gradient, the gradient descent updates are computed as follows
where
ɛn
is the step size for iteration
n. The iterations may be performed a predetermined number of times, or may be stopped with a halting criterion, such as ||
p̂
n
–
p̂
n
−1|| <
T, where
T is a threshold.
The optimum step size at each iteration can be found by optimizing
C(
p̂
n
) as a function of
ɛn
, given
p̂
n
−1. In particular, we set the derivative of
C(
p̂
n
) with respect to
ɛ
n
equal to zero, and solve for
ɛn
. This calculation results in an optimum step size of the form
The individual terms in the optimum step size are given by
where
and
Computing the optimum step size in this fashion tends to be faster than performing a search. A final point to note about the RLS microgrid SR method is that we can easily treat “bad” pixels by simply setting the error,
eθ
,i
(
k) as defined in
Eq. (18), to zero for any
θ,
i, and
k for which
gθ
,i
(
k) is known to be a bad pixel value. This is convenient as it obviates the need for a separate bad pixel correction algorithm, even when using a single frame. This is how we address bad pixels for the RLS SR results presented in Section 5. Other bad pixel replacement strategies that exploit microgrid redundancy have been proposed in [
16
B. M. Ratliff, J. S. Tyo, J. K. Boger, W. T. Black, D. L. Bowers, and M. P. Fetrow, “Dead pixel replacement in LWIR microgrid polarimeters,” Opt. Express
15(12), 7596–7609 (2007). [CrossRef] [PubMed]
].
4. AWF SR for microgrid polarimeters
Unlike the RLS, the AWF SR method [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
] is non-iterative and has a potentially faster implementation than the RLS. Here we propose a new type of the AWF SR algorithm [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
] for use with a microgrid imager. As in [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
], the microgrid AWF uses a moving observation window on the HR grid as shown in
Fig. 5. The observation window is shown for
L = 4 and is populated with pixels from a single frame for simplicity. With multiple frames of data with motion between them, the observation window will contain additional sets of pixels from each polarization state, positioned according to the motion. In general, let the observation window span
Wx
HR pixel spacings in the horizontal direction and
Wy
HR pixel spacings in the vertical direction. All of the LR pixels of any polarization that lie within the span of this observation window are placed into an observation vector
g
i
= [
gi
,1,
gi
,2,...,
gi,Ji
]
T
, where
i is the window positional index and
Ji
is the number of LR pixels within the window span. The samples within the observation window will be used to estimate the HR pixels within the generally smaller
Dx
×
Dy
sample estimation window at the center of the observation window, as shown in
Fig. 5. Thus, the AWF SR method is effectively performing nonuniform interpolation and restoration simultaneously. One of the main differences here, versus the AWF in [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
], is that the observation window contains LR pixels from multiple polarizations. Furthermore, at each HR pixel location, we must estimate all of the polarization states, not just a single value.
Fig. 5 Observation window for the microgrid AWF algorithm for polarimetric data with L = 4. For simplicity, the window is shown populated with pixels from only a single frame.
The HR pixel estimates for the samples within the estimation window are obtained using a weighted sum of the LR pixels in the observation window. This is expressed as
where
d̂
i
= [
d̂i
,1,
d̂i
,2,...,
d̂i,DxDyP
]
T
and
W
i
is a
Ji
×
DxDyP matrix of weights. Each column of
W
i
contains weights used to estimate the value of one particular HR pixel at one polarization state inside the estimation window. The observation window moves across the HR grid in a raster scan fashion stepping by increments of
Dx
and
Dy
in the horizontal and vertical directions, respectively (non-overlapping estimation sub-windows).
We wish to employ the minimum mean squared error (MSE) filter weights in
Eq. (31). The additional challenge we face when processing microgrid data is that we must not only consider the spatial arrangement of samples in a given observation window, but also their polarization state. Given a correlation model for the observed samples, the minimum MSE weights [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
] are given by
where
is the autocorrelation matrix for the observation vector and
is the cross-correlation between the true desired vector
d
i
and the observation vector. The goal now is to provide the statistics in
R
i
and
P
i
. As in [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
], let
f
i
be the noise-free version of
g
i
, such that
g
i
=
f
i
+
n
i
, where
n
i
is the random noise vector. We shall assume that the noise vector is zero-mean with independent and identically distributed elements of variance
. In this case, the autocorrelation matrix for the observation vector is given by
and cross-correlation matrix is
To fill the matrices in
Eqs. (33) and
(34), we need a statistical model to provide the needed correlations. Consider a correlation model, specifically designed for microgrid data, that incorporates knowledge of the spatial separation between samples, and the polarization state of each. For the desired data, we denote this correlation function as
rdθdϕ
(
x,y). The variables
θ and
ϕ represent the polarization angles of the two samples involved, and
x,y represents the spatial separation. Here we propose the following model
where
controls correlation between samples at the same spatial location and
ρθ,ϕ
controls the decay of the correlation with separation distance. We have obtained useful results using
ρθ,ϕ
=
ρ for all
θ, ϕ, and
Note that if
α = 0, we assume no correlation between samples of different polarizations regardless of their spatial proximity. This gives rise to an algorithm essentially equivalent to deinterlacing the microgrid data and applying independent AWF SR [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
] to each channel. In most cases, this is not the best approach, as there is often significant correction across polarization channels. By letting
α = 1, this is equivalent to applying a single AWF to the mosaic microgrid data as if it were a standard FPA and each pixel represents the same physical quantity. In that case, spatial separation is the only factor altering the correlation model. This is also not likely the best approach as samples from the same polarization generally do exhibit higher correlation than those of different polarizations. Thus, one would want to select
α in the range 0 ≤
α ≤ 1 based on expected scene content. A relatively high
α is appropriate when we have weakly polarized data, like we may see in LWIR. A lower
α may be used for more highly polarized data with a high average DoLP.
Given
rdθdϕ
(
x,y), it can be shown that the cross-correlation function between
p(
x,y,
θ) and
f (
x,y,
ϕ), as shown in
Fig. 3, can be expressed as
The cross-correlation between
f (
x,y,
θ) and
f (
x,y,ϕ) is given by
By registering the LR frames, the spatial locations and corresponding polarizations of all the LR pixels that make up
g
i
, which are the same for
f
i
, are known. Thus, the horizontal and vertical displacements between these samples can be computed. Evaluating
Eq. (38) using the displacements and the corresponding polarizations allows us to fill the correlation matrix
. Next,
R
i
can be found from
Eq. (33), given the noise variance. Similarly, we compute the displacements between the samples in the estimation window and the LR pixels in the observation window. With these displacements and the corresponding polarizations, we evaluate
Eq. (37) and complete
P
i
as given in
Eq. (34). Finally, the weights are found using
Eq. (32).
As described in [
8
R. Hardie, “A fast image super-resolution algorithm using an adaptive Wiener filter,” IEEE Trans. Image Process.
16(12), 2953–2964 (2007). [CrossRef] [PubMed]
], there are circumstances where one only needs to compute one set of weights for processing all the observation windows for an entire output frame. This allows for very fast processing. This fortunate case occurs when we have global translational motion and the statistical parameters are assumed to be constant across the image. With global translational motion, the same spatial pattern of LR samples is observed in each observation window, so long as the observation and estimation window sizes are an integer multiple of
L. If we have bad pixels, we can simply exclude those from the observation vector
g
i
, so that they do not contribute to the output. However, this requires that a distinct set of weights be computed around any bad pixels, as it changes the spatial distribution of samples in the observation window. If processing speed is critical, a bad pixel replacement algorithm can be employed prior to AWF filtering [
16
B. M. Ratliff, J. S. Tyo, J. K. Boger, W. T. Black, D. L. Bowers, and M. P. Fetrow, “Dead pixel replacement in LWIR microgrid polarimeters,” Opt. Express
15(12), 7596–7609 (2007). [CrossRef] [PubMed]
]. This faster approach is how we address bad pixels for the AWF SR results presented in Section 5.
An example of some polarimetric AWF filter weights is shown in
Fig. 6. The brightest pixel corresponds to a weight of 1.41 and the darkest is −0.17. The remaining weights are scaled linearly. For reference, note that the perimeter weights are very close to zero. These weights are computed for
K = 1 frame,
P = 4 polarimetric channels, and an upsampling factor of
L = 2. Thus, we assume we have a single microgrid frame and we form an estimate of each polarization at each FPA location. This can be viewed as demosaicing with restoration. This simple case is shown to make interpreting the weights more straightforward than with mulitple frames and/or more upsampling. In computing these weights, we assume the noise variance is
and
. The observation window size is
Wx
=
Wy
= 10 and the estimation window size is
Dx
=
Dy
= 2. The weights correspond to estimating the upper left pixel in the estimation window. This location is highlighted with a plus sign in
Fig. 6. Each of the four columns in
Fig. 6 corresponds to a different polarization sample being estimated for the one spatial location, and each row corresponds to using a different
α in computing the weights. In particular, the columns from left to right correspond to polarization angles of
θ = 0
°, 45
°, 90
° and 135
°. The rows from top to bottom correspond to
α = 0, 0.7, 1.0. Note that when
α = 0 (top row), the weights are only nonzero for pixels on the microgrid of the same polarization as the output. On the other extreme, when
α = 1 (bottom row), the weights treat all pixels on the migrogrid array as equivalent, ignoring their polarization differences. Thus, the output for all polarizations is formed with the same weights, yielding the same output. This would be appropriate when the DoLP is assumed to be zero, and all polarization channels should be equal. The case where
α = 0.7 (middle row) yields an interesting result, where samples from all polarizations are combined to form each output, especially when estimating at a location where that polarization is not measured (columns 2–4). The advantage of using samples with polarizations other than the one being estimated is that they offer spatial locations not represented by pixels of like polarization. Spatially close pixels of a different polarization may be more correlated to the sample we are estimating than ones farther away that have the same polarization. Thus, using
α, we control the weight given to samples with polarizations different than the polarization being estimated relative to those of like polarization. When multiple frames are present, the picture is more intricate as samples of various polarizations are distributed potentially nonuniformly throughout the observation window. However, the principle is the same where
α balances spatial correlation with polarization correlation.
Fig. 6 AWF filter weights for K = 1 frame, P = 4 channels, Wx
= Wy
= 10 observation window, Dx
= Dy
= 2 estimation window and L = 2 upsampling (single frame demosaicing with restoration). The weights correspond to estimating the pixel location with the red plus sign. The columns from left to right correspond to polarization angles of θ = 0°, 45°, 90° and 135°, respectively. The rows from top to bottom correspond to α = 0, 0.7, 1.0.
We have found that in some cases we can improve the microgrid AWF results by preprocessing the microgrid to compensate for disparate local statistics of the individual channels. In particular, if we subtract the local means and divide by the local standard deviations of the individual channels, they tend to exhibit greater correlation. This allows us to use a larger
α and better exploit the spatial sampling diversity provided by the microgrid as shown in
Fig. 1. To better describe this method, let
a(
i, j) represent a sample from the microgrid array, where
i = 1,2,...,
M is the 2 × 2 super-pixel index and
j is the polarization sample index within the super-pixel. Let
j = 1,2,3,4 correspond to
θ = 0
°, 45
°, 90
0, 135
°, respectively. Local means, denoted
ā(
i, j), are estimated by deinterlacing the channels and applying a Gaussian low-pass filter with standard deviation of
σ pixels. Local standard deviations, denoted
ã(
i, j), are estimated by subtracting the local mean and then using a Gaussian low-pass filter on the squared difference, and taking the square root of the result. In terms of these variables, the preprocessing step is expressed as
After SR processing, the local means and standard deviations are interpolated, shifted appropriately to match the HR grid, and reintroduced to the SR result to restore the estimated channels to their proper dynamic range. Even without SR processing, we have found that a local statistics fusion (LSF) method can be useful as a stand-alone demosaicing approach. In that case, the output is given by
Here
b(
i, j,k) is the output for super-pixel
i = 1,2,...,
M, position
j = 1,2,3,4 within the super-pixel, and with polarization index
k = 1,2,3,4. The variables
āj
(
i, k) and
ãj
(
i, k) are the mean and standard deviation, respectively, for super-pixel
i and polarization
k repositioned through interpolation to align with spatial position
j. Note that the low-pass nature of the statistic images make them more suited to interpolation than the raw data. This method works best when there is high correlation between the polarimetric channels, but perhaps a scale and bias difference.