2. Change in aberrations with object position
Consider the problem of influence of the change in the object position on imaging properties of a general rotationally symmetrical optical system.
Figure 1
shows imaging of two different planes by the optical system. Suppose that the optical system is aberration-free for imaging of the plane ξ
A, then an arbitrary point
A of this plane is imaged as the point
A´ in the plane ξ′
A. Choose now the different plane ξ
B. An arbitrary point
B of this plane will not be imaged by the optical system as the point
B´, but as the circle of least confusion
dB
according to aberrations of the optical system, which originate from the fact that the planes ξ
A and ξ
B are not identical. It causes the deviation of imaging properties. The relationship between the size of the object and its image will not be further linear. If we use such optical system for measurement the mentioned effect causes the measurement error, which cannot be removed. If the optical system is aberration free for a specific position of the object, then it has aberrations for other object positions and the image has lower quality.
Fig. 1 Imaging of two different planes by optical system.
The described problem will be analyzed using the theory of third-order aberrations [
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
–
8
C. G. Wynne, “Primary aberrations and conjugate change,” Proc. Phys. Soc.
65B, 429–437 (1952).
] which enables to obtain the solution in a simple analytical form. Consider the rotationally symmetrical optical system. Aberration properties for light of a specific wavelength are given by its third-order aberration coefficients
SI, SII, SIII, SIV, SV
, and
SVI
, where
SI
is the coefficient of spherical aberration,
SII
is the coefficient of coma,
SIII
is the coefficient of astigmatism,
SIV
is the Petzval’s sum,
SV
is the coefficient of distortion, and
SVI
is the coefficient of spherical aberration in pupils. We obtain the following equations [
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
–
6M. Herzberger, Modern Geometrical Optics (Interscience, 1958).
] for transverse ray aberrations within the validity of the third-order aberration theory
where
are transverse ray aberrations in the object plane, and
are transverse ray aberrations in the image plane. The object can be represented in a general case as the image created by the preceding optical system. Values δ
x and δ
y describe transverse ray aberrations of the preceding optical system in the object plane of the optical system under consideration. We can set δ
x = 0 and δ
y = 0 if any optical system is not located in front of the considered optical system. The coefficients in previous formulas are given by
where
m is the transverse magnification of the optical system,
g is the angular magnification of the optical system,
gP
is the angular magnification of the optical system in pupils,
p
1 is the distance from the object to the entrance pupil,
xP
1,
yP
1 are the coordinates of the intersection of the ray with the plane of entrance pupil,
y is the size of the object, and
n,
n' are indices of refraction of object and image space. The case, when object and image surfaces are not planar, is described e.g. in Ref.
6M. Herzberger, Modern Geometrical Optics (Interscience, 1958).
. Coefficients
SI
,
SII
,
SIII
,
SIV
,
SV
, which describe imaging properties of the optical system for an arbitrary position of the object (arbitrary transverse magnification
m), can be expressed using the third-order aberration coefficients
. These coefficients characterize imaging properties of the optical system for imaging of the object at infinity (transverse magnification
m = 0), and the coefficient of spherical aberration in pupils
. Formulas for aberration coefficients
SI
,
SII
,
SIII
,
SIV
,
SV
[
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
–
8
C. G. Wynne, “Primary aberrations and conjugate change,” Proc. Phys. Soc.
65B, 429–437 (1952).
] can be rewritten after a tedious derivation into the following matrix form
where we denoted
The coefficients
are given by the following formulas
where
f′ is the focal length of the optical system.
Equations (2) represent generally valid formulas for calculation of aberration coefficients of the optical system corresponding to varying distance of the object. Moreover,
Eq. (2) can be modified into the form where the dependence on the magnification (position of the object) is explicitly expressed in contrast to previously published papers. We obtain the following linear system:
where
From previous relations, it is evident that if aberration coefficients of the optical system are known for one value of magnification, than we can calculate aberrations coefficients of the optical system for any other value of magnification. If we write
Eq. (3) for two different values of magnification
and
, we obtain for the vectors of Seidel aberration coefficients the following simple formula
One can see from the previous equation the advantage of the matrix form of formulas for aberration coefficients. The matrix
B has to be calculated once for all and one can use it for different values of magnification. The matrix form is also very useful for zoom lens design [
10
A. Mikš, J. Novák, and P. Novák, “Method of zoom lens design,” Appl. Opt.
47(32), 6088–6098 (2008). [CrossRef] [PubMed]
].”
It can be shown that previous formulas are generally valid (within the validity of the third-order aberration theory) and do not depend on the composition of the optical system [
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
–
9
A. Walther, “Irreducible aberrations of a lens used for a range of magnifications,” J. Opt. Soc. Am. A
6(3), 415–422 (1989). [CrossRef]
]. Properties of the optical system are then fully specified by its focal length
f′ and third-order aberration coefficients
, which characterize imaging properties of the optical system for imaging of the object at infinity. Aberration coefficients
SI
,
SII
,
SIII
,
SIV
,
SV
are calculated for the following input values
and aberration coefficients
are determined for the following input values
where
and
is the paraxial incidence height and angle of the aperture ray (first auxiliary ray),
and
is the paraxial incidence height and angle of the principal ray (second auxiliary ray) at the first surface of the optical system,
s
1 is the distance from the first surface of the optical system to the object plane, and
sP
1 is the distance from the first surface of the optical system to the entrance pupil. We can clearly see from previous equations that aberrations of the optical system change in case of the varying object position. The optical system is called aberration-free for a given value of magnification (object position) if all aberration coefficients are zero for this magnification (object position), i.e. in our case we have
.
Furthermore, we will focus on a special case which is interesting both from the theoretical and practical point of view. We will analyze the changes of aberrations with respect to the change in object’s position (change in magnification of the optical system) for the optical system which is aberration-free for objects at infinity. Assume now that we have the optical system, which is aberration-free for imaging the object at infinity and for the selected spectral range, i.e.
If we use
Eq. (1) and set
, then the transverse ray aberrations
,
, which are induced by the change in position of the object, can be expressed as
where
AX = xP
1/
p
1 and
AY = yP
1/
p
1 are numerical apertures (in air) in the object space,
w is the angle of field of view (tan
w =
y/p
1), and
n = n' = 1 (the most common situation in practice). The coefficients
are given by
Equations (4) and
(5) are very interesting because these formulas show that aberration properties (aberration coefficients) of the considered optical system (aberration-free for the object at infinity) depend only on the focal length
f', the angular magnification
g, and the angular magnification
gP
between pupils of the optical system. In order to determine the shift of “geometric-optical energy centre of the circle of confusion” we calculate mean values of transverse ray aberrations. The mean values (centroid of the spots)
and
of the transverse ray aberrations
and
(
Eq. (3) are given by [
8
C. G. Wynne, “Primary aberrations and conjugate change,” Proc. Phys. Soc.
65B, 429–437 (1952).
]
where we used the following relations
(
r, φ) are polar coordinates in entrance pupil plane,
R is entrance pupil radius, and
As one can see from
Eq. (7) the position of the centroid is affected only by coma coefficient in our case. Other aberration coefficients have no influence on the position of the centroid because integrals
(6) and
(7) of terms corresponding to aberration coefficients
and
are zero. However, aberration coefficients affect the radius of gyration over the exit pupil. This problem is treated in more detail in [
11
J. B. Develis, “Comparison of methods for image evaluation,” J. Opt. Soc. Am.
55(2), 165–173 (1965). [CrossRef]
]. Concerning distortion one can clearly see from
Eq. (3) that aberration coefficient
SV
is zero for all positions of the object. The expression
gives the maximum numerical aperture of the optical system in the object space, and
F
0 is the
f-number of the optical system for the object at infinity. We can write
where
y is the size of the object,
s
1 is the object distance, and
sP
is the position of the entrance pupil. We obtain for the relative error of measurement
where we denoted
m is the transverse magnification of the optical system (
m = 1/
g),
mP
is the transverse magnification in pupils of the optical system (
mP
= 1/
gP
), and
y´ is the image size. In measurement practice, optical systems with a telecentric path of the principal ray (object-side, image-side and double-sides telecentric lenses) are frequently used. Telecentric optical systems are treated in detail in [
12]. In case of image-side telecentric optical system (
), e.g. objective lenses of CCD cameras, we obtain for such systems from
Eq. (10)Formulas (9) and
(10) are very useful for estimation of measurement errors caused by varying object distance. These errors cannot be explained using the paraxial approximation. We had used the third-order aberration theory and obtained resulting formulas that have shown the magnitude of errors, both generally and for a special case of optical system without aberrations. In the last case (optical system without aberrations for objects at infinity) derived equations present approximate estimation of measurement accuracy obtained with optical instruments. Presented formulas extend knowledge in the field of optical metrology and other areas, such as geodesy, photogrammetry, fringe projection methods, etc. It is clear from the presented results that these errors have physical character and cannot be removed, but their influence can be reduced by a proper design of optical measuring systems. This can be done as follows. Firstly, we choose the range of angular magnification
for which we want to optimize the optical system. We can write
Eqs. (1) and
(2) for several values of angular magnification
. Then, using the least squares method we can calculate the values of the aberrations coefficients
that minimize aberrations in the given range of the change of magnification
g. Using aberrations coefficients
we can calculate parameters of the optical system. These parameters then serve as a starting state (predesign) for further optimization of the optical design of the optical system using optical design software, such as OSLO, ZEMAX, CODE V, OPTALIX. The procedure is the same as in the Ref [
10
A. Mikš, J. Novák, and P. Novák, “Method of zoom lens design,” Appl. Opt.
47(32), 6088–6098 (2008). [CrossRef] [PubMed]
]. The problem is also solved in a different way by Walther [
9
A. Walther, “Irreducible aberrations of a lens used for a range of magnifications,” J. Opt. Soc. Am. A
6(3), 415–422 (1989). [CrossRef]
] who used mock ray tracing and numerical optimization in his work.
Equations (9) and
(10) are also interesting from a theoretical point of view because they say that if the optical system is aberration-free (
,
,
,
,
,
) for the object at infinity, then the relative measurement error does not depend (within the validity of the third-order aberration theory) on the type and composition of the considered optical system, but it depends only on its transverse magnification
m, transverse magnification
mP
between pupils, and
f-number
F
0. Using
Eq. (9) we can calculate the ratio of the mean value (centroid) of transverse ray aberration to the image size in dependence on the transverse magnification
m of the optical system.
3. Influence of wavelength on measurement accuracy
Let us now deal with the influence of the wavelength of light
λ on the accuracy of measurement. It is well known that aberrations of the optical system vary with the wavelength of light, i.e. the optical system has chromatic aberration [
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
–
5W. T. Welford, Aberrations of the Symmetrical Optical Systems (Academic Press, 1974).
,
7
M. Herzberger, “Theory of image rrrors of the fifth order in rotationally symmetrical systems. I,” J. Opt. Soc. Am.
29(9), 395–406 (1939). [CrossRef]
]. Assume that we have the optical system, which is aberration-free for imaging the object at infinity and for the selected spectral range
(e.g. visible spectral range), i.e.
In order to analyze the influence of chromatic aberration of the optical system on measurement accuracy for different object positions (object located at finite distance from the optical system), we have to differentiate
Eq. (10). After some calculations we obtain
where functions
and
are given by the following expressions
is the lateral chromatic aberration of the optical system for the transverse magnification
m, and
is the lateral chromatic aberration in pupils of the optical system for the transverse magnification
mP
. Further, we focus on calculation of the lateral chromatic aberration
of the optical system. Assume that the image and object space is air (
). It is known from the theory of chromatic aberrations that longitudinal chromatic aberration
and lateral chromatic aberration
can be expressed by the following general formulas [
4H. A. Buchdahl, An Introduction to Hamiltonian Optics (Cambridge University Press, 1970).
,
5W. T. Welford, Aberrations of the Symmetrical Optical Systems (Academic Press, 1974).
]
where
CI
is the coefficient of longitudinal chromatic aberration,
CII
is the coefficient of lateral chromatic aberration, and
is the Lagrange-Helmholtz invariant. It holds that
where
and
are chromatic aberration coefficients for the object at infinity (
),
and
are pupil chromatic aberration coefficients.
Assume now that we have an optical system, which is chromatic aberration-free for imaging the object at infinity and for the selected spectral range, i.e. (,). Then, we obtain from previous relations
We can derive for the chromatic aberration
in pupils
where
Equation (21) enables to analyze the influence of chromatic aberrations of the optical system (aberration corrected for the object at infinity:
,
) on measurement accuracy for different object positions (object located in a finite distance from the optical system). If chromatic aberrations in pupils of the optical system are corrected, i.e.
, then we obtain from
Eq. (21) that
. In such case the optical system is achromatic and the measurement accuracy is not dependent on wavelength.
The relationship between the wave aberration and the ray aberration is given by [
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
–
5W. T. Welford, Aberrations of the Symmetrical Optical Systems (Academic Press, 1974).
]
where
X and
Y are normalized coordinates of the intersection of the ray with the reference sphere in the image space,
F is the
f-number of the optical system,
AX
and
AY
are numerical apertures in the direction
x and
y in the object space. The wave aberration can be then calculated by integration, i.e.
where the transverse ray aberrations
are given by
Eq. (4).
4. Dependence of image quality on object position change
Considering wave properties of light and the finite size of optical systems, the image of the point in the object plane is the diffraction pattern in the image plane. The response of the optical system to the point signal is called the point spread function (PSF) [
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
,
10
A. Mikš, J. Novák, and P. Novák, “Method of zoom lens design,” Appl. Opt.
47(32), 6088–6098 (2008). [CrossRef] [PubMed]
,
11
J. B. Develis, “Comparison of methods for image evaluation,” J. Opt. Soc. Am.
55(2), 165–173 (1965). [CrossRef]
,
13E. L. O'Neill, Introduction to Statistical Optics (Addison-Wesley Inc., 1963).
–
19
V. N. Mahajan, “Zernike polynomials and optical aberrations,” Appl. Opt.
34(34), 8060–8062 (1995). [CrossRef] [PubMed]
]. The shape of the PSF, i.e. the energy distribution in the diffraction pattern, depends on the position of the point in the object plane and on the distance from the optical system to the object plane. If wave aberration changes due to the variation of the object position, the shape and position of the peak of point spread function will also change and measurement errors occur.
Assuming the case that aberrations are small, then we can take as a criterion of the quality of optical systems for imaging the point object the normalized intensity in the peak of the diffraction pattern (Strehl definition, Strehl ratio) [
10
A. Mikš, J. Novák, and P. Novák, “Method of zoom lens design,” Appl. Opt.
47(32), 6088–6098 (2008). [CrossRef] [PubMed]
,
11
J. B. Develis, “Comparison of methods for image evaluation,” J. Opt. Soc. Am.
55(2), 165–173 (1965). [CrossRef]
,
20
A. J. E. M. Janssen, S. van Haver, J. J. M. Braat, and P. Dirksen, “Strehl ratio and optimum focus for high-numerical-aperture beams,” J. Eur. Opt. Soc. Rapid Publ.
2, 07008 (2007). [CrossRef]
] that is defined as the ratio of the maximum of the point spread function of the real optical system to the maximum of the point spread function of the diffraction limited system (optical system without aberrations). It is well known [
10
A. Mikš, J. Novák, and P. Novák, “Method of zoom lens design,” Appl. Opt.
47(32), 6088–6098 (2008). [CrossRef] [PubMed]
,
11
J. B. Develis, “Comparison of methods for image evaluation,” J. Opt. Soc. Am.
55(2), 165–173 (1965). [CrossRef]
,
13E. L. O'Neill, Introduction to Statistical Optics (Addison-Wesley Inc., 1963).
–
15
W. B. King, “Dependence of the Strehl ratio on the magnitude of the variance of the wave aberration,” J. Opt. Soc. Am.
58(5), 655–661 (1968). [CrossRef]
] that the Strehl ratio of the optical system with small aberrations can be expressed in the form
where
E
0 is the variance of wave aberration and
where
W is wave aberration,
k
0 = 2π/λ
0, and λ
0 is the wavelength of light in the vacuum. With respect to the Strehl definition, we consider the optical system to be equivalent to the diffraction limited system if the Strehl definition is higher than 0.8 (
). Wave aberrations can be expressed e.g. using Zernike polynomials [
11
J. B. Develis, “Comparison of methods for image evaluation,” J. Opt. Soc. Am.
55(2), 165–173 (1965). [CrossRef]
,
16
V. N. Mahajan, “Strehl ratio for primary aberrations: some analytical results for circular and annular pupils,” J. Opt. Soc. Am.
72(9), 1258–1266 (1982). [CrossRef]
–
19
V. N. Mahajan, “Zernike polynomials and optical aberrations,” Appl. Opt.
34(34), 8060–8062 (1995). [CrossRef] [PubMed]
] or Seidel polynomials [
5W. T. Welford, Aberrations of the Symmetrical Optical Systems (Academic Press, 1974).
,
9
A. Walther, “Irreducible aberrations of a lens used for a range of magnifications,” J. Opt. Soc. Am. A
6(3), 415–422 (1989). [CrossRef]
]. In our work we will use Seidel aberration polynomials for further analysis. The wave aberration
W of the third order can be expressed as
where
r and φ are polar coordinates at the exit pupil sphere of the rotationally symmetric optical system,
W
11 is the coefficient of tilt,
W
20 is the coefficient of defocus,
W
40 is the coefficient of the third-order spherical aberration,
W
31 is the coefficient of the third-order coma, and
W
22 is the coefficient of the third-order astigmatism. Aberration coefficients can be calculated from the following formulas [
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
,
11
J. B. Develis, “Comparison of methods for image evaluation,” J. Opt. Soc. Am.
55(2), 165–173 (1965). [CrossRef]
]
where
s
0 is longitudinal defocus,
y
0 is the transverse defocus, δ
yz
´ is distortion, δ
sK
´ is spherical aberration for the aperture ray that passes through the edge of the entrance pupil, δ
st
´ is meridional astigmatism, δ
ss
´ is sagittal astigmatism,
is meridional coma,
F is the
f-number of the optical system, and
n´ is the index of refraction in the image space. By substitution of
Eq. (26) into
Eq. (25) we obtain
Aberration coefficients
W
11 and
W
20 are unrestrained parameters that express the coordinates of the centre of the reference sphere. The position of the optimum image point [
11
J. B. Develis, “Comparison of methods for image evaluation,” J. Opt. Soc. Am.
55(2), 165–173 (1965). [CrossRef]
], i.e. the point with the minimum
E
0 and maximum Strehl definition, can be calculated from the necessary conditions for the extremum of function
E
0The solution of these equations can be expressed as [
11
J. B. Develis, “Comparison of methods for image evaluation,” J. Opt. Soc. Am.
55(2), 165–173 (1965). [CrossRef]
]
Formula (29) is valid only for the optimum image point. By substitution of
Eq. (27) for coefficients
into
Eq. (29) one can calculate the position (coordinates
s
0,
y
0), where the Strehl definition is maximum with respect to the paraxial image point. It holds
The detailed calculation of aberration coefficients is described in Ref [
3A. Miks, Applied Optics (Czech Technical University Press, 2009). [PubMed]
–
5W. T. Welford, Aberrations of the Symmetrical Optical Systems (Academic Press, 1974).
]. We can see that one needs to know aberration coefficients
W
11 and
W
31 for analysis of the error of measurement with the optical system. Using
formula (29) we have
where
Formulas (32) and
(33) are showing the influence of the shift of the point spread function central peak of the optical system under consideration with respect to the position of the point spread function central peak of the ideal optical system.
Equation (30) is the resulting formula for the relative measurement error that is calculated on the basis of the diffraction theory of optical imaging.
We can compare values of function φ(m,mP
) that was calculated on the basis of the geometrical theory of optical imaging and values of function ψ(m,mp
) that was obtained using the diffraction theory of optical imaging. We can see that the less accurate geometrical theory gives larger values of relative measurement errors than more accurate diffraction theory of optical imaging. We obtain by comparison of both functions
One can see that the geometrical theory of optical imaging gives 50% higher estimate of relative measurement errors than the diffraction theory of optical imaging regardless of the value of the transverse magnification m of the optical system.
5. Examples and analysis
We will show how to use the results obtained in the previous section. We focus on optical measuring systems e.g. in survey engineering, where theodolites and level instruments are most widely used for measurements. If we carry out measurements of objects that are situated at very long distances in comparison with the focal length of the objective lens of the measurement instrument, then the value of the transverse magnification
m and similarly the measurement error induced by the change in the object position is practically zero. On the other hand, if one makes measurements of objects that are situated near the measurement instrument, then the value of the transverse magnification is nonzero, and the measurement error increases. Such a situation is very frequent, for example, in survey engineering during measurement of various structural parts and buildings, in photogrammetry, etc. when a high measurement accuracy is required. Objects are situated near the measurement instrument if their distance to the optical system is smaller than the “practical infinity”. The “practical infinity” corresponds to such distance of the axial object point which produces defocus aberration of
W = λ/4 (Rayleigh quarter-wavelength rule). Using
Eq. (27) and Newton's conjugate distance equation we obtain for the “practical infinity” the following formula
For example, the value of “practical infinity” lies at distance
in front of the optical system with the focal length
,
f-number
, and wavelength
. Consider that we want to determine the shortest distance
from the optical system to the object which satisfies the condition that the relative measurement error
is smaller than some given value. Assume that the transverse magnification in pupils is
(the most frequent case in practice). Using
formulas (9),
(10), and Newton's conjugate distance equation, we obtain
where we assumed conditions
and
, which are always satisfied. We can express from the previous formula that
where the negative sign in
Eq. (36) is due to the fact that the object lies in front of the optical system. The relative error of measurement will be smaller than ε for all objects that are positioned in larger distances than
from the optical system.
In order to give some impression about the magnitude of aberration induced measurement errors we will calculate the relative measurement errors for the optical system, which is aberration-free () for an object at infinity. Let the optical system has the f-number F
0 = 5 and the transverse magnification in pupils of the optical system is mP
= 1. The focal length of the optical system is and the image size is .