1. Introduction
Optical systems that do not have an axis of rotational symmetry have been and continue to be of interest in optical design. In his Ph. D. dissertation, Buchroeder [
1
R. A. Buchroeder, “Tilted component optical systems,” Ph.D. dissertation (University of Arizona, Tucson, Arizona, 1976).
] described a class of non-axially symmetric systems, called “tilted component optical systems” constructed from axially-symmetric components. These components may be tilted about their nodal points in such a way that a particular ray, which defines the reference axis, remains undeviated. The aberrations of tilted component optical systems then can be expressed using Shack’s [
2
R. V. Shack, “Aberration theory, OPTI 514 course notes,” College of Optical Sciences, University of Arizona, Tucson, Arizona.
] vector aberration function where the field vector
H⃗ is modified by a displacement term to account for each component tilt. With vector notation, the final system aberration fields can be found and analyzed as shown by Thompson [
3
K. P. Thompson, “Description of the third-order optical aberrations of near-circular pupil optical systems without symmetry,” J. Opt. Soc. Am. A,
22, 1389–1401 (2005). [CrossRef]
,
4
K. P. Thompson, “Aberration fields in tilted and decentered optical systems,” Ph.D. dissertation (University of Arizona, Tucson, Arizona, 1980).
].
A plane symmetric system is a system that has a plane of symmetry: that is, one half of the system is a mirror image of the other. Axially symmetric and double-plane symmetric systems belong to the class of plane symmetric systems. Sasian [
5
J. M. Sasian, “Imagery of the Bilateral Symmetric Optical System,” Ph.D. dissertation (University of Arizona, Tucson, Arizona,
1988).
,
6
J. M. Sasian, “How to approach the design of a bilateral symmetric optical system,” Opt. Eng.
33, 2045 (1994).
] developed an aberration function for describing plane symmetric optical systems that uses a vector
i⃗ to define the direction of plane symmetry. The question is: can the plane symmetric formalism be extended to a combination of plane symmetric systems which do not necessarily share the same orientation for their respective planes of symmetry? Using the vector notation developed by Thompson [
3
K. P. Thompson, “Description of the third-order optical aberrations of near-circular pupil optical systems without symmetry,” J. Opt. Soc. Am. A,
22, 1389–1401 (2005). [CrossRef]
,
4
K. P. Thompson, “Aberration fields in tilted and decentered optical systems,” Ph.D. dissertation (University of Arizona, Tucson, Arizona, 1980).
] we extend the plane symmetric formalism to a combination of plane symmetric systems. The final, or global, system is not necessarily plane symmetric. Instead it may not have any identifiable symmetry.
The result of the work in this paper is a more general theory for non-axially symmetric systems than “tilted component optical systems” given that the component systems are not restricted to being axially symmetric. For example, this theory can be applied to both systems comprised of off-axis aspheres, as long as a plane of symmetry can be defined, and tilted plane symmetric optical components if they are tilted in the plane of symmetry. However this theory is not applicable to all asymmetric systems. It cannot be applied to systems with components that can not be simplified as plane symmetric.
There are several methods to concatenate asymmetric systems which can be used to combine plane symmetric systems. For example, Andrews [
7
M. Andrews, “Concatenation of characteristic functions in Hamiltonian optics,” J. Opt. Soc. Am.
72, 1493–1497 (1982). [CrossRef]
], Forbes [
8
G. Forbes, “Concatenation of restricted characteristic functions,” J. Opt. Soc. Am.
72, 1702–1706 (1982). [CrossRef]
,
9
B. Stone and G. Forbes, “Foundations of first-order layout for asymmetric systems: an application of Hamilton’s methods,” J. Opt. Soc. Am.
A 9, 96–109 (1992). [CrossRef]
] and Stone [
9
B. Stone and G. Forbes, “Foundations of first-order layout for asymmetric systems: an application of Hamilton’s methods,” J. Opt. Soc. Am.
A 9, 96–109 (1992). [CrossRef]
] have developed methods for the concatenation of asymmetric systems using Hamiltonian methods. Our concatenation methodology differs in that it is based in component system rotation about an optical axis ray.
In addition to these methods, modern commercial lens design programs are capable of analyzing combinations of optical systems. Our approach provides an intuitive understanding of the combination of plane symmetric systems that can guide a lens designer using commercial lens design software. We show which of the aberrations are dependent on the orientation of the plane of symmetry. And we show that for these aberrations, the combined system aberration is simply the sum of the i vectors, denoting the direction of the plane of symmetry of the subsystem, weighted by the aberration coefficient for each subsystem. Conceptually it follows that a given aberration can be canceled by adding an equal and opposite amount of the aberration if the orientation of the plane of symmetry is chosen properly. It also follows that the aberration cannot be canceled if the plane of symmetry orientations are not chosen properly. To aid in this conceptual understanding, we have included figures for each of the aberrations and have grouped similar types of aberration fields.
For completeness, we find the nodes of each of the aberration fields. This provides a conceptual example of how the aberration fields for a combination of plane symmetric systems contributes to the overall system aberrations. We build on previous work [
3
K. P. Thompson, “Description of the third-order optical aberrations of near-circular pupil optical systems without symmetry,” J. Opt. Soc. Am. A,
22, 1389–1401 (2005). [CrossRef]
,
4
K. P. Thompson, “Aberration fields in tilted and decentered optical systems,” Ph.D. dissertation (University of Arizona, Tucson, Arizona, 1980).
] in aberration theory, add graphics to help clarify concepts, and contribute to the theory by including plane symmetric component systems. Specifically the new contributions to the theory are: 1) component system addition is specified by angular displacement rather than by component tilt, resulting in an alternate view of system concatenation, 2) a wider class of optical systems regarding the symmetry of the component systems can be treated, 3) we point out and discuss the occurrence of line nodes, and 4) the graphics contribute to the understanding of vector aberration theory.
2. Aberration function and fields
The wave aberration function for a rotationally symmetric optical system in vector form [
2
R. V. Shack, “Aberration theory, OPTI 514 course notes,” College of Optical Sciences, University of Arizona, Tucson, Arizona.
] is
where
H⃗ is the field vector and
ρ⃗ is the aperture vector, as shown in
Fig. 1.
Fig. 1. Conventions for the plane symmetry, field and aperture vectors.
The aberration function can be modified for a plane symmetric system [
5
J. M. Sasian, “Imagery of the Bilateral Symmetric Optical System,” Ph.D. dissertation (University of Arizona, Tucson, Arizona,
1988).
]:
where
i⃗ is a unit vector that specifies the direction of plane symmetry. The indices
k,
m,
n,
p and
q are integer numbers. On the right hand side of
Eq. (2), the
W’s represent the aberration coefficients and convey the magnitude of a given aberration. Note that the first subscript 2
k+
n+
p is the algebraic power of the field vector
H⃗, and the second subscript 2
m+
n+
q is the algebraic power of aperture vector
ρ⃗. To combine several systems, one can use a field displacement term
H⃗-
σj
for each of the tilted component systems as Thompson did. We instead combine several plane symmetric systems using the vector
i⃗
j
that indicates the relative orientation among each of the
j plane symmetric systems.
The aberration function describes the aberrations about the optical axis. The center of the field of view, the center of the aperture stop and the pupils lie on the optical axis ray [
1
R. A. Buchroeder, “Tilted component optical systems,” Ph.D. dissertation (University of Arizona, Tucson, Arizona, 1976).
]. Optically, the optical axis ray is a straight line; that is, looking at the optical axis from image space, the axis appears like a straight line, while in reality it is a ray that is reflected, refracted or diffracted by the system surfaces. In effect, the system can be unfolded such that the optical axis ray is a straight line. The vector
i⃗ is perpendicular to the optical axis ray. Parametric expressions for the aberration coefficients of a plane symmetric system are given by Sasian [
5
J. M. Sasian, “Imagery of the Bilateral Symmetric Optical System,” Ph.D. dissertation (University of Arizona, Tucson, Arizona,
1988).
,
6
J. M. Sasian, “How to approach the design of a bilateral symmetric optical system,” Opt. Eng.
33, 2045 (1994).
]. Using the notation established by Sasian, the aberration function for a single plane symmetric system up to fourth-order is:
The total or global aberration function of a number
j of plane symmetric systems with relative orientation on
i⃗
j
and with aberration coefficients
W
2k+n+p,2m+n+q,n,p,q,j
is the sum of the individual aberration functions. The fact that the total aberration is still a sum of the individual surface aberrations, even when there are tilted or decentered components, was discussed by Buchroeder [
1
R. A. Buchroeder, “Tilted component optical systems,” Ph.D. dissertation (University of Arizona, Tucson, Arizona, 1976).
]. The sum to fourth-order is:
This combination of plane symmetric systems still shares the same optical axis ray. The relative orientation of each of the systems is conveyed by the vectors
i⃗
j
.
Figure 2 shows an example of possible orientations of these vectors as seen looking down the optical axis ray.
Fig. 2. Possible orientation of vectors i⃗
j
as seen looking down the optical axis. Notice that the vectors are all unit vectors in different directions. (The optical axis is in and out of the page at (x,y)=(0,0).)
To organize the aberrations in
Eq. (4), we first define the variables listed in
Table 1. These variables label each of the aberration coefficients by the aberration type and field dependence. This table also shows which of the aberrations have no dependence on
i⃗ or are dependent on
i⃗ or
i⃗
2. It shows how the aberration coefficients from each component are combined into the system aberration coefficient. For the aberrations that depend on
i⃗, the combined system aberration coefficient is simply the sum of the aberration coefficient for the individual components multiplied by the
i⃗ vector denoting the orientation of plane of symmetry. In general, the plane of symmetry for each aberration coefficient of the combined system will have a different plane of symmetry so the combined
i⃗ vectors are redefined. To minimize the combined system aberrations, either the individual subcomponent aberration coefficients can be minimized or, like with axially symmetric systems, the combined aberration can be reduced by balancing aberrations with equal but opposite amounts of the individual component aberrations, as long as the proper orientation of the individual planes of symmetry is chosen.
Table 1. Coefficient and vector definitions
| Uniform Piston:
| Field Displacement:
| Linear Piston:
|
| Defocus:
| Magnification:
| Quadratic Piston I:
|
| Defocus from Constant Astigmatism:
| Magnification from Anamorphism:
| Quadratic Piston IIa:
|
| Constant Astigmatism:
| Anamorphism:
| Quadratic Piston IIb:
|
| Constant Coma:
| Linear Astigmatism:
| Field Tilt:
|
| Quadratic Distortion I:
| Quadratic Distortion II:
| Cubic Piston:
|
| Spherical Aberration:
| Linear Coma:
| Quadratic Astigmatism:
|
| Field Curvature:
| Cubic Distortion:
| Quartic Piston:
|
In
Table 1, the terms “constant,” “uniform,” “linear,” “quadratic” and “cubic” describe the algebraic power of the field dependence (first subscript). (We chose the name “uniform piston” for the aberration that would typically be called “constant piston” in this paper so that the subscript would be different from cubic piston, which uses “cp”.)
Each time the direction unit vector
i⃗ occurs in
Table 1, it is always normalized to one. For example, in the constant coma case, we have:
Therefore, the W coefficient accounts for the entire weight, such as in the constant coma term:
In order to better illustrate the field dependence of the aberration function in
Eq. (4), we group the aberration coefficients by their dependence on the aperture vector
ρ⇀. As shown in
Table 2 this groups the aberration function into six aberration fields: spherical aberration, coma, astigmatism, defocus, distortion, and piston.
Table 2. Aberration fields of a combination of plane symmetric systems
| Field of Piston |
|
| Field of Distortion |
|
| Field of Defocus | |
| Field of Astigmatism |
|
| Field of Coma |
|
| Field of Spherical Aberration |
|
Some of the terms in
Table 2, involve the product of vectors, such as
i⃗
H⃗ or
H⃗
2. This operation, named “vector multiplication” by Shack [
2
R. V. Shack, “Aberration theory, OPTI 514 course notes,” College of Optical Sciences, University of Arizona, Tucson, Arizona.
], is different from both a vector dot product and a vector cross product. If we have two vectors
A⇀ and
B⇀ expressed as
then the vector product A⇀B⇀ is defined as:
The conjugate of a vector is a reflection of the vector across the y-axis:
The following vector identities were also used to generate
Table 2 [
3
K. P. Thompson, “Description of the third-order optical aberrations of near-circular pupil optical systems without symmetry,” J. Opt. Soc. Am. A,
22, 1389–1401 (2005). [CrossRef]
]:
For an explanation of vector multiplication, see Thompson [
3
K. P. Thompson, “Description of the third-order optical aberrations of near-circular pupil optical systems without symmetry,” J. Opt. Soc. Am. A,
22, 1389–1401 (2005). [CrossRef]
].
The vector identities shown in
Eqs. (11) and
(12) are used to split the anamorphic distortion and each of the astigmatism terms into a combination of two aberrations. This is discussed in the appendix.
The aberrations discussed here are to the fourth-order of approximation, as discussed by Sasian [
5
J. M. Sasian, “Imagery of the Bilateral Symmetric Optical System,” Ph.D. dissertation (University of Arizona, Tucson, Arizona,
1988).
], and do not include higher-order terms. For example, terms like cylindrical field curvature are not considered. The use of higher-order terms would impact the nature of the aberration fields and their point and nodal lines where they vanish.
4. Aberration field nodes
Since each of the aberration fields listed in
Table 2 is the sum of multiple terms, each with a different field dependence, it is possible that these terms sum to zero at certain locations in the field. These special locations are called field nodes and geometrically can be points, lines or circles. Following the work of Thompson [
3
K. P. Thompson, “Description of the third-order optical aberrations of near-circular pupil optical systems without symmetry,” J. Opt. Soc. Am. A,
22, 1389–1401 (2005). [CrossRef]
,
4
K. P. Thompson, “Aberration fields in tilted and decentered optical systems,” Ph.D. dissertation (University of Arizona, Tucson, Arizona, 1980).
] for tilted component systems, we find the nodes for a combination of plane symmetric systems. In many cases the location of the nodes is similar, but because we use different vector definitions than Thompson the equations used to find the nodes are not always the same.
To find the nodes in the field of view, we treat each aberration field independently. Notice that the pupil dependence for each field in
Table 2 has been factored out. This leads to a term in brackets that depends only on the field vector, plane of symmetry vectors, and the aberration coefficients. In order to find the location of the nodes, we set this bracketed term equal to zero and solve for
H⃗ as a function of the aberration coefficients and
i.
4.1 Spherical aberration
The field of spherical aberration is uniform; it does not vary as a function of the field of view, so there are no field nodes.
4.2 Coma
The field of coma can be linear or constant with respect to the field of view:
Thus, there can be one field node:
4.3 Astigmatism
The magnitude of the astigmatism can be uniform, linear or quadratic as a function of the field of view. This field can have one or two nodes where the astigmatism vanishes. Shack [
2
R. V. Shack, “Aberration theory, OPTI 514 course notes,” College of Optical Sciences, University of Arizona, Tucson, Arizona.
] was the first to recognize that the field of astigmatism could have two nodes and he called this case binodal astigmatism. In the presence of constant, linear, and quadratic astigmatism the node positions satisfy the equation:
The locations of the nodes are at the two field points that can be found solving
Eq. (15) and are given by:
There are some special cases where the location of the node is simplified:
1) If there is no constant or linear astigmatism (Wca
=0 and Wla
=0), there is one field node at the optical axis ray (H⃗=0).
2) When there is no constant astigmatism (Wca
=0), there are two nodes:
3) If there is no linear astigmatism (Wla
=0), then there are two nodes at:
4) If there is no quadratic astigmatism (Wqa
=0), the field of astigmatism is linear in field and there is only one node located at:
4.4 Defocus
The nodes of defocus can be point nodes, line nodes, or circle nodes. A single point node is a circle node of radius zero. The defocus nodes must satisfy the following equation:
Note that the astigmatism terms in the field of defocus describe the defocus required to move from the image plane to the medial astigmatic surface.
A line node will occur if the quadratic astigmatism term is balanced by the field curvature term (Wqa
=-Wfc
). As an example, if the defocus also balances the defocus from constant astigmatism (Wd
=-Wdca
), then the line node satisfies the following equations:
Thus, the line node occurs when the field vector H⃗ is perpendicular to the vector Wft
i
⇀
ft
+Wla
i
⇀
la
. If the two defocus terms do not cancel, then
The line node will shift and it no longer crosses through the center of the field of view. It is possible to shift the node outside the field of view. An example of a line node is plotted in
Fig. 3.
Fig. 3. A line node in the field of defocus from a combination of field tilts. (Color online: Red is focused before the image plane.)
However in general, the nodes from defocus will be circular. The values of the coefficients and the directions of the
i vectors will determine the location of the circle and where it is centered.
Figure 4 shows an example circular node in the field of defocus.
Fig. 4. A circular node in the field of defocus from a combination of field curvature and constant defocus. (Color online: Red is focused before the image plane.)
If the field tilt is balanced by the linear astigmatism (Wft
i
⇀
ft
=-Wla
i
⇀
la
), then the circular node is centered on the on-axis field point. The radius of this circular node is derived as follows:
Note that the quantity in parenthesis must be negative or else the field radius where the circular node exists will be imaginary and there will not be a circular node at all.
4.5 Distortion
Distortion is purely a mapping error which does not cause blurred images. Therefore it is possible to correct distortion with post-processing of the image. There are still applications where post-processing is not possible or distortion should be minimized to reduce the error in the post-processing. For this reason we describe the types of nodes found in the field of distortion. The distortion nodes satisfy the following equation:
Table 5 shows some sample distortion vector plots with two, three, and four nodes and list the contributing coefficient amounts.
Table 5. Some distortion plots showing 2, 3, and 4 nodes. The surface maps represent the magnitude of the distortion. The points where the surfaces meet the plane are the nodes. In the vector plots, the vectors represent both the magnitude and direction of the mapping distortion. The shading represents the magnitude of the distortion. Darker shades have less distortion. All i vectors used in creating these graphs are pointing to the right.

As with the field of defocus, it is also possible to get circular nodes and line nodes with the field of distortion. For example, if all other terms cancel except field displacement and quadratic distortion I, then the field of distortion simplifies to
This leads to a circular node (shown in
Table 6), with a radius derived as follows:
Like with the case of circular defocus nodes, Wfd
and WqdI
must have opposite signs or the radius of the circle node will be imaginary and there will not be a node in the field.
Another circular node can be created from magnification and cubic distortion. This set also has a node on axis. If:
then there is one node on axis and a circular node (shown in
Table 6) at
Again the term (Wm
+Wma
) and Wcd
must have different signs for there to be a node.
A line node in the distortion field may be found when there is no cubic distortion. One simple example of a line node is the case of quadratic distortion II. Additionally, by adding an equal, but opposite, amount of quadratic distortion I to quadratic distortion II, (in effect WqdI
=-WqdII
and i⃗
qdI
=i⃗
qdII
) a line node will be created. A line node may also be found by adding complementary amounts of magnification and anamorphism. It is also possible to get a line node and a point node using a combination of quadratic distortion II and magnification.
Table 6. Some distortion plots showing line and circular nodes. The surface maps represent the magnitude of the distortion. In the vector plots, the vectors represent both the magnitude and direction of the mapping distortion. The shading represents the magnitude of the distortion. Darker shades have less distortion. All i vectors used in creating these graphs are pointing to the right.
