1. Introduction
Multi-group zoom optical system is a complex structure, especially when zooming ratio is large and when the number of groups becomes larger than three. Kinematics of optical components, their optical power, and their positioning along the axis in extreme settings positions, form a dynamic structure, which is difficult for precision numerical modeling even if thin-component model is applied. Designing of such complex new systems usually start analysis on the structural level. After such structural analysis is completed, further optimization, subsequently performed even with professional optical software, does not contribute significantly to the performance improvements.
Many years ago, such designers as K. Yamaji [
11. K. Yamaji, “Design of zoom lenses”, in Progress in Optics, Vol. 6, (North-Holland 1967), pp.105–170.
], K. Tanaka [
22. K. Tanaka, Paraxial Theory of Mechanically Compensated Zoom Lenses by Means of Gaussian Brackets, Research Report of Canon Inc. (6), (Canon Inc. 1991).
], A. D. Clark [
33. A. D. Clark, Zoom Lenses: Monographs in Applied Optics (7) (Adam Hilger 1973).
], I. I. Pahomov [
44. I. I. Pahomov, Zoom Systems (Mashinostroene, Moskva 1976) – in Russian.
], T. H. Jamieson [
55. T. H. Jamieson, “Thin-lens theory of zoom systems,” Opt. Acta (Lond.) 17(8), 565–584 (1970). [CrossRef]
], etc., made significant contributions to the structural design of zoom systems. The rich knowledge concerning various construction aspects of zoom systems has been summarized by A. Mann in a dedicated textbook [
66. Selected Paper on Zoom Lenses, A. Mann, Editor, Vol. MS 85, (SPIE Optical Engineering Press 1993).
] and some newer achievements in this field were presented during two topical conferences organized by SPIE [
77. S. P. I. E. Proceedings, 2539, Zoom Lenses, A. Mann, Editor, (1995).
,
88. S. P. I. E. Proceedings, 3129, Zoom Llenses II, E. I. Betensky, A. Mann, and I. A. Neil, Editors, (1997).
]. In this respect, books [
99. A. Mann, Infrared Optics and Zoom Lenses, Vol., TT83, (SPIE Press 2009).
] by A. Mann and [
1010. A. Mikš, J. Novák, and P. Novák, “Method of zoom lens design,” Appl. Opt. 47(32), 6088–6098 (2008). [CrossRef] [PubMed]
] by A. Mikš et al. deserve for special attention due to their great contribution to the field.
Initially, simple solutions, usually adjusted to specific applications, based on the purely algebraic methods [
11. K. Yamaji, “Design of zoom lenses”, in Progress in Optics, Vol. 6, (North-Holland 1967), pp.105–170.
], Gaussian brackets [
22. K. Tanaka, Paraxial Theory of Mechanically Compensated Zoom Lenses by Means of Gaussian Brackets, Research Report of Canon Inc. (6), (Canon Inc. 1991).
] or chain fractions [
55. T. H. Jamieson, “Thin-lens theory of zoom systems,” Opt. Acta (Lond.) 17(8), 565–584 (1970). [CrossRef]
], were used for determination of the design of the zoom systems. Simultaneously, applicability of matrix optics [
1111. A. Gerrard and J. M. Burch, Introduction to Matrix Methods in Optics (Dover Publication 1994), Chap.II.
,
1212. G. Kloos, Matrix Methods for Optical Layout, Vol. TT77, (SPIE Press 2007).
] was widely discussed in academic circles, but rather with poor practical results. The authors of this publication quite early recognized the power of the matrix approach [
1313. T. Kryszczyński, M. Leśniewski, and J. Mikucki, “New approach to the method of the initial optical design based on the matrix optics,” 16th Polish-Slovak-Czech Optical Conference on Wave and Quantum Aspects of Contemporary Optics, Proc. SPIE 7141, 71411X1–7 (2008). [CrossRef]
–
1515. T. Kryszczyński and J. Mikucki, “Interactive matrix method for analysis and construction of optical systems with elaborated components,” 17th Slovak-Czech-Polish Optical Conference on Wave and Quantum Aspects of Contemporary Optics, Proc. SPIE 7746, 77461M (2010). [CrossRef]
], and, in this paper, the matrix optics is used for the structural design of the complex multi-zoom systems similar to those presented in publications [
1616. K. Tanaka, “Recent development of zoom lenses,” Proc. SPIE 3129, in Zoom lenses II, ed. E. I. Betensky, A. Mann, I. A. Neil, 13–22 (Sep. 1997).
–
1818. EF lens work III the eyes of EOS, “Sixteen technologies used in high-performance EF lenses,” (Canon Inc. Lens Products Group 2006), pp. 175–177.
], but without any additional conditions or limitations on a number of components.
2. System matrix
As it is known, optical imaging by the single thin component with the power φ and at the distance t-1 measured from the previous component or from a reference plane designated by relative index “-1” can be presented in two steps.
Imaging in the matrix form can be written seemingly in a somewhat artificial matrix form by means of the transfer matrix T and optical power matrix M, both with dimensions of 2x2 elements which depend on parameters of the system according to formulas:
where: α_{-1}, α are ray angles to the optical axis respectively before and behind the component, h_{-1}, h are heights of incidence of the ray on the component, respectively preceded and considered.
where S is the matrix of the single component in a form of product M T with a determinant equal to unity.
If we repeat the substituting process represented by
Eq. (3) for consecutive indexes from
0 to
k + 1, then we obtain
where: T is the transfer matrix for the separation t between neighboring components and between special planes placed at entrance and exit ends of the optical system, M is the power matrix representing the impact of the optical power φ of the component according to the following universally known matrix presentation
3. Working conditions matrix
Working conditions of the optical system are usually expressed on both sides of the system by coordinates of the two characteristic rays called the “aperture ray” and “main field ray”. All major optical properties of the optical system, such as optical power of the system or its part, the magnification (transversal, angular), and optical conjugations in relation the object-image and the input-output pupils, can be derived from the coordinates of these rays. For this reason, these rays are traditionally used in the structural design and design methods considering the third-order optical aberrations.
With the aim of the complete description of imaging by the optical system, we now introduce a new matrix J, built from the coordinates of two paraxial rays at input or output, denoted by the indexes J_{0} and J_{k + 1} according to the following formulas
where: h, α are respectively the height and angle of incidence to the axis of the first ray; y, β are respectively the height and angle of incidence to the axis of the second ray.
If the construction parameters of the optical system are given by means of the matrix
S_{0,k + 1}, then imaging of two characteristic rays passing through this system according to
Eq. (7) should satisfy the following matrix equation
or in a more concise notation we have
Expressed below equality of matrices S and W for complete optical track from 0 to k + 1 is the basis for the structural design of all optical systems including numerous zoom systems
For better understanding, dependences expressed by
Eqs. (5),
(10) and
(11) are illustrated on a diagram of
Fig. 2Fig. 2 Diagram linking construction parameters: transfer matrices T, optical power matrices M of the optical system (matrix S) with working conditions represented by matrices of coordinates of characteristic ray J_{0} and J_{k + 1}.(matrix W)
.
Determinant of matrix J is known in geometrical optics under the name of the generalized Lagrange-Helmholtz’s invariant presented in a form
The invariant (12) is a constant of the system not only for any continuous set of its component but also in any of its cross section as it is noted below
After multiplication of the matrix
J for the plane at any end of the system according to
Eq. (10), the dependence elements of matrix
W can be obtained directly from the coordinates of the characteristic rays
Depending on the type of the optical system, coordinates of the characteristic rays on both ends of the system are either already known or can be calculated. After reduction, ratios of these coordinates calculated form operational parameters for both ends of the system many characteristics such as optical power Φ, transversal m and angular g magnifications can be obtained.
4. Differentiation in relation to construction parameters
The derivative of the matrix S can be obtained for any construction parameter x that participates in production of the matrix S. Partial contribution ΔS due to the change of Δx (using a simplified notation without indexes) can be expressed as follows
In this situation to calculate the impact of the variability of the optical power of the element with index “
i”, the related to this element
M_{i k} in
Eq. (5) has to be replaced by partial derivative of the matrix
∂M_{i}/∂φ_{i} calculated accordingly to
Eq. (18), while the remaining matrices retain their position in the product representing the matrix
M (
Eq. (5).
This approach produces quite straightforward result also for
x = t_{i}, impact of which on the behavior of the complete system is expressed by
Eqs. (6). In this case the partial derivatives of the matrix
T_{i} produces the same constant value for each component with any non-zero separation as shown below
Since the first order derivative of the matrix T in respect to t gives the matrix with constant values the second and higher order derivatives have to produce matrices with zero value elements.
In this work a similar method of calculation of matrix derivative (as that applied by T. B. Andersen [
1919. T. B. Andersen, “Optical aberration functions: derivatives with respect to axial distances for symmetrical systems,” Appl. Opt. 21(10), 1817–1823 (1982). [CrossRef] [PubMed]
] to calculate the derivatives of aberration functions in respect to the axial distances) consisting in independent calculation of derivative of each matrix component, is used.
In
Eq. (21), the partial derivative of the matrix
∂T_{i}/∂ t_{i} according to
Eq. (20) replaces the matrix
T_{i} of
Eq. (5) while the remaining matrices of this equation retain their position.
The change of the matrix
ΔS, in case of change of any two parameters
Δx and
Δy, depends on the mixed partial derivative with respect to these two parameters regardless of the derivative order (Schwarz's theorem). In such situation the derivative of the matrix given by
Eq. (5) by both parameters
x and
y gives the following result
In
Eq. (23), the power matrices
M_{i} and
M_{j} calculated in a way presented by
Eq. (18) are replaced in
Eq. (5) by the corresponding derivative matrices of the same type, while remaining components of
Eq. (5) retain their positions.
The change of the matrix ΔS induced by mixed derivatives of two arbitrarily selected separations Δx = Δt_{i} and Δy = Δt_{j} can be determined from the following formula
In
Eq. (24), transfer Matrices
T_{i} and
T_{j} of
Eq. (5) are replaced by their partial derivatives given by
Eq. (20) while the remaining matrices of
Eq. (5) retain their position.
Change of two parameters is the most typical one in the optical system. A total change of the system matrix ΔS must take into account first and mixed derivatives associated with these parameters. When variability of parameters Δφ_{i} and Δφ_{j} are determined then the exact change ΔS of system matrix S consists of changes related to individual parameters as well as changes being a result of their combined effect as expressed below
Similarly when the changes of separations Δt_{i} and Δt_{j} are selected as the parameters under considerations, then the exact change ΔS of matrix S has to be determined as it is shown below
Finally, when both optical power Δφ_{i} and separation Δt_{j} have to be taken into a consideration, then the exact change ΔS of the matrix S should be determined as expressed below
5. Components kinematics
Fig. 3Fig. 3 The moving component with index i within a fragment of the optical system.
shows the moving component with index
i, which changes two neighboring separations on the same value
z_{i} but with opposite sign.
In this case, combining of
Eqs. (21) and
(28), produces the increase matrix
ΔS consisting of the following three parts:
Regardless of the type of system, two matrix elements only are normally used. One of them is the operational parameter, such as Φ, m or g, and the second is zero selected among matrix elements which determine if the system has optically conjugated (focusing) or non conjugated (non-focusing) characteristic, and which are differently located in the matrix W. To stabilize the image location, it is necessary to have at least two movements z_{i} and z_{j}.
It is possible to determine kinematics of the components directly from the
Eq. (31). As it follows from expression (11) the increments
ΔS of the components of matrix
S are identical with increments
ΔW of the components of matrix
W. This creates a system of two equations, which can be derived from
Eq. (31) taken twice. Unknown movements of two components can be solved by iterative methods for each discrete position.
The fact that there are only two moving components to achieve required stabilization conditions imposes some limitations, which could be eliminated by introducing supplementary moving components with inserted parameters of motion. Movements of such components are described by means of a special function of the parabolic-exponential type having the following form
In summary, zoom optical system have three kind components: numerically determined, supplementary inserted, as well as obviously motionless fixed, but it depends mainly on the result of structural design.
6. Structural optical design of the multi-group zoom system
At the beginning of the design, it should be given a seed of the zoom system. The planned arrangement of components on the axis in two edge settings at minimum and maximum values of the operational parameter and rough optical powers enter into a composition of the seed.
When a zoom system is accepted, then in turn one should plan movements of the components divided into those that will be determined automatically, and those, which will be inserted as complementary movements with initial well-matched parameters. Components kinematics is obtained by processing in graphical and numerical versions. Kinematics cannot always be accepted because components, either their movements interfere with each other, or their minimum distances are too small, or there are too little fluid movements.
If one does not accept the components kinematics, then it may turn out to be enough changes in planed movements, and it is needed to add or to change parameters of complementary movements. If such attempts fail, then one must renew the seed and look for new opportunities to obtain the desired solution.
Transforming the seed into a zoom system is performed by using matrix equations with the following form
where: A is the matrix which elements are derivatives of the system matrix in respect to design parameters and A^{T} is the transposed matrix of A, Δp is the vector of change parameters, and Δw is the vector of selected element changes of the matrix W in accordance with a type of the optical system.
In the case, when the number of components is greater than 4, then the matrix A is a rectangular horizontally matrix. Multiplying left-hand the matrix by the A^{T} ensures obtaining the solution irrespective of the number of components. Number of equations is constant and equal to 4 including two optical conjugations and two functional parameters at extreme arrangements.
In this situation, due to an excess of parameters and non-linearity of equations it can lead to several solutions that depend on a seed and first approximation. In accepting solutions, one must take into account the ray tracings for the assumed aperture and field of view. Using
Eq. (33), we have decided to use only first derivatives and then solving requires iterative process. There is a more complex approach to solving the nonlinear system of equations with participation of mixed derivatives of the second order but it does not gives practical benefits.
Structural optical design is only indispensible beginning the complex optical design. Ultimate acceptation depends on the fine correction in conjunction with optical software.
7. Examples of the multi-group zoom system
Following three examples of multi-group zoom systems, which illustrate describing the structural design of such optical systems, are presented below. All systems have five-components, and a number of moving components is 4 or 5. Components kinematics may seem excessive, but it is connected to a practice.
Example 1 - objective lens
Data structure of the objective lens with zooming ratio 12.5 and the optical power
Φ varying from 1/125 to 1/10 is located in
Table 2Table 2. Data structure of the multi-group zoom objective lens |
| |
and shown in
Fig. 7Fig. 7 Diagram of the multi-group zoom objective lens at the edge arrangements.
. Similarly, moving components determined numerical are denoted indices 2 and 4. Components denoted indexes 1 and 3 play a role of supplements and their parameters of movements (
z_{1} = 29, p_{1} = 20, e_{1} = 0.8, z_{3} = −22.5, e1_{3} = 1.5, p_{3} = 5) are well-matched.
The traditional variator is the supplementary component indexed 2 and the remaining three components are more or less subtle compensators. A line denoted the index 6 coincides with the stabilized image position. We have obtained the objective lens that is characterized by simple smooth movements (see
Fig. 8Fig. 8 Components kinematics of the objective lens.
) and perhaps – by better possibilities to optimize the optical system on later stages of the optical design in comparison with some solutions containing four components.
Example 2 - reproduction system
Data structure of the reproduction lens with zooming ratio 6.25 and the transversal magnification
m varying from −0.4X to −2.4X is placed in
Table 3Table 3. Data structure of the multi-group zoom reproduction system |
| |
and shown in
Fig. 9Fig. 9 Diagram of the multi-group zoom reproduction system at the edge arrangements.
. Moving components determined numerical are denoted by indices 2 and 4. Remaining components play a role of a supplement and their parameters of movements (
z1 = −15, z3 = 35 p3 = −8 and p5 = −15) can be fixed values by means of rough interpolation.
From our assumption, the exact image stabilization (index 6) is obtained by movements of all components. Advantages of this reproduction lens are its compact construction and smooth kinematics (see
Fig. 10Fig. 10 Components kinematics of the reproduction system.
).
Example 3 - telescopic system
Data structure of the telescope system with zooming ratio 12.75 and the positive angular magnification
g varying from 0.28X to 3.57X is located in
Table 4Table 4. Data structure of the multi-group zoom telescopic system |
| |
and shown in
Fig. 11Fig. 11 Diagram of the multi-group zoom telescopic system at the edge arrangements.
. Numerical determined moving components are denoted indices 2 i 4. Remaining components play a role of supplementary and their more complex parameters of movements (
z_{1} = 30, p_{1} = 10, e_{1} = 0.75, z_{3} = 12.5, z_{5} = 40, e1_{5} = 1.6) are inserted the method of trials and errors. The exact non-focusing of the telescope has been achieved movements of all components. The line denoted in index 6 coincides with the imposed exit pupil.
The traditional division of components of the zoom system into variators and compensators may be negligible here. The compact construction and smooth kinematics (see
Fig. 12Fig. 12 Components kinematics of the telescopic system.
) are advantages of this telescope system despite the high zooming ratio.
8. Summary and conclusion
It is quite well known fact that performance of the optical system, designed with the most advanced software depends on capability of the designer to properly formulate the starting structural model of the designed system. Construction of such structural model is particularly difficult in the case of complex optical multifunctional systems such as zoom lenses with a wide range of focal length variation. It is demonstrated that the process of preparation of such preliminary structural data can be dramatically simplified by rigorous application of matrix modeling of such system, enhanced by application of derivative matrices. It is shown that these matrices allow for direct association of the construction parameters of the optical system with the working conditions of the system. Reversing of this association allows to determine precisely the structural composition of the optical system able to perform required functions. For this purpose, differentiation of a system matrix in respect to construction parameters and movements of the components have obtained by a number of new matrix formulas. Properly selected matrix elements allow to produce the system of nonlinear equations with unknown parameters and movements, which can be successfully solved by iterative methods. Once such preliminary structure is determined, the final design can be completed using such optical design programs as Code V or Zemax. This highly formalized approach can be used for structural analysis of any optical system with unlimited number of components and unlimited number of components relocation necessary to achieve required performance of the optical system such as zoom lenses with complex movement of the optical components, complex imaging systems, telescopes and so on.
The applicability of the proposed method is illustrated with a number of examples, which include the zoom system with image stabilization achieved by moving two components only and which could be further improved with a supplementary element with linear translation.
As illustrated, the developed methodology covers various types of optical systems such as the objective, reproductive, and telescope systems and is particularly useful for design of the optical systems with mechanical or electronic compensation of the image position.