3.6 Summary of Third Order Aberrations
At this stage it is useful to summarise the five Gauss-Seidel aberrations in terms of the pupil and field dependence of their OPD and ray fans. For all Gauss-Seidel aberrations, the order of the pupil dependence and the order of the field angle dependence sum to four (for the OPD). It is important for the reader to understand how the different types of aberration vary with both pupil size and field angle. For example, in many optical systems, such as telescopes and microscopes, the range of field angles tends to be significantly smaller than the angles subtended by the pupil. Therefore, for such instruments, those aberrations with a higher-order pupil dependence, such as spherical aberration (fourth order in p) and coma (third order in p), will predominate.
3.6.1 OPD Dependence
The list below sets out the WFE dependence of the five Gauss-Seidel aberrations on pupil function, p, and field angle, θ.
● Spherical Aberration: ΦSA ∝ p⁴
● Coma: ΦCO ∝ p³θ
● Field Curvature: ΦFC ∝ p²θ²
● Astigmatism: ΦAS ∝ p²θ²
● Distortion: ΦDI ∝ pθ³
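The practical consequence of these exponents can be illustrated with a short numerical sketch. The dictionary below simply transcribes the pupil and field orders from the list above; the function name opd_scale is an illustrative choice, not something defined in the text.

```python
# (pupil order, field order) of the OPD for each Gauss-Seidel aberration,
# transcribed from the list above.
OPD_ORDERS = {
    "spherical aberration": (4, 0),
    "coma": (3, 1),
    "field curvature": (2, 2),
    "astigmatism": (2, 2),
    "distortion": (1, 3),
}


def opd_scale(name, pupil_factor, field_factor):
    """Factor by which the OPD of one aberration changes when the pupil
    size and the field angle are multiplied by the given factors."""
    pupil_order, field_order = OPD_ORDERS[name]
    return pupil_factor ** pupil_order * field_factor ** field_order


for name, (pupil_order, field_order) in OPD_ORDERS.items():
    # The two orders always sum to four for the OPD.
    assert pupil_order + field_order == 4
    print(f"{name}: doubling the pupil scales the OPD by "
          f"{opd_scale(name, 2, 1)}, doubling the field by "
          f"{opd_scale(name, 1, 2)}")
```

Doubling the pupil size multiplies spherical aberration by 16 and coma by 8, whereas doubling the field angle leaves spherical aberration unchanged, which is why these two terms tend to dominate in small-field, large-aperture instruments.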
To quantify each aberration, we can define a coefficient, K, which describes the magnitude (in units of length) of the aberration. As well as normalising the pupil function, we can also normalise the field angle by introducing the quantity, h, which represents the ratio, θ/θ0, of the field angle to the maximum field angle.
(3.37)
(3.38)
(3.39)
(3.40)
(3.41)
The reader should take particular note of the form of Eq. (3.40). The description of astigmatism here is such that the mean defocus over all orientations of the ray fan is taken to be zero. However, other representations adopt the convention that the defocus is zero for the sagittal ray and the balance of the astigmatism is incorporated into the field curvature. That is to say, in these conventions, the astigmatism is taken to be proportional to cos²φ, rather than cos 2φ, as in Eq. (3.40). Of course, in using cos²φ, an average defocus of the same form as field curvature is introduced, hence the reason for adopting the convention used here. If the field curvature and astigmatism were redefined according to the alternative convention, then the following revised description would apply:
(3.42)
(3.43)
3.6.2 Transverse Aberration Dependence
The ray fan or transverse aberration dependence upon pupil function and field angle is such that the orders of the two variables sum to three, as opposed to four for OPD. The dependence of transverse aberration is listed below:
● Spherical Aberration: tSA ∝ p³
● Coma: tCO ∝ p²θ
● Field Curvature: tFC ∝ pθ²
● Astigmatism: tAS ∝ pθ²
● Distortion: tDI ∝ θ³
3.6.3 General Representation of Aberration and Seidel Coefficients
The analysis presented in this chapter has demonstrated the power of using the OPD as a way of describing aberrations. More generally, when expressed as a WFE, it can be used to describe the deviation of a specific wavefront from an ideal wavefront that converges on a specific reference point. As such, this deviation can also be used to describe defocus, which shows a quadratic dependence on pupil function, and tilt, where the WFE is a plane surface that is tilted about the x or y axis (the optical axis being the z axis). The standard representation for describing and quantifying generic WFE and aberration behaviour is shown in Eq. (3.44).
(3.44)
p is the pupil function and h is the object height (proportional to field angle θ); φ is the ray fan angle.
In the general term, Wabc, ‘a’ describes the order of the object height (field angle dependence), ‘b’ describes the order of the pupil function dependence and ‘c’ describes the dependence on the ray fan angle. The defocus and tilt are, of course, paraxial terms. Overall, the dependence of each coefficient is given by Eq. (3.45):
(3.45)
It should be noted that this convention incorporates powers of cosφ, so the astigmatism term contains some average field curvature. Describing each of the aberration coefficients introduced earlier in terms of these coefficients gives the following:
(3.46)
(3.47)
(3.48)
(3.49)
(3.50)
Another convention exists of which the reader should be aware. These are the so-called Seidel coefficients, named after the nineteenth century mathematician, Philipp Ludwig von Seidel, who first elucidated the five monochromatic aberrations. The coefficients are usually denoted SI, SII, SIII, SIV, and SV, referring to spherical aberration, coma, astigmatism, field curvature, and distortion. They nominally quantify the WFE, as the other coefficients do, but their magnitude is determined by the size of the blur spot that the aberration creates. The correspondence of these terms is as follows:
(3.51)
(3.52)
(3.53)
(3.54)
(3.55)
The form of Eq. (3.54) is interesting. When compared to the definition of W220 in Eq. (3.48), an additional amount of astigmatism has been compounded with the field curvature. As such, this new representation of field curvature, SIV, represents a fundamental and important property of an aberrated optical system and is referred to as the Petzval curvature. Its significance will be discussed more fully in the next chapter.
The treatment of aberrations, thus far, has been entirely generic. We have introduced the five Gauss-Seidel aberrations without specific reference to how they are generated at specific optical surfaces and by individual optical components. This will be discussed in detail in the next chapter. The most important feature of this treatment is that the third order aberrations are additive through a system when described in terms of OPD. That is to say, the five aberrations may be calculated independently at each optical surface and summed over the entire optical system. This analysis is an extremely powerful tool for characterisation of aberration in a complex system.
Further Reading
Born, M. and Wolf, E. (1999). Principles of Optics, 7e. Cambridge: Cambridge University Press. ISBN: 0-521-64222-1.
Hecht, E. (2017). Optics, 5e. Harlow: Pearson Education. ISBN: 978-0-1339-7722-6.
Kidger, M.J. (2001). Fundamental Optical Design. Bellingham: SPIE. ISBN: 0-8194-3915-0.
Kidger, M.J. (2004). Intermediate Optical Design. Bellingham: SPIE. ISBN: 978-0-8194-5217-7.
Longhurst, R.S. (1973). Geometrical and Physical Optics, 3e. London: Longmans. ISBN: 0-582-44099-8.
Mahajan, V.N. (1991). Aberration Theory Made Simple. Bellingham: SPIE. ISBN: 0-8194-0536-1.
Mahajan, V.N. (1998). Optical Imaging and Aberrations: Part I. Ray Geometrical Optics. Bellingham: SPIE. ISBN: 0-8194-2515-X.
Mahajan, V.N. (2001). Optical Imaging and Aberrations: Part II. Wave Diffraction Optics. Bellingham: SPIE. ISBN: 0-8194-4135-X.
Slyusarev, G.G. (1984). Aberration and Optical Design Theory. Boca Raton: CRC Press. ISBN: 978-0852743577.
Smith, F.G. and Thompson, J.H. (1989). Optics, 2e. New York: Wiley. ISBN: 0-471-91538-1.
4 Aberration Theory and Chromatic Aberration
4.1 General Points
In the previous chapter, we developed a generalised description of third order aberration, introducing the five Gauss-Seidel aberrations. The motivation for this is to give the reader a fundamental understanding and a feel for the underlying principles. At the same time, it is fully appreciated that optical system design and detailed analysis of aberrations is underpinned by powerful optical software tools. Nevertheless, a grasp of the underlying principles, including an appreciation of the form of ray fans and optical path difference (OPD) fans, greatly facilitates the application of these sophisticated tools.
The treatment presented here is restricted to consideration of third order aberrations. Before the advent of powerful software analysis tools, the designer was compelled to resort to a much more elaborate and complex analysis, in particular introducing an analytical treatment of higher order aberrations. For all the labour that this would involve, the reader would gain little in terms of a useful understanding that could be applied to current design tools. Just as the third order aberrations are third order in transverse aberration and fourth order in OPD, so succeeding higher order aberrations are fifth, seventh, etc. order in transverse aberration, but sixth, eighth, etc. order in OPD. That is to say, the order of an aberration, which is conventionally expressed in terms of the transverse aberration, can only be odd. One can repeat the analysis of Section 3.4 to generate the form and number of terms involved in the higher order aberrations. This is left to the reader, but it is straightforward to derive the number of distinct terms, Nn, as a function of aberration order, n:
(4.1)
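The count of distinct terms can be checked by direct enumeration. The sketch below assumes the usual rotational-symmetry argument: each OPD term of order n + 1 is a product of powers of h², p², and hp cos φ, and the term with no pupil dependence is discarded. The closed form (n + 3)(n + 5)/8 − 1 used for comparison is the commonly quoted result and is an assumption here, since the expression for Eq. (4.1) is not reproduced in this text; the function names are illustrative.

```python
def count_terms(n):
    """Count distinct aberration terms of odd transverse order n by
    enumerating products (h^2)^i * (p^2)^j * (h p cos(phi))^k
    whose total order in h and p is n + 1."""
    if n < 3 or n % 2 == 0:
        raise ValueError("aberration order must be an odd integer >= 3")
    m = (n + 1) // 2  # number of second-order factors in each term
    count = 0
    for i in range(m + 1):
        for j in range(m + 1 - i):
            k = m - i - j
            if j == 0 and k == 0:
                # Pure field term: no pupil dependence, so not an aberration.
                continue
            count += 1
    return count


def closed_form(n):
    """Commonly quoted closed form (assumed, not taken from Eq. (4.1))."""
    return (n + 3) * (n + 5) // 8 - 1


for order in (3, 5, 7, 9):
    print(order, count_terms(order), closed_form(order))
```

For n = 3 the enumeration returns the five Gauss-Seidel terms; for fifth and seventh order it gives 9 and 14 terms respectively.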
In concentrating on third order aberrations, we shall, in the remainder of this chapter, seek to determine the impact of refractive surfaces, mirrors, and lenses on all the Gauss-Seidel aberrations. This analysis will proceed, initially, on the assumption that the surface in question lies at the pupil position. Subsequently, the impact of changing the position of the stop will be analysed. Manipulation of the stop position is an important variable in the optimisation of an optical design. The concept of the aplanatic geometry will be introduced where specific, simple optical geometries may be devised that are wholly free from either spherical aberration (SA) or coma (CO). These aplanatic building blocks feature in many practical designs and are significant because, in many instruments, such as telescopes and microscopes, there is a tendency for spherical aberration and coma to dominate the other aberrations. The elimination of spherical aberration and coma is thus a priority. Furthermore, by the same token, astigmatism (AS) and field curvature (FC) are more difficult to control. In particular, the control of field curvature is fundamentally limited by Petzval curvature, as alluded to in the previous chapter.

Figure 4.1 Calculation of OPD for refractive surface.
4.2 Aberration Due to a Single Refractive Surface
The analysis of the aberrations of a single refractive surface is based on the computation of the OPD of a generalised field point to the appropriate order (4th) in terms of field angle, θ, and ray height, r, at the pupil. For this analysis, we will assume that the pupil is located at the lens surface. In calculating the OPD, we force all rays to go to the paraxial focus and compute the OPD with respect to the chief ray. Figure 4.1 shows an object with a field angle, θ, located at a distance, u, from a spherical refractive surface of radius R. It must be emphasised that this analysis applies specifically to a spherical surface. In this geometry, it is assumed that the object is displaced from the optical axis in the y direction. The paraxial image is itself located at a distance v from the surface and the position of a ray at the surface (and stop) is described by its components in x and y, rx and ry.
The image in this case is the paraxial image and from the paraxial theory, the angle φ may be expressed in terms of θ as θ/n. To compute the optical path of a general ray as it passes from object to paraxial image, we need to define the ray co-ordinates at three points:



The z co-ordinate of the stop position is derived from the binomial expansion for the axial sag of a sphere, including terms up to the fourth power. In making this approximation, it is assumed that r is significantly less than R. If we were to adopt the paraxial approximation, we would only consider the first (r²) term in the expansion. In the case of third order aberration, we need to consider the next term. It is then straightforward to calculate the total optical path, Φ, for a general ray in passing from object to paraxial image:
(4.2)
The two square root terms represent the optical path of two ‘legs’ of the journey, with the path through the glass adding a multiplicative factor of n. The next stage of the process is an extension of the paraxial theory. It is assumed that rx, ry, and uθ are all significantly less than u. We can now approximate Φ from Eq. (4.2) using the binomial theorem. Collecting terms, we get:

Before deriving the third order aberration terms, we examine the paraxial contribution, which contains terms in r up to order r².
(4.3)
As one would expect, in the paraxial approximation, the optical path length is identical for all rays. However, for third order aberration, terms of up to order r⁴ must be considered. Expanding Eq. (4.2) to consider all relevant terms, we get:
(4.4)
Four of the five Gauss-Seidel terms are present: spherical aberration, coma, astigmatism, and field curvature. However, there is clearly no distortion. In fact, as will be seen later, distortion can only occur where the stop is not at the surface, as it is here. Of course, Eq. (4.4) can be simplified if one considers that u, v, and R are dependent variables, as related in Eq. (4.3). Substituting for v in terms of u and R, we can express the OPD in terms of u and R alone. Furthermore, it is useful at this stage to split the OPD contributions in Eq. (4.4) into Spherical Aberration (SA), Coma (CO), Astigmatism (AS), and Field Curvature (FC). With a little algebraic manipulation this gives:
(4.5a)
(4.5b)
(4.5c)
(4.5d)
4.2.1 Aplanatic Points
It is worthwhile, at this juncture, to examine the four expressions in Eqs. (4.5a)–(4.5d) in some detail and, in particular, those for spherical aberration and coma. Before examining these expressions further, it is worthwhile to cast them in the form outlined in Chapter 3:
(4.6a)
(4.6b)

Figure 4.2 Aplanatic points for refraction at single spherical surface.
There is a clear pattern in these expressions in that both spherical aberration and coma can be reduced to zero for specific values of the object distance, u. Examining Eqs. (4.6a) and (4.6b), it is evident that this condition is met where u = −R. That is to say, where the object is located at the centre of the spherical surface. However, this is a somewhat trivial condition where rays are undeviated by the surface and where the surface would not provide any useful additional refractive power to the system. Most significantly, another condition does exist for u = −(n + 1)R. Here, for this non-trivial case, both third order spherical aberration and coma are absent. This is the so-called aplanatic condition and the corresponding conjugate points are referred to as aplanatic points (Figure 4.2). From Eq. (4.3) we can derive the image distance, v, as (n + 1)R/n. That is to say, the object is virtual and the image is real if R is positive and vice-versa if R is negative.
To be a little more rigorous, we might suppose that refractive index in object space is n1 and that in image space is n2. The location of the aplanatic points is then given by:
(4.7)
Fulfilment of the aplanatic condition is an important building block in the design of many optical systems and so is of great practical significance. As pointed out in the introduction, for those systems where the field angles are substantially less than the marginal ray angles, such as microscopes and telescopes, the elimination of spherical aberration and coma is of primary importance. Most significantly, not only does the aplanatic condition eliminate third order spherical aberration, but it also provides theoretically perfect imaging for on-axis rays.
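The conjugate distances quoted for the two special cases can be checked numerically. The sketch below assumes that the paraxial relation for the surface takes the form 1/u + n/v = (n − 1)/R for an object in air. That form is inferred from the stated conjugate pair u = −(n + 1)R, v = (n + 1)R/n rather than copied from Eq. (4.3), and the function name image_distance is illustrative.

```python
def image_distance(u, R, n):
    """Paraxial image distance v for a single refracting surface,
    assuming the relation 1/u + n/v = (n - 1)/R (object in air)."""
    return n / ((n - 1.0) / R - 1.0 / u)


n = 1.6     # refractive index of the glass (example value)
R = 10.0    # surface radius in mm (example value)

# Trivial case: object at the centre of curvature, rays undeviated.
v_centre = image_distance(-R, R, n)

# Aplanatic case: u = -(n + 1) R should give v = (n + 1) R / n.
u_aplanatic = -(n + 1.0) * R
v_aplanatic = image_distance(u_aplanatic, R, n)

print("object at centre:   v =", v_centre)
print("aplanatic object:   v =", v_aplanatic)
print("expected (n+1)R/n:  v =", (n + 1.0) * R / n)
```

With positive R the object distance is negative (a virtual object) and the image distance is positive (a real image), in line with the statement above.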
Worked Example 4.1 Microscope Objective
The ‘front end’ of many high power microscope objectives exploits the principle of single surface aplanatic points through the use of a hyperhemisphere co-located with the object. The hyperhemisphere consists of a sphere that has been truncated at one of the aplanatic points which also coincides with the object location, as illustrated in Figure 4.3.
Using the hyperhemisphere, we wish to create a ×20 microscope objective for a standard optical tube length of 200 mm. In this example, it is assumed that two thirds of the optical power resides in the hyperhemisphere itself; other components collimate the beam. In other words:


Figure 4.3 Hyperhemisphere objective.
The refractive index of the hyperhemisphere is 1.6. What is the radius, R, of the hyperhemisphere and what is its thickness?
For a tube length of 200 mm, a ×20 magnification corresponds to an objective focal length of 10 mm. As two thirds of the power resides in the hyperhemisphere, then the focal length of the hyperhemisphere must be 15 mm. Inspecting Figure 4.2, it is clear that the thickness of the hyperhemisphere is −R × (n + 1)/n, or −1.625 × R. To calculate the value of R, we set up a matrix for the system. The first matrix corresponds to refraction at the planar air/glass boundary, the second to translation to the spherical surface and the final matrix to the refraction at that surface. On this occasion, translation to the original reference is not included.

From the above matrix, the focal length is −R/0.6 and hence R = −9.0 mm. The thickness, t, is −1.625 × R, or 14.625 mm. In this sign convention, R is negative, as the sense of its sag is opposite to the direction of travel from object to image space.
The (virtual) image is at (n + 1) × R from the sphere vertex or 2.6 × 9 = 23.4 mm.
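The arithmetic of this example can be reproduced with a few lines of code. The sketch below assumes 2 × 2 ray matrices acting on a (height, angle) vector, with refraction from index n1 to n2 at a surface of radius R written as [[1, 0], [(n1 − n2)/(n2 R), n1/n2]] and the focal length taken as −1/C, where C is the lower-left element. This convention is an assumption, chosen because it reproduces the quoted focal length of −R/0.6; the helper matmul is illustrative.

```python
def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


n = 1.6            # refractive index of the hyperhemisphere
f_target = 15.0    # required focal length of the hyperhemisphere, mm

R = -(n - 1.0) * f_target   # from f = -R/(n - 1)
t = -R * (n + 1.0) / n      # thickness: vertex to aplanatic point

planar = [[1.0, 0.0], [0.0, 1.0 / n]]           # air to glass, flat face
translate = [[1.0, t], [0.0, 1.0]]              # propagation through glass
spherical = [[1.0, 0.0], [(n - 1.0) / R, n]]    # glass to air, curved face

# Matrices are applied right to left, in the order the ray meets them.
system = matmul(spherical, matmul(translate, planar))
focal_length = -1.0 / system[1][0]
image_distance = (n + 1.0) * abs(R)   # virtual image from the sphere vertex

print(f"R = {R:.3f} mm, t = {t:.3f} mm")
print(f"focal length = {focal_length:.3f} mm")
print(f"virtual image distance = {image_distance:.3f} mm")
```

Under these assumptions the lower-right element of the system matrix evaluates to 1/n², which corresponds to the angular demagnification provided by the hyperhemisphere at its aplanatic points.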
In summary:

4.2.2 Astigmatism and Field Curvature
Unlike spherical aberration and coma, there is less scope for correction of astigmatism and field curvature. In Eqs. (4.5c) and (4.5d), astigmatism is corrected at the aplanatic point and field curvature at the radial points. However, the convention used in Eq. (4.5c) to describe astigmatic correction corresponds to zero sagittal ray defocus. On the other hand, using the alternative convention set out in Chapter 3 we have:
(4.8a)
(4.8b)
From Eq. (4.8a), it is evident that at the aplanatic condition where u = −(n + 1)R, the astigmatism vanishes, as does the spherical aberration and coma. It is interesting to see what might happen to the field curvature where this condition is fulfilled:
(4.9)
This is related to the Petzval field curvature, which, by definition, is the field curvature that arises when the astigmatism in the system is zero. Relating this to Eq. (4.8b), the field curvature may be expressed as:
(4.10)

Figure 4.4 Field curvature for single refraction.
It is clear that Eq. (4.9) represents, with its quadratic dependence upon the pupil location, r, a degree of defocus, Δf, or longitudinal aberration, that is quadratic in the field angle. This defocus is given by:
(4.11)
The systematic field dependent defocus can be represented as a spherical surface on which each field point is in focus. The curvature of this surface, CPETZ, equivalent to 1/RPETZ, where RPETZ is the Petzval radius, is given by:
(4.12)
The sign is important, in that the Petzval curvature is in the opposite sense to that of the surface itself. This point is illustrated in Figure 4.4.
The most significant point about Petzval curvature is, in common with the underlying wavefront error, that it is additive through a system. To illustrate this, we might consider a system with N surfaces with radius of curvature Ri. The material that follows each surface has a refractive index of ni. The Petzval curvature associated with the system is simply the sum of the individual curvatures and is referred to as the Petzval sum. This is given by:
(4.13)
The practical implication of Eq. (4.13) is that if a system consists of elements with entirely positive or entirely negative focal power, then that system will always exhibit field curvature. To achieve a flat field, or a zero Petzval sum, then any positive optical elements must be balanced by negative elements elsewhere in the system.
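The balancing of positive and negative elements can be illustrated numerically. The sketch below assumes the standard textbook form of the Petzval sum, in which each surface contributes (n_i − n_{i−1})/(n_i n_{i−1} R_i). The overall sign and any image-space index factor depend on convention and are omitted here, so this is not a transcription of Eq. (4.13). The function name petzval_sum and the lens data are illustrative.

```python
def petzval_sum(radii, indices, n_object=1.0):
    """Sum of (n_i - n_prev) / (n_i * n_prev * R_i) over all surfaces.

    radii   : surface radii R_i, in order of propagation
    indices : refractive index following each surface
    Sign and overall scaling conventions are deliberately left out.
    """
    total = 0.0
    n_prev = n_object
    for radius, n_next in zip(radii, indices):
        total += (n_next - n_prev) / (n_next * n_prev * radius)
        n_prev = n_next
    return total


# Thin biconvex lens in air, n = 1.5, radii +100 mm and -100 mm (f = 100 mm).
positive_lens = petzval_sum([100.0, -100.0], [1.5, 1.0])

# The same lens followed by a biconcave lens of equal and opposite power.
balanced_pair = petzval_sum([100.0, -100.0, -100.0, 100.0],
                            [1.5, 1.0, 1.5, 1.0])

print("single positive lens:", positive_lens)
print("positive + negative: ", balanced_pair)
```

For the single thin lens the sum reduces to 1/(nf), which is non-zero for any lens with power. Adding the negative element of the same glass cancels the sum, although in this simple illustration it also cancels the net power if the two elements are in contact; practical flat-field designs separate the elements or use different glasses.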
It must be emphasised that the condition for perfect image formation on the Petzval surface applies specifically to the scenario where astigmatism has been removed.
4.3 Reflection from a Spherical Mirror
The third order analysis for a spherical mirror proceeds in very much the same way as for the single refractive surface. That is to say, a ray is traced from the object location to the mirror and thence to the paraxial focus, regardless of whether the real ray actually terminates there. The general layout is shown in Figure 4.5. The sign convention used here is the same as applied to all previous analyses. That is to say, positive image distance is with the image to the right, and the image distance, as shown in Figure 4.5, is actually negative. However, it must be accepted that, as rays physically converge on this image point, this image is actually real, despite v being negative. In addition, the same convention is applied to mirror curvature; the mirror depicted in Figure 4.5 has negative curvature.

Figure 4.5 Reflection at spherical mirror.
The analysis proceeds as previously. Firstly, we set out the object and image positions and the ray intercept at the stop.



The optical path is given by:
(4.14)
Rearranging:

In applying the binomial approximation, one needs to be careful with regard to the sign convention. It should be accepted that each of the square root terms in Eq. (4.14) is positive for a real object and real image. That is to say, all rays are physically traced to the appropriate location. In the case of a mirror surface, a real image corresponds to a negative image distance, v. Once again, we examine the paraxial terms:
(4.15)
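The sign convention for the mirror can be made concrete with a small numerical sketch. It assumes the paraxial mirror relation takes the form 1/u − 1/v = −2/R, with u positive for a real object to the left and R negative for the concave mirror of Figure 4.5. This form is an assumption consistent with the conventions described in the text, not a transcription of Eq. (4.15), and the function name is illustrative.

```python
def mirror_image_distance(u, R):
    """Paraxial image distance v for a spherical mirror, assuming
    1/u - 1/v = -2/R, i.e. 1/v = 1/u + 2/R."""
    return 1.0 / (1.0 / u + 2.0 / R)


R = -200.0   # concave mirror: negative radius in this convention (mm)

# Real object 300 mm in front of the mirror.
v_general = mirror_image_distance(300.0, R)

# Object at the centre of curvature: the image is formed at the same point.
v_centre = mirror_image_distance(-R, R)

# Very distant object: the image approaches the focal point at R/2.
v_distant = mirror_image_distance(1.0e12, R)

print("u = 300 mm     -> v =", v_general)
print("u = |R|        -> v =", v_centre)
print("u -> infinity  -> v =", v_distant)
```

All three image distances are negative, yet in each case the rays physically converge to the left of the mirror, so the images are real.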
As for the refractive surface we expand Eq. (4.14) using the binomial theorem to give terms of the fourth order in OPD.
(4.16)
As with the refractive case, four of the five Gauss-Seidel terms are present – spherical aberration, coma, astigmatism, and field curvature. There is also no distortion. As previously, Eq. (4.16) can be simplified considering u, v, and R as dependent variables, as related in Eq. (4.15). We can, once more, express the OPD in terms of u and R alone. Splitting the OPD contributions in Eq. (4.16) into Spherical Aberration (SA), Coma (CO), Astigmatism (AS), and Field Curvature (FC) and with a little algebraic manipulation we have:
(4.17a)
(4.17b)
(4.17c)
(4.17d)
Equations (4.17a)–(4.17c) bear some striking similarities to those for the refractive surface. In fact, if one substitutes n = −1 in the corresponding refractive formulae, one obtains expressions similar to those listed above. Thus, in some ways, a mirror behaves as a refractive surface with a refractive index of minus one. Once again, there are aplanatic points where both spherical aberration and coma are zero. This occurs only where both object and image are co-located at the centre of the spherical surface. The apparent absence of field curvature may appear somewhat surprising. However, the Petzval curvature is non-zero, as will be revealed. We can now cast all terms in the form set out in Chapter 3 and introduce the Lagrange invariant, which is equal to the product of r0 and θ0 (the maximum field angle):
(4.18a)
(4.18b)
(4.18c)
(4.18d)
The Petzval curvature is simply given by subtracting twice the KAS term in Eq. (4.18c) from the field curvature term in Eq. (4.18d). This gives:
(4.19)

Figure 4.6 Petzval curvature for mirror.
In this instance, the Petzval surface has the same sense as that of the mirror itself. However, the radius of the Petzval surface is actually half that of the original surface. This is illustrated in Figure 4.6.
Calculation of the Petzval sum proceeds more or less as the refractive case. However, there is one important distinction in the case of a mirror system. For a system comprising N mirrors, each successive mirror surface inverts the sense of the wavefront error imparted by the previous mirrors.
(4.20)
