Next Article in Journal
Path Difference Optimization of 5G Millimeter Wave Communication Networks in Malaysia
Previous Article in Journal
Research on the Optimization of Open-Pit Mine Cast Blasting Parameters Based on the Optimal Economy of Dragline Stripping Technology
Previous Article in Special Issue
Research on a Super-Sub-Arc Bivariate Relative Angle Thermal Deformation Testing Method without Pitch Angle Limitation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Axis Laser Interferometer Not Affected by Installation Errors Based on Nonlinear Computation

Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(19), 10887; https://doi.org/10.3390/app131910887
Submission received: 12 July 2023 / Revised: 18 September 2023 / Accepted: 29 September 2023 / Published: 30 September 2023
(This article belongs to the Special Issue Recent Advances in Optical Coordinate Measuring Systems)

Abstract

:
Most multi-axis laser interferometers require accurate installation of the lasers and detectors since the position and orientation installation errors of lasers and detectors bring measurement error to the interferometer. In this paper, the multi-axis interferometer based on nonlinear computation is proposed, which avoids the measurement error caused by installation errors by taking the laser position and orientation as unknowns into the measurement model and discussing the solution component uniqueness of a nonlinear equation system. The simulation results show that even when the installation errors are close to 1 μm and 1 μrad, the proposed interferometer can still measure the multi-degree-of-freedom displacement accurately, and the root mean standard error (RMSE) of displacement is 1.884 nm and 5.871 × 10−7 mrad under a reading noise level of 0.1 nm.

1. Introduction

With a displacement measurement resolution higher than 1 nm, the laser interferometer is used in various precision and ultra-precision measurement applications. One of the typical applications is the lithography wafer stage in semiconductor equipment, which is used to carry the silicon wafer to achieve a series of multi-degree-of-freedom (multi-DOF) precision motions. Since the positioning accuracy of the wafer stage directly affects the overlay accuracy of the lithography, a high requirement for the accuracy and bandwidth of the multi-DOF displacement measurement is put forward [1,2,3].
The laser interferometer is commonly used in such cases to measure displacement based on the interference of laser beams, which measure the variation in optical path length by detecting the laser intensity, thus realizing the precise displacement measurement [4,5,6]. According to the laser type, the interferometers can be classified as homodyne interferometers with a single-frequency laser source and heterodyne interferometers with a dual-frequency laser source. To measure the multi-DOF displacement of a precision movement stage by using a multi-axis interferometer, more lasers and other optical elements need to be adopted. Additionally, it is necessary to carefully design the number and each installation position of lasers and the multi-DOF displacement decoupling computation. For example, five lasers are used to measure the stage surface at five different points in the five-axis laser interferometer, and the 5-DOF displacement is computed from the interferometer model [7]. In addition to the measurement scheme in [7], some researchers used more lasers with redundant information to decrease the nonlinear error [8,9,10,11]. Moreover, some researchers built a more accurate but more complex model of the coupling between multi-DOF displacement to improve the accuracy of the displacement computation results [12,13,14,15].
However, these multi-axis laser interferometers mentioned above inevitably have measurement systematical errors, which can be summarized as two reasons. First, the interferometer model is inaccurate, which is mainly caused by installation errors. For example, the laser beams in the model are parallel, but in practice they are not, and the installation position and orientation of the lasers in the model are different from those in practice. Second, the computation of the model is inaccurate. Usually, there are many nonlinear functions such as trigonometric functions and rational fractions in the interferometer model. To simplify the computation, these nonlinear functions are approximated linearly so that the explicit expression of the multi-DOF displacement can be derived. Both of the above two reasons will result in a multi-DOF displacement measurement error, which causes the loss of accuracy.
To avoid the above two factors leading to a measurement systematical error, parameters such as the laser beam angle and emitting point position should be introduced into the model as unknowns. In addition, the nonlinear measurement model should be solved directly without any linear approximation. However, in this way, the interferometers model is quite complex; that is, it contains a large number of unknowns and complex nonlinear functions, and how to solve the model accurately and quickly becomes a new problem. Although there are the Gauss–Newton method [16], Levenberg–Marquardt (LM) method, and other nonlinear equation system numerical algorithms [17,18,19], when it comes to practical engineering, due to the negative properties of the equations such as the singular Jacobi matrix, which are caused by the huge number of unknowns and the complex nonlinear functions, the convergence and the accuracy of the algorithm may not be ideal, which raises the necessity for a general and comprehensive analysis.
In this paper, an interferometer model with unknown parameters, including the laser beam angle and emitting point position, is proposed to avoid inaccurate modeling so that the measurement result is entirely unaffected by installation errors. A solution of the interferometer model without any linearization is proposed to avoid an inaccurate solution. For the problem arising in the model solving that the singular Jacobi matrix leads to no unique solution of the equations, the theory of solution component uniqueness is proposed to obtain a unique and accurate solution of the displacement in the model. This paper is organized as follows. In Section 2, the mathematical model of the laser interferometer is proposed, which shows the relation between interferometer readings and multi-DOF displacements. In Section 3, the displacement computation principle is proposed, and the focus problem that the solution of a nonlinear equation system is not unique due to the singular Jacobi matrix is discussed. In Section 4, the interferometer measurement and the displacement computation are simulated. The simulation result verifies the effectiveness of the proposed interferometer. Section 5 is the conclusion of the paper.

2. Interferometer Measurement Model

In this section, the structure and the mathematical model of the multi-axis laser interferometer, which avoids the two factors mentioned in Section 1, is proposed. For simplicity, the 3-DOF displacement measurement of the X-Y stage is mainly discussed, while the 6-DOF displacement measurement of the X-Y-Z stage is also mentioned, but is not the focus. Nevertheless, there is no difference in the principle between the two but there is in the computation complexity.

2.1. Single-Axis Interferometer Laser Path Model

Since the multi-axis interferometer consists of multiple regular single-axis interferometers, the single interferometer model is proposed first.
The measurement principle of the single interferometer is shown in Figure 1; the laser emits from the interferometer, reflects off the smooth surface of the stage (or the retroreflector fixed on the stage), and returns to the interferometer. Let a fixed position of the stage be the reference position, as shown in Figure 1a, and the reading of the interferometer at the reference position is set to 0. Then, the reading of the interferometer at the measurement position in Figure 1b is set as V and is equal to the variation of the laser path length after the stage is moved from the reference position to the measurement position, that is
V = L 1 L 0
where L 0 , L 1 are, respectively, the laser path length at the reference and measurement position.
Moreover, it can be obtained from Figure 1 that
{ L 0 = 2 cos θ 0 cos 2 θ 0 [ ( x 0 x d ) cos θ 0 + ( y 0 y d ) sin θ 0 b / 2 ] L 1 = 2 cos θ 1 cos 2 θ 1 [ ( x 1 x d ) cos θ 1 + ( y 1 y d ) sin θ 1 b / 2 ]
where x d , y d is the coordinate of the laser emission point, x 0 , y 0 and x 1 , y 1 are, respectively, the coordinate of the stage center at the reference and measurement position, and θ 0 , θ 1 are, respectively, the angle of the stage at the reference and measurement position. b and h are the length and width of the stage.
The single interferometer model proposed in Equations (1) and (2) shows the mathematical relation between the interferometer readings and the position of the stage. This model is still used in the following multi-axis interferometers.

2.2. 3-DOF Displacement Measurement Model

To avoid measurement errors caused by installation errors, the installation parameters such as laser beam angles and emission point positions are added to the model as unknowns. Then, it is necessary to add constraints to make the model solvable, and one way is to increase the redundant measurement information by using more interferometers.
In this paper, a 3-DOF displacement measurement of the X-Y stage using a six-axis interferometer is proposed. And the measurement scheme is shown in Figure 2. There are six interferometers for measuring the two adjacent sides of the X-Y stage. Assuming that the X-Y stage only moves in the X-O-Y plane, and all the laser beams are supposed to be in the X-O-Y plane, the measurement scheme is also expressed in Figure 3.
In Figure 3, the reference position of the stage is shown as the dotted line, and the measurement position is shown as the true line; the O-XYZ axis is set so that the coordinates of the stage center and the angle of the stage are both equal to 0 at the reference position. Due to the installation errors, there is an angle of each laser beam, which is set as α i , i = 1 ,   2 , ,   6 , and the unknown installation position of each interferometer is set as x d i , y d i , i = 1 ,   2 , ,   6 .
Based on the single interferometer laser path model proposed in Section 2.1, the readings of the six interferometers can be modeled as
{ V 1 = 2 cos ( θ α 1 ) cos ( 2 θ 2 α 1 ) [ ( x x d 1 ) cos θ + ( y y d 1 ) sin θ b / 2 ] + 2 cos α 1 cos 2 α 1 ( x d 1 + b / 2 ) V 2 = 2 cos ( θ α 2 ) cos ( 2 θ 2 α 2 ) [ ( x x d 2 ) cos θ + ( y y d 2 ) sin θ b / 2 ] + 2 cos α 2 cos 2 α 2 ( x d 2 + b / 2 ) V 3 = 2 cos ( θ α 3 ) cos ( 2 θ 2 α 3 ) [ ( x x d 3 ) cos θ + ( y y d 3 ) sin θ b / 2 ] + 2 cos α 3 cos 2 α 3 ( x d 3 + b / 2 ) V 4 = 2 cos ( θ α 4 ) cos ( 2 θ 2 α 4 ) [ ( y y d 4 ) cos θ + ( x x d 4 ) sin θ h / 2 ] + 2 cos α 4 cos 2 α 4 ( y d 4 + h / 2 ) V 5 = 2 cos ( θ α 5 ) cos ( 2 θ 2 α 5 ) [ ( y y d 5 ) cos θ + ( x x d 5 ) sin θ h / 2 ] + 2 cos α 5 cos 2 α 5 ( y d 5 + h / 2 ) V 6 = 2 cos ( θ α 6 ) cos ( 2 θ 2 α 6 ) [ ( y y d 6 ) cos θ + ( x x d 6 ) sin θ h / 2 ] + 2 cos α 6 cos 2 α 6 ( y d 6 + h / 2 )
where V i is the reading of the ith interferometer, i = 1 ,   2 , ,   6 , and x , y , θ is the 3-DOF displacement of the stage at the measurement position since the coordinate of the stage center is equal to 0 at the reference position.
According to Equation (3), after obtaining the value of V i in practice, there are a total of 21 unknowns including x d i , y d i , α i , i = 1 ,   2 , ,   6 , and x , y , θ in the nonlinear equation system. Since the number of unknowns is greater than the number of constraints, it is unrealistic to solve the displacement x , y , θ from Equation (3). Therefore, the number of measurements is increased to gain more redundant information; that is, all the interferometers’ readings are recorded at every 10 distinct measurement positions for one displacement computation. Let x n , y n , θ n be the displacement at nth measurement positions, n = 1 ,   2 , ,   10 , the six interferometers’ readings at 10 measurement positions can be modeled as
{ V n 1 = 2 cos ( θ n α 1 ) cos ( 2 θ n 2 α 1 ) [ ( x n x d 1 ) cos θ n + ( y n y d 1 ) sin θ n b / 2 ] + 2 cos α 1 cos 2 α 1 ( x d 1 + b / 2 ) V n 2 = 2 cos ( θ n α 2 ) cos ( 2 θ n 2 α 2 ) [ ( x n x d 2 ) cos θ n + ( y n y d 2 ) sin θ n b / 2 ] + 2 cos α 2 cos 2 α 2 ( x d 2 + b / 2 ) V n 3 = 2 cos ( θ n α 3 ) cos ( 2 θ n 2 α 3 ) [ ( x n x d 3 ) cos θ n + ( y n y d 3 ) sin θ n b / 2 ] + 2 cos α 3 cos 2 α 3 ( x d 3 + b / 2 ) V n 4 = 2 cos ( θ n α 4 ) cos ( 2 θ n 2 α 4 ) [ ( y n y d 4 ) cos θ n + ( x n x d 4 ) sin θ n h / 2 ] + 2 cos α 4 cos 2 α 4 ( y d 4 + h / 2 ) V n 5 = 2 cos ( θ n α 5 ) cos ( 2 θ n 2 α 5 ) [ ( y n y d 5 ) cos θ n + ( x n x d 5 ) sin θ n h / 2 ] + 2 cos α 5 cos 2 α 5 ( y d 5 + h / 2 ) V n 6 = 2 cos ( θ n α 6 ) cos ( 2 θ n 2 α 6 ) [ ( y n y d 6 ) cos θ n + ( x n x d 6 ) sin θ n h / 2 ] + 2 cos α 6 cos 2 α 6 ( y d 6 + h / 2 )
where V n i is the reading of the ith interferometer at the nth measurement position, n = 1 ,   2 , ,   10 ,   i = 1 ,   2 , ,   6 .
The equation system (4) is the proposed 3-DOF displacement measurement model. As we can see, the number of equations is 60, which is larger than the number of unknowns (48 in total, including x n , y n , θ n , n = 1 ,   2 , ,   10 and x d i , y d i , α i , i = 1 ,   2 , ,   6 ). The 3-DOF displacement can be computed from Equation (4) by using the nonlinear equations system algorithm, which is discussed in Section 3.
From Equation (4), it should be noted that since the installation positions and angles x d i , y d i , α i are unknowns, the installation errors will not bring errors to the displacement computation. In addition, there is no linear approximation in Equation (4), which also makes the displacement computation result accurate.
It should also be noted that the number of interferometers and the number of measurement positions in the measurement model can be varied, and their selection rule is as follows: let the number of interferometers be M and the number of measurement positions be N (in Equation (4), M = 6 and N = 10 ). According to the above analysis, there are M N equations and ( 3 M + 3 N ) unknowns in the model. To make the number of unknowns not greater than the number of equations, there is
3 M + 3 N M N
It can be seen from Equation (5) that M is supposed to be larger than 3 , otherwise the number of equations will never catch up with the number of unknowns. Hence, Equation (5) can be rewritten as
N 3 + 9 M 3
Equation (6) shows that N can be smaller with larger M . Under the restriction of Equations (5) and (6), less M N means less computational complexity, but large M N provides enough redundant information to solve the unknowns. Therefore, it is selected that M = 6 and N = 10 in this section.

2.3. 6-DOF Displacement Measurement Model

In the 3-DOF displacement measurement of the X-Y stage, the assumption that the stage only moves in one plane is adopted. Although this assumption holds approximately in the case that the assembly is ideal, there is still deviation from the actual situation. When it comes to having sufficient computing resources, it is necessary to discuss and apply the 6-DOF displacement measurement to reduce the deviation between the model and the actual situation.
The 6-DOF displacement measurement scheme of the X-Y-Z stage using a nine-axis interferometer is proposed and shown in Figure 4. There are three interferometers that measure each one of the three adjacent surfaces of the stage. Similar to the 3-DOF displacement measurement of the X-Y stage, the mathematical model of the readings of the nine interferometers can be deduced, which is too complex because it needs to calculate the laser path in 3-dimensional space. Since the 6-DOF model is only more complex than the 3-DOF model, and there is no difference in the modeling principle, the detailed expression of the 6-DOF model is not given in this section, but the function heading is given.
The same as in the 3-DOF measurement, the O-XYZ coordinate axis is set to ensure that the origin coincides with the center of the stage at the reference position. Let x , y , z , θ , φ , ω be the 6-DOF displacement, then the readings of interferometers at the measurement position are
V i = V i ( x , y , z , θ , φ , ω , x d i , y d i , z d i , α i , β i ) , i = 1 , 2 , , 9
where V i is the reading of the i th interferometer, V i · is the function of the length variation of laser path from the reference position to the measurement position, x d i , y d i , z d i is the installation position of the i th interferometer, and α i , β i is the angle of the i th laser beam.
Also, to make the model solvable, the number of equations is supposed to be larger than the number of unknowns. Hence, all the interferometers’ readings are recorded at every 20 distinct measurement positions for one displacement computation. Let x n , y n , z n , θ n , φ n , ω n be the displacement at the n th measurement positions, n = 1,2 , , 20 , the nine interferometers’ reading at 20 measurement positions are
V n i = V n i ( x n , y n , z n , θ n , φ n , ω n , x d i , y d i , z d i , α i , β i ) ,   n = 1 , 2 , , 20 ,   i = 1 , 2 , , 9
where V n i is the reading of the i th interferometer at the n th measurement position, n = 1 ,   2 , ,   20 ,   i = 1 ,   2 , ,   9 .
Since there are a total of 180 equations and 165 unknowns in Equation (8), the 6-DOF displacement measurement model (8) is also theoretically solvable. Similar to the analysis in Section 2.2, the installation errors do not affect the displacement computation, and the nonlinear model without any approximation provides high accuracy.
As we can see, both the 3-DOF model and the 6-DOF model are nonlinear equation systems with a large number of unknowns. In this case, the existence and uniqueness of the solution, as well as the solution algorithm, are all key factors affecting the accurate displacement computation, which are discussed in Section 3.

3. Multi-DOF Displacement Computation

The solution and the algorithm of the multi-axis interferometer measurement model are discussed. For simplicity, only the solution of the 3-DOF displacement measurement model (4) is discussed, and the solution principle of the 6-DOF model (8) is the same.

3.1. Problem of Solving Nonlinear Equation System

Rewriting the nonlinear equation system (4) as
F ( X ) = 0
where X is the variable consisting of the 48 unknowns; F · is the equation function of Equation (4).
The classical algorithm of the nonlinear equation system (9) is the Gauss–Newton method [16], in which the iteration formula is
X ( k + 1 ) = X ( k ) [ J T ( X ( k ) ) J ( X ( k ) ) ] 1 J T ( X ( k ) ) F ( X ( k ) )
where k is the number of iterations and J ( · ) is the Jacobi matrix of F · .
The problem is that in the computation process of solving (4) in the form of Equation (9) by using Equation (10), J ( X ( k ) ) is always going to be singular and J T ( X ( k ) ) J ( X ( k ) ) is not invertible due to its huge condition number, which makes it impossible to continue the iteration. In fact, for a nonlinear equation system with a singular Jacobi matrix, it is proven that the solution is not unique, and both the Newton method and Gauss–Newton method lose their convergence [16]. Hence, the improved algorithm needs to be applied. As it is proven that the solution of a nonlinear equation system with a singular Jacobi matrix is a set with at least one dimension and the Levenberg–Marquardt method (LM method) can be used to converge to a solution on the solution set [20], the iteration formula is
X ( k + 1 ) = X ( k ) [ J T ( X ( k ) ) J ( X ( k ) ) + μ I ] 1 J T ( X ( k ) ) F ( X ( k ) )
where I is the identity matrix and μ is the steepest descent factor.
However, although there are improved algorithms such as the LM method and Newton method based on the MP inverse mentioned in [21] for solving nonlinear equations with a singular Jacobi matrix, and since the solution of Equation (4) is not unique and the iteration varies with the initial iterate, there is still a deviation between the computation result of the 48 unknowns and their true value. As an illustration, the computation of solving Equation (4) with the Newton method is simulated: a set of true values of unknowns is given, and 10 different initial values are randomly generated. In contrast, the iterations of x 1 and x d 1 using the 10 initial values are shown in Figure 5. The detail of Figure 5 is drawn as Figure 6 to observe the convergence results.
From Figure 5, the iterations of x 1 and x d 1 both converge; however, from Figure 6, the convergence of x d 1 changes with initial values. Hence, the convergence accuracy of x 1 and x d 1 are different, which means that the accuracy of each component of the equation system solution is different. Since we only care about the solution of displacement x n , y n , θ n , n = 1 ,   2 , ,   10 , can it be proven that all the displacement unknowns can be accurately solved? The answer is yes, and the proof is given in the following sections.

3.2. Solution Component Uniqueness Theory

As shown in Figure 6, the convergence of each variable of the nonlinear equation system with a singular Jacobi matrix is different, which is proven to be related to the uniqueness of the solution component in this section. To prove the solvability of displacement in the interferometer measurement model, the solution component uniqueness theory is proposed.
Without loss of generality, consider the nonlinear equation system
F ( X ) = 0
where X R N , F : R N R M is a continuously differentiable nonlinear function.
Equation (12) is said to be a singular nonlinear equation system (SNES) if it has a singular Jacobi matrix. Let X = x 1 , , x N T be any one in the solution set of Equation (12), the solution component x n is said to be unique or with uniqueness if there is only one value for it, where n N , 1 n N .
Let X * = x 1 * , , x N * T be the true value of the unknowns in Equation (12). If we look at the interferometer model (4), X * is the true value of the 48 unknowns. Let X ^ = x ^ 1 , , x ^ N T be the solution computed by any improved algorithm. For the case that Equation (12) is SNES, as mentioned in Section 2, X ^ varies with the initial iterate, and the solution of Equation (12) is a set with at least one dimension. X * and X ^ both belong to the solution set, but there is no evidence that X * = X ^ holds. The following Theorem 1 shows that the distance between x n * and x ^ n depends on the solution component uniqueness.
Theorem 1. 
For  n N , 1 n N ,  x ^ n = x n *   holds if  x ^ n   is unique. Otherwise,  x ^ n = x n *  might not hold.
Proof. 
For the condition that x ^ n is unique, there is only one value for the solution of x n . Since both X * and X ^ are the solution of (12), it is easy to check that x ^ n = x n * holds. In contrast, for the condition that x ^ n is not unique, there are two distinct solutions of (12) set as X A and X B with two distinct n th components set as x n A and x n B . Without loss of generality, suppose that X * = X A and X ^ = X B , then we have x n * = x n A and x ^ n = x n B , which indicate that x ^ n x n * holds. Thus, the proof is finished.
Theorem 1 implies that only for the solution component with uniqueness, the result computed by the algorithm always converges to the true value. Therefore, for the aim to solve the 30 displacement unknowns in Equation (4), the uniqueness of the corresponding 30 solution components must be verified. The following Theorem 2 provides the basis to help verify the solution component uniqueness.
Lemma 1. 
All the components of solution  X  are unique if  J ( X * )  has full column rank.
Proof. 
For the J ( X * ) with full column rank, it follows from the continuity of F X that there exists the neighborhood of X * as B X * , r with r > 0 such that rank [ J ( X ) ] = rank [ J ( X * ) ] holds for all the X B X * , r ., i.e., for all the X B X * , r , J ( X ) has full column rank. Thus, it follows from the Newton method [22] that X k converges to the unique solution X * . Therefore, all the components of X converge to the unique solution, the proof is finished.
Theorem 2. 
Let  J ( X * ) = p 1 , , p N ,  where  p n R M × 1  for  n = 1,2 , , N .  Then,  x n   is not unique if  rank  p 1 , , p N = rank p 1 , , p n 1 , p n + 1 , , p N  holds. Otherwise,  x n  is unique.
Proof. 
According to the definition of the Jacobi matrix, the total differential of (12) at X * is
j = 1 N p j Δ x j = 0
If rank p 1 , , p N = rank p 1 , , p n 1 , p n + 1 , , p N holds, there is
p n = j = 1 , j n N α j p j
where α j R .
Substituting Equation (14) into Equation (13), then we have
j = 1 , j n N p j ( Δ x j + α j Δ x n ) = 0
Let
y l = { x l + α l x n , 1 l < n x l + 1 + α l + 1 x n , n l N 1
Substituting Equation (16) into Equation (15), there is
[ p 1 , , p n 1 , p n + 1 , , p N ] [ Δ y 1 , , Δ y N 1 ] T = 0
Taking (17) as the total differential of the equation system, F Y = 0 where Y = y 1 , , y N 1 T , for which the Jacobi matrix is p 1 , , p n 1 , p n + 1 , , p N .
Without loss of generality, suppose that p 1 , , p n 1 , p n + 1 , , p N has full rank, otherwise we can repeat the above process to cut down the column elements. Then, it follows from Lemma 1 that all the solutions of y l are unique. Let y l * be the unique solution of y l , it follows from (16) that
( 1 α 1 1 α n 1 α n + 1 1 α N 1 ) ( N 1 ) × N ( x 1 x 2 x N ) = ( y 1 * y 2 * y N 1 * )
holds, which is the linear equation system of x 1 , , x N T . Notice that the blanks in the matrix in (18) are all 0.
It is easy to check that x n in (18) has infinitely many solutions, which means x n is not unique. The proof of the first half of the statement is finished.
As for the other condition that rank p 1 , , p N > rank   p 1 , , p n 1 , p n + 1 , , p N , the discussion is divided into the following two cases: For the case that p 1 , , p n 1 , p n + 1 , , p N has full column rank, p 1 , , p N must have full column rank since its rank is larger than p 1 , , p n 1 , p n + 1 , , p N . Then, it follows from Lemma 1 that x n is unique. For the other case that p 1 , , p n 1 , p n + 1 , , p N has no full column rank, let p 1 , , p n 1 , p n + 1 , , p N = q 1 , , q W after cutting down all the linearly dependent columns by taking the steps same as (14)–(17), where q 1 , , q W has full column rank and W < N 1 . Then, the new Jacobi matrix becomes p n , q 1 , , q W , which has full column rank. Hence, according to Lemma 1, x n is unique. The proof of the second half of the statement is finished.
From Theorem 2, the uniqueness of the solution component can be determined by computing whether the corresponding column in the Jacobi matrix is independent. Therefore, to verify the uniqueness of the displacement solution in the interferometer model, the columns dependence computation methods are discussed in the following.
Although there are Gaussian elimination, determinant computation, and other numerical algorithms used to compute the dependence of vectors [23], since the numerical error exists in the Jacobi matrix computation, the dependence computation result might go wrong if the tolerance of the algorithms is improper (tolerance can only be selected according to empirical methods, without means of debugging or correction due to the lack of appropriate feedback). To make the dependence computation more accurate, a numerical algorithm for dependence computation based on principle component analysis (PCA) is proposed.
For the given Jacobi matrix J M × N = p 1 , , p N , normalizing by
A = [ p 1 mean ( p 1 ) , , p N mean ( p N ) ]
The covariance matrix of A is
C = A T A / M
Computing the eigenvalue decomposition
C = V D V T
where V = [ v 1 , , v N ] is the eigenvector matrix with v R [ 1,1 ] N × 1 , D = diag ( σ 1 , , σ N ) is a diagonal matrix whose diagonal elements are eigenvalues in descending order.
According to PCA, after setting the threshold TH σ and r with that σ 1 > > σ r > TH σ > σ r + 1 > > σ N , we have
J prin = J V r
for dimensionality reduction of J , where V r = [ v 1 , , v r ] .
If p n in J is strongly dependent, which indicates that it carries less information, we believe that it contributes less to J p rin so that the weight of p n in Equation (22) is smaller. Suppose that V r = w 1 , , w N T , w R r × 1 , then the weight of p n is w n . Thus, taking the infinite norm of w n as the independent factor (IDF), that is
IDF n = w n
There is IDF n [ 0,1 ] since v R [ 1,1 ] N × 1 . The closer IDF n is to 0, which indicates that p n has weaker independence and stronger dependence in J , it follows from Theorem 2 that x n is more likely to be not unique; on the contrary, with stronger independence and weaker dependence of p n , x n is more likely to be unique.
According to the above, the IDF shows the possibility of whether a solution component is unique. After computing the IDFs of all the vectors in the Jacobi matrix, some simple classification algorithms such as the k-nearest neighbor algorithm [24] can be used to classify all the components into two categories according to the size of their IDFs, so as to determine the components’ uniqueness. A simple approach is setting a threshold TH IDF ( 0,1 ) , there is
x n i s   n o t   u n i q u e , IDF n < TH IDF i s   u n i q u e , IDF n > TH IDF
The brief process of the solution component uniqueness algorithm proposed above is summarized as follows.
So far, by using Theorem 1, Theorem 2, and Algorithm 1, it can be determined whether each unknown in the SNES can be solved accurately. Also, the solvability of the displacement in the interferometer model can be verified, which is discussed in the following Section 3.3.
Algorithm 1: Solution component uniqueness algorithm.
1 :   Initialization :   Given   TH σ > 0 ,   TH IDF ( 0,1 ) ,   X is a solution in the solution set of (12)
2 :   p 1 , , p N = J X
3 :   A = p 1 mean ( p 1 ) , , p N mean ( p N )
4 :   C = A T A / M
5 :   Eigen   decomposition   C = VDV T ,   where   V = [ v 1 , , v N ] ,   D = diag ( σ 1 , , σ N )
6 :   Find   r   satisfies   σ 1 > > σ r > TH σ > σ r + 1 > > σ N
7 :   w 1 , , w N T = [ v 1 , , v N ]
8 :   IDF 1 , , IDF N = [ w 1 , , w N ]
9: Compute the uniqueness of each component by Equation (24)

3.3. Principle of Displacement Computation

In this Subsection, the algorithm of the displacement computation of the interferometer measurement model is proposed, and the high accuracy of the displacement computation result is verified.
Since the classic Gauss–Newton method is not applicable to SNES, the Newton method with MP inverse is proposed to solve the displacement from the interferometer model (4). The linear expansion approximation of the equation system (12) at the given iteration X ( k ) is
F ( X ( k ) ) + J ( X ( k ) ) ( X X ( k ) ) = 0
In the Newton method, the solution of X in (25) is taken as the next step X ( k + 1 ) in the iteration. In the case that J ( X ( k ) ) is not a square matrix, the Gauss–Newton method solves X ( k + 1 ) by multiplying Equation (25) by [ J T ( X k ) J ( X k ) ] 1 J T X k , which fails when J T ( X k ) J ( X k ) is singular. In this paper, let Equation (25) be the inconsistent linear equation system of ( X X k ) , then the minimum 2-norm solution is
X X ( k ) = J ( X ( k ) ) [ F ( X ( k ) ) ]
where J ( X k ) is the MP inverse of J X k , which is calculated by singular value decomposition
J ( X ( k ) ) = V 1 S 1 1 U 1 T
where V 1 , S 1 , U 1 satisfy
J ( X ( k ) ) = U S V T = [ U 1 U 2 ] T [ S 1 0 0 0 ] [ V 1 V 2 ]
where S is the diagonal matrix, whose diagonal elements are singular values in descending order.
Therefore, according to Equation (26), the iteration formula is
X ( k + 1 ) = X ( k ) + J ( X ( k ) ) [ F ( X ( k ) ) ]
The brief process of the proposed Newton method with MP inverse is summarized as follows.
Since the convergence of the Newton method with MP inverse is discussed [22], Algorithm 2 can be used to solve the interferometer model (4). However, the solution component uniqueness needs to be verified.
Algorithm 2: Newton method with MP inverse.
1 :   Initialization :   Given   k max ,   X 0 ,   k   0
2 :   While   k < k max do
3 : Singular   Value   Decomposition   J X k = US V T
4 : Find   V 1 , S 1 , U 1 satisfy (28)
5 : J X k = V 1 S 1 1 U 1 T
6 : X k + 1 = X k + J ( X k ) [ F ( X k ) ]
7:  k = k +1
8: end while
Algorithm 1 is used to compute the solution uniqueness of the 48 unknowns in Equation (4). Number the 48 unknowns as
X = [ x d 1 , x d 2 , , x d 6 , x 1 , x 2 , , x 10 , y d 1 , y d 2 , , y d 6 , y 1 , y 2 , , y 10 , α 1 , α 2 , , α 6 , θ 1 , θ 2 , , θ 10 ]
Then, the eigenvalues [ σ 1 , , σ 48 ] in Equation (21) are shown in Figure 7a, and the IDFs of the 48 vectors in the Jacobi matrix are shown in Figure 7b. It is easy to check from Figure 7b that the IDF of the vectors whose numbers are 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, and 17 are much lower than the others. The corresponding solution components are not unique for the threshold, which is taken as TH IDF = 0.3 . Thus, the convergence results of these unknowns are not accurate, which means that the computation of the interferometers’ installation position ( x d , y d ) is not accurate. On the contrary, the other 36 unknowns, including the 3-DOF displacement, are solved with high accuracy. The solvability of the displacement in the interferometer model is proven.

4. Simulation Results

The 3-DOF displacement measurement of the proposed interferometer is simulated to verify the high accuracy. To verify the advantage that there is no measurement error caused by installation errors, the deviation between the nominal value and the true value of the interferometers’ installation positions and orientations is set as shown in Table 1.
The simulation scheme is shown in Figure 8. A total of 10 groups of 3-DOF displacement in the range of the interferometer are randomly selected as the true values. By substituting the true values of displacements and the true values of installation parameters into the interferometer model (4), the readings of the six interferometers are simulated. Then, substituting the readings into Equation (4) and taking the displacements and the installation parameters as unknowns, the displacement computation is simulated by using the proposed method in Section 3. In solving Equation (4) with Algorithm 2, the nominal value of the installation parameters is taken as the initial iteration, and the initial iteration of the displacements is the computation result in the last time step.
As the simulation results, the iteration error curves of the 48 unknowns are shown in Figure 9, Figure 10, Figure 11 and Figure 12, where Figure 9 and Figure 10 are the iteration error curves of the installation position ( x d , y d ) and orientation α of the 6 interferometers; Figure 11 and Figure 12 are the iteration error curves of the 10 sets of displacement ( x ,   y ,   θ ) . It is easy to check from Figure 9, Figure 10, Figure 11 and Figure 12 that the 3-DOF displacement ( x ,   y ,   θ ) converges to the true value while the laser installation position ( x d , y d ) does not. Hence, the solvability of the displacement is verified.
Further, the error reduction ratio (ERR) is defined as follows to verify the conclusion in Section 3.3 that only the unknowns with high IDF can be solved accurately: for the general nonlinear equation system F X = 0 where X = x 1 , , x N T , the ratio of the variable error after iteration to the variable error before iteration is defined as ERR
ERR n = 20 log | x ^ n x n * x n ( 0 ) x n * |
where ERR n is the ERR of x n , x n * is the true value, x n 0 is the initial iteration, and x ^ n is the convergence solution.
For 3-DOF displacement computation, namely the solution of Equation (4), there are a total of 48 unknowns including displacements and installation parameters. Arrange the 48 unknowns as Equation (30), the ERR of each unknown is shown in Figure 13. It can be seen from Figure 13 that the magnitude of ERRs corresponds to the IDFs shown in Figure 7b, which verifies that only the iteration of unknowns with a high IDF can converge to the true value.
To test the numerical accuracy of the proposed algorithm, 200 displacement measurements are simulated for the case that there is noise in the readings of interferometers, and the root mean square (RMS) of noise is set as 0.1 nm to simulate the practical engineering. The measurement error of displacement is shown in Figure 14, from which the mean and the standard deviation (STD) of displacement x error are 0.155 nm and 1.884 nm, and the mean and STD of displacement θ error are 3.549 × 10−8 mrad and 5.871 × 10−7 mrad. According to the above, the simulation results show the unbiasedness, low uncertainty, and high convergence speed of the displacement computation.
It should be noted that according to Algorithm 2 in Section 3.3, the amount of computation is proportional to the number of iterations, equations, and unknowns, which depend on the number of interferometers and measurement positions. Therefore, compared with traditional linear multi-axis interferometer models, the proposed iterative-based modeling and solving algorithm has a greater computational complexity. Nevertheless, in practical engineering, the computational time can be decreased by adopting a high-efficiency processing unit such as the FPGA with a parallel computing mode.

5. Conclusions

In this paper, a multi-axis laser interferometer is proposed, whose measurement is not affected by the installation errors. The interferometer measurement model shows that there is no need for the accurate installation of lasers, and the displacement computation shows the irrelevance between high measurement accuracy and installation errors, which is proven by the proposed solution component uniqueness theory for nonlinear equation systems. The simulation shows that even with installation errors close to 1 μm and 1 μrad, the interferometer is still able to measure the multi-DOF displacement accurately. For the case of where there is reading noise with RMS=0.1 nm, the STD of the displacement error is 1.884 nm and 5.871 × 10−7 mrad.
This paper provides a novel way for the design and development of multi-axis interferometers, that is, with today’s more and more developed computer processing ability, using a high-performance numerical computation to compensate for the lack of low mechanical manufacturing and assembly level, so that the measurement accuracy approaches the theoretical limit. The modeling method proposed is not limited to the topology of the interferometer in the paper, but can also be applied to other types of multi-axis interference systems. It is only necessary to take the installation parameters such as the laser angle as unknowns into the optical path model, and then the nonlinear computation method can be used to solve the model to obtain accurate displacement measurement results.
In the future, experiments will be designed to verify the measurement performance of the proposed interferometer. The optimized iteration algorithms with a lower complexity and higher speed will also be studied.

Author Contributions

Writing—original draft, T.W.; Writing—review and editing, J.H., Y.Z. and G.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 51175296 and 51677104, and by the Autonomous Scientific Research Project of the State Key Laboratory of Tribology, Tsinghua University, grant number SKLT2018C07.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yu, X.; Gillmer, S.R.; Woody, S.C.; Ellis, J.D. Development of a compact, fiber-coupled, six degree-of-freedom measurement system for precision linear stage metrology. Rev. Sci. Instrum. 2016, 87, 065109. [Google Scholar] [CrossRef] [PubMed]
  2. Chen, Y.T.; Lin, W.C.; Liu, C.S. Design and experimental verification of novel six-degree-of freedom geometric error measurement system for linear stage. Opt. Lasers Eng. 2017, 92, 94–104. [Google Scholar] [CrossRef]
  3. Zimmermann, N.; Ibaraki, S. Self-calibration of rotary axis and linear axes error motions by an automated on-machine probing test cycle. Int. J. Adv. Manuf. Technol. 2020, 107, 2107–2120. [Google Scholar] [CrossRef]
  4. Christiansen, A.J.; Naylor, D.A.; Gom, B.G. Multiaxis applications of a cryogenic range-resolved laser interferometer. Photonic Instrum. Eng. X 2023, 12428, 271–282. [Google Scholar]
  5. Leun, E.V. Highly Sensitive Single-Coordinate Measurement Electric Field Strength by Jet-Drop Optical Measuring Systems with a Pendant Drop. In Proceedings of the Dynamics of Systems, Mechanisms and Machines (Dynamics), Omsk, Russia, 15–17 November 2022; pp. 1–5. [Google Scholar]
  6. Haitjema, H. Calibration of displacement laser interferometer systems for industrial metrology. Sensors 2019, 19, 4100. [Google Scholar] [CrossRef] [PubMed]
  7. Gao, Z.; Hu, J.; Zhu, Y.; Duan, G. A new 6-degree-of-freedom measurement method of XY stages based on additional information. Precis. Eng. 2013, 37, 606–620. [Google Scholar] [CrossRef]
  8. Li, X.; Gao, W.; Muto, H.; Shimizu, Y.; Ito, S.; Dian, S. A six-degree-of-freedom surface encoder for precision positioning of a planar motion stage. Precis. Eng. 2013, 37, 771–781. [Google Scholar] [CrossRef]
  9. Hsieh, H.L.; Pan, S.W. Development of a grating-based interferometer for six-degree-of-freedom displacement and angle measurements. Opt. Express 2015, 23, 2451–2465. [Google Scholar] [CrossRef] [PubMed]
  10. Dejima, S.; Gao, W.; Shimizu, H.; Kiyono, S.; Tomita, Y. Precision positioning of a five degree-of-freedom planar motion stage. Mechatronics 2005, 15, 969–987. [Google Scholar] [CrossRef]
  11. Holmes, M.; Hocken, R.; Trumper, D. The long-range scanning stage: A novel platform for scanned-probe microscopy. Precis. Eng. 2000, 24, 191–209. [Google Scholar] [CrossRef]
  12. Cai, K.; Tian, Y.; Wang, F.; Zhang, D.; Liu, X.; Shirinzadeh, B. Design and control of a 6-degree-of-freedom precision positioning system. Robot. Comput. Integr. Manuf. 2017, 44, 77–96. [Google Scholar] [CrossRef]
  13. Zhu, J.; Wang, G.; Wang, S.; Li, X. A reflective-type heterodyne grating interferometer for three-degree-of-freedom subnanometer measurement. IEEE Trans. Instrum. Meas. 2022, 71, 1–9. [Google Scholar] [CrossRef]
  14. Van Den Brink, M.A.; Straaijer, A. Lithographic Apparatus for Step-and-Scan Imaging of Mask Pattern with Interferometer Mirrors on the Mask and Wafer Holders. U.S. Patent No. 6,084,673, 4 July 2000. [Google Scholar]
  15. Liu, C.S.; Pu, Y.F.; Chen, Y.T.; Luo, Y.T. Design of a measurement system for simultaneously measuring six-degree-of-freedom geometric errors of a long linear stage. Sensors 2018, 18, 3875. [Google Scholar] [CrossRef] [PubMed]
  16. Madsen, K.; Nielsen, H.B.; Tingleff, O. The Gauss-Newton Method. In Methods for Non-Linear Least Squares Problems, 2nd ed.; Informatics and Mathematical Modelling, Technical University of Denmark: Kongens Lyngby, Denmark, 2004; pp. 20–23. [Google Scholar]
  17. Sugihara, T. Solvability-unconcerned inverse kinematics by the Levenberg–Marquardt method. IEEE Trans. Robot. 2011, 27, 984–991. [Google Scholar] [CrossRef]
  18. Fan, J. The modified Levenberg-Marquardt method for nonlinear equations with cubic convergence. Math. Comput. 2012, 81, 447–466. [Google Scholar] [CrossRef]
  19. Ueda, K.; Yamashita, N. On a global complexity bound of the Levenberg-Marquardt method. J. Optim. Theory Appl. 2010, 147, 443–453. [Google Scholar] [CrossRef]
  20. Dan, H.; Yamashita, N.; Fukushima, M. Convergence properties of the inexact Levenberg-Marquardt method under local error bound conditions. Optim. Methods Softw. 2002, 17, 605–626. [Google Scholar] [CrossRef]
  21. Zeng, Z. A Newton’s iteration converges quadratically to nonisolated solutions too. Math. Comput. 2023, 92, 2795–2824. [Google Scholar] [CrossRef]
  22. Li, C.; Zhang, W.H.; Jin, X.Q. Convergence and uniqueness properties of Gauss-Newton’s method. Comput. Math. Appl. 2004, 47, 1057–1067. [Google Scholar] [CrossRef]
  23. Leon, S.J.; De Pillis, L.G. Gaussian Elimination. In Linear Algebra with Applications.; Pearson Education, Inc.: Upper Saddle River, NJ, USA, 2006; pp. 414–418. [Google Scholar]
  24. Zhang, Z. Introduction to machine learning: K-nearest neighbors. Ann. Transl. Med. 2016, 4, 218. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (a) Interferometer and stage at reference position. (b) Interferometer and stage at measurement position.
Figure 1. (a) Interferometer and stage at reference position. (b) Interferometer and stage at measurement position.
Applsci 13 10887 g001
Figure 2. Scheme of 3-DOF displacement measurement.
Figure 2. Scheme of 3-DOF displacement measurement.
Applsci 13 10887 g002
Figure 3. Measurement principle of 3-DOF displacement of X-Y stage with a 6-axis interferometer.
Figure 3. Measurement principle of 3-DOF displacement of X-Y stage with a 6-axis interferometer.
Applsci 13 10887 g003
Figure 4. Scheme of 6-DOF displacement measurement.
Figure 4. Scheme of 6-DOF displacement measurement.
Applsci 13 10887 g004
Figure 5. (a) Iteration error of x 1 . (b) Iteration error of x d 1 .
Figure 5. (a) Iteration error of x 1 . (b) Iteration error of x d 1 .
Applsci 13 10887 g005
Figure 6. (a) Detail of iteration error of x 1 . (b) Detail of iteration error of x d 1 .
Figure 6. (a) Detail of iteration error of x 1 . (b) Detail of iteration error of x d 1 .
Applsci 13 10887 g006
Figure 7. (a) Eigenvalues of covariance matrix. (b) Independent factors. The symbol * in the figure represents the data point.
Figure 7. (a) Eigenvalues of covariance matrix. (b) Independent factors. The symbol * in the figure represents the data point.
Applsci 13 10887 g007
Figure 8. Scheme of the simulation process.
Figure 8. Scheme of the simulation process.
Applsci 13 10887 g008
Figure 9. (a) Iteration error of x d and y d . (b) Iteration error of α .
Figure 9. (a) Iteration error of x d and y d . (b) Iteration error of α .
Applsci 13 10887 g009
Figure 10. (a) Detail of iteration error of x d and y d . (b) Detail of iteration error of α .
Figure 10. (a) Detail of iteration error of x d and y d . (b) Detail of iteration error of α .
Applsci 13 10887 g010
Figure 11. (a) Iteration error of x and y . (b) Iteration error of θ .
Figure 11. (a) Iteration error of x and y . (b) Iteration error of θ .
Applsci 13 10887 g011
Figure 12. (a) Detail of iteration error of x and y . (b) Detail of iteration error of θ .
Figure 12. (a) Detail of iteration error of x and y . (b) Detail of iteration error of θ .
Applsci 13 10887 g012
Figure 13. ERRs of the 48 unknowns.
Figure 13. ERRs of the 48 unknowns.
Applsci 13 10887 g013
Figure 14. (a) Simulation result of x with reading noise (RMS = 0.1 nm). (b) Simulation result of θ with reading noise (RMS = 0.1 nm).
Figure 14. (a) Simulation result of x with reading noise (RMS = 0.1 nm). (b) Simulation result of θ with reading noise (RMS = 0.1 nm).
Applsci 13 10887 g014
Table 1. Installation errors of lasers.
Table 1. Installation errors of lasers.
Laser Number123456
x d ( × 10−3 mm)−0.960.61−1.400.671.10−0.74
y d ( × 10−3 mm)−0.210.23−1.102.60−0.480.38
α ( × 10−3 mrad)0.51−1.200.09−1.70−1.000.91
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wen, T.; Hu, J.; Zhu, Y.; Hua, G. Multi-Axis Laser Interferometer Not Affected by Installation Errors Based on Nonlinear Computation. Appl. Sci. 2023, 13, 10887. https://doi.org/10.3390/app131910887

AMA Style

Wen T, Hu J, Zhu Y, Hua G. Multi-Axis Laser Interferometer Not Affected by Installation Errors Based on Nonlinear Computation. Applied Sciences. 2023; 13(19):10887. https://doi.org/10.3390/app131910887

Chicago/Turabian Style

Wen, Tingrui, Jinchun Hu, Yu Zhu, and Guojie Hua. 2023. "Multi-Axis Laser Interferometer Not Affected by Installation Errors Based on Nonlinear Computation" Applied Sciences 13, no. 19: 10887. https://doi.org/10.3390/app131910887

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop