1. Introduction
The development of missile control methods has long been an area of interest to scientists. The possibility of combining a missile homing system with its control system in one loop has recently drawn attention [
1]. Increasing missile accuracy is the major factor that necessitates the development of engineering methods based on integration of missile subsystems [
2].
Some of the control methods are based on the Riccati equation [
3] and are utilized for designing non-linear control systems. The operating effectiveness of the optimal LQR control method largely depends on the elements of the Q and R weight matrix. These matrices impact the minimization of offsets generated for state variables under the influence of control signals. Combining the PID regulator tuning method with the LQR concept enables the optimization of set value tracking, with the optimal selection of setpoints for the same control object. The optimal control theory has been extended in order to tune PID regulators [
4].
However, usually the Q and R weight matrices are selected via a trial-and-error method, which—apart from being time-consuming—is also onerous. Different variations (modifications) of LQR control have been used over the past decades, namely, hybrid LQR [
5,
6], fuzzy LQR [
7,
8] and switched LQR [
9,
10]. The author of [
11] presented a description of the use of a linear-quadratic regulator (LQR) for stabilizing the characteristics of an anti-aircraft missile and proposed an analytical method for selecting the weighting elements of the gain matrix in the feedback loop. A novel LQR tuning method using a single V parameter was proposed and tested. In [
12], the linear-quadratic regulator (LQR) was involved in controlling the roll angle of a missile autopilot system. The roll controllers were represented in the second order time domain system, and the results of LQR control were distinguished from the results of sliding mode control (SMC) and fuzzy logic control (FLC). Next, the behavior of roll control systems was analyzed to decide which controller gives better performance results with respect to the desired roll angle. In [
13], the authors used time as an independent variable; the motion equation of the center of mass of the target bomb was linearized, using the small disturbance method, to establish the center of mass motion of the space expressions; the LQR theory was used to design the trajectory tracking guidance law of the target projectile. The authors of [
14] also used the genetic algorithm in order to optimize the weight matrices linked with the optimal regulator, while simultaneously minimizing the ITSE (Integral of Time multiplied Squared Error) index and regulator efficiency. In [
15], a description was given of an application of the sliding mode control (SMC) for stabilizing the static and dynamic characteristics of an anti-aircraft missile. The solution provided an effective separation of the control process from the dynamics of the missile airframe. In the equivalent part of the stabilization system, a linear-quadratic regulator (LQR) was considered, and an analytical method of selecting the weighting elements of the gain matrix was proposed.
Owing to the progress in the development of computer systems, numerical computations enable designing ever newer LQR optimization methods and controlling multidimensional systems. In addition, LQR control has been successfully used in numerous complex situations, such as the double inverted pendulum, fuel cell systems and aircraft [
16]. The LQR method belongs to the group of modern control theory methods, which analyze object model within the state space. Utilizing state space methods is relatively straightforward in the case of a multidimensional system. Taking the mathematical model into account, the selection of the control method for a tested object can be divided into a method which operates based on a mathematical model and a model-less method. The LQG optimal control method is a typical model-based method. In [
17], the ship model was kept to a maneuvering trajectory through a combination of feedforward and LQG feedback control. The variances and weighting coefficients for the LQG controller were chosen systematically. On the other hand, control methods based on a fuzzy regulator and utilizing artificial neural networks are methods that do not use the object model.
LQR optimization methods have almost unlimited use. They can be successfully used in various tasks. They can be used to improve the process of making composites by infusion. In [
18], authors made construction materials from laminates A, B and C with sandwich filling, using vacuum bagging, which confirmed the possibility of absorbing energy using LQR. Work is underway to optimize the selection of the appropriate layer thickness to increase the absorbed energy.
In order to choose the control method, one needs to start with analyzing the plant dynamic behavior. The dynamics of the discussed object (i.e., an anti-tank guided missile) are the change rate of its state variables, depending on its fin deflections. This relationship can, of course, be expressed in the form of a series of differential equations, also called flying object motion equations. The main goal of our article is to develop and model an innovative control actuation system for an anti-tank guided missile based on the use of the thrust vectorization, i.e., controlling the direction of the resultant thrust by deflecting the engine nozzle. The modified LQR method, which is very similar to—and at the same time an improvement on—the SDRE control method, also known in the literature, is the only way to achieve effective control of this type of innovative missile. In this paper, the equations of motion of a flying object simulate the real motion of an anti-tank missile, and the linearized form of the dynamics equations is used to calculate the control signals of the control actuation system. Control signals are determined by the modified LQR method control. While in [
19], the authors the proposed dual-control missiles, which have sets of aerodynamic fins both in front of and behind the missile center of mass, in this paper we consider a hybrid Control Actuation System (hCAS) consisting of aerodynamic surfaces and a moving missile engine nozzle, which constitutes the essence of the research for this article.
2. Model of a Hypothetical Anti-Tank Guided Missile
The controlled plant is a subsonic, hypothetical anti-tank guided short- or medium-range missile of the “fire and forget” type (
Figure 1). Its in-flight control is achieved through a double control actuation system, which consists of two pairs of aerodynamic controls (rudders and elevators) in the front section of the missile and a tilting rocket engine nozzle in the rear section (
Figure 2). The tilting nozzle changes the angular position of the rocket engine thrust vector resultant, relative to the missile’s longitudinal axis
Sx. The application of a hybrid control actuation system significantly improves missile maneuverability, as reviewed in [
20].
Figure 1 uses the following symbols and designations:
—force of gravity;
—vector of resultant aerodynamic forces;
—control forces;
—engine thrust;
—vector of missile velocity;
—ground-fixed coordinate system;
—coordinate system connected with the flow;
—coordinate system connected with the missile;
—angle of attack;
—missile sideslip angle;
and
—angles of the velocity vector (flight-path angles);
—angular velocity components in the body-fixed system.
Figure 2 uses the following symbols and designations:
—engine thrust;
—thrust projection on the vertical axis
;
—thrust projection on the horizontal axis
;
—thrust projection on the longitudinal axis of the ATGM
;
—thrust projection on the vertical plane
;
—thrust projection on the horizontal plane
;
,
—control angle in the horizontal plane (change yaw);
,
—control angle in the vertical plane (change pitch).
Deflections of the rocket engine thrust vector relative to the ATGM by
,
(
Figure 2a) generate control forces across both planes. The flight altitude control force
appears in the vertical plane, whereas the flight direction control force
is generated in the plane perpendicular to the vertical plane passing through the missile’s axis of symmetry [
21].
The trigonometric properties indicate that:
;
—across the horizontal plane
;
;
—across the vertical plane
. Control forces
and
have been derived in detail in [
21] and are equal to:
The aerodynamic controls (rudders and elevators) are located in the front of the missile (
Figure 2b). A lightweight composite rudder and elevator structure [
22] provides rapid response to control signals. In the case of low deflection angles of the aerodynamic controls
and subsonic velocities, the lift and lateral (control) forces
generated on the aerodynamic control surface adopt a simplified form (3, 4). A pair of rudders and elevators in the front part of the missile can be described by the following formulas:
where:
—area of surfaces (
);
—air density (
);
—the actual deflection angles of the aerodynamic control surfaces.
The control object is guided onto a target using a thermal imaging head [
23,
24] or a TV head in the case of a medium-range missile, which enables the “fire and correct” mode. It is assumed that a homing head operates correctly, and the missile location data relative to the target is known. Furthermore, it is assumed that the ATGM does not rotate relative to the axis
Sx.
It is assumed that the ATGM mass changes as a result of rocket engine fuel burnout. Thus, the center of mass shifts and the moments of inertia change. These changes are expressed using time-dependent linear functions (
Figure 3).
Figure 3 uses the following designations:
—initial mass of the ATGM = 13.287 kg;
—mass of the ATGM after consumption of fuel = 8.7 kg;
—initial position of the center of mass of ATGM = 0.5705 m (when viewed from the front of the ATGM);
—position of the center of mass of the ATGM after consumption of fuel = 0.434 m (when viewed from the front of the ATGM);
—initial moment of inertia of ATGM in relation to x axis = 0.0396 kg × m2;
—moment of inertia of the ATGM in relation to x axis after the consumption of fuel = 0.026 kg × m2;
—initial moment of inertia of the ATGM in relation to the transverse axis = 1.7823 kg × m2;
—moment of inertia of the ATGM in relation to the transverse axis after the consumption of fuel = 1.2078 kg × m2;
The full spatial flight equations of dynamics for any flying object as a rigid body can be found, among others, in [
25,
26]. The following equations were used for this study:
where:
—engine thrust;
,
—distance from the center of mass of aerodynamic controls and outlet gas nozzles, correspondingly;
g—acceleration due to gravity;
—coordinates of aerodynamic control center (when viewed from the front of ATGM);
—length of the ATGM body;
—main central moments of inertia in relation to the axes of the related system;
—mass of the missile;
and
—pitch and yaw angle of the missile body;
,
,
—coefficients of aerodynamic forces;
—relative aerodynamic factors of forces and moment;
—lift and drift moment factors;
,
—dumping moment factors from the ATGM body angular velocities (constants values were adopted) [
27].
The components of the angular velocity vector of a system related to the missile take the following forms (assuming that
):
The projections of the linear velocity vector onto individual axes of a system related to the missile
can be expressed in the form of a relationship:
The angle of attack and missile sideslip angle are determined from the following equations:
3. The Control Algorithm in the Modified LQR Method
The general idea of programmed trajectory has been taken from [
25]. Determining a programmed (preset) trajectory involves finding a third-degree polynomial function, which joins the initial point and the endpoint of a flight trajectory segment. These points are called waypoints. There can be several such trajectory segments. The last segment connects the ATGM with the target. The selection of appropriate control angles is such that the ATGM flight covers the programmed trajectory. A modified linear-quadratic regulator is used for this purpose.
ATGM flight equations of dynamics are highly non-linear. Their linearization requires major simplifications. This causes significant errors. Due to the mass changing over time (because of intensive fuel consumption) and due to the dynamic coefficients’ dependance on the Mach number
Ma, we are dealing with a non-stationary system. The authors of [
21] have shown that a linear-quadratic regulator can be used to control such objects; however, when the state
A and control
B matrices are constant over time, the mathematical model becomes far from true. The modified LQR method removes this issue since it determines the control angles for non-linearized flight equations of dynamics (5). The control signals determined by the regulator in question are the deflection angles of the aerodynamic control surface (
,
) and rocket engine nozzle angles (
,
).
It was assumed that the aerodynamic control surface and rocket engine nozzle deflection angles have the same value but opposite directions. Therefore, we have
in the vertical plane and
in the direction control plane, which results in the following formulas:
In a stationary state, at a given operating point, equations of dynamics take the form of a linear function. In the discussed case, the operating point of a system consists of subsequent missile flight positions when homing onto a target (executing a firing task). The operating point is updated every 100
th integration step, at a regulator operating frequency of 1000 Hz. As a result of decimation, the equations of dynamics are linearized every 0.1 s around the consecutive operating point. It should be emphasized that a control algorithm based on the modified LQR collects pseudo-real flight parameters from the equation of dynamics (5) to calculate control signals (
Figure 4). Calculated signals control a hypothetical missile model described by Equations (5).
Equations of dynamics (5) have been reduced to the form of state Equation (10). State equations constitute a part of the regulator used to calculate the Jacobians and control matrix. The following simplifications have been introduced for the purpose of generating control signals using a modified LQR method
:
The equation system (10) can be expressed in the form of a vector-matrix equation
where:
;
—vector of state variables deviations from the required value;
—vector of actual state variables;
—vector of state variables at work point, i.e., vector of required values of state variables;
—vector of forcing (control of) the system;
—Jacobian, system state matrix, calculated within each sample time;
—control matrix.
It should be noted that calculating the Jacobian determinants at any moment in time enables the shifting of the operating point to the current ATGM location. However, such a solution consumes a large share of the autopilot’s computing power, which can result in reduced operating effectiveness. This is why the Jacobian J, control matrix B and operating point are updated periodically (i.e., 0.1 s) during the tests. The data needed for LQR method control are taken directly from the mathematical model expressed in the form of equations of dynamics, described by the formulas (5). Via a new Jacobian J and control matrix B, the gain matrix K is determined for the LQR.
Let us express the Jacobian of the system described by Equation (10) in the following form:
The matrix J is a Jacobian of a system of Equation (10) and consists of partial derivatives calculated for each operating point .
The right of control
was determined and minimizes the quadratic quality index:
where:
—positively semi-definite symmetric matrix;
—positively definite symmetric matrix;
tk—simulation end time.
In essence, we are dealing with an LQR finite horizon problem. However, the cost function (14) is written without the third part responsible for the state deviation at termination time. This is because it is much more important to maintain the control accuracy for the entire time interval of a given linearization of the system. Moreover, the problem can be solved using the algebraic Riccati equation instead of the Riccati differential equation, which saves computing power in embedded systems. For research and simulation purposes, the lqr function from the MATLAB Control System Toolbox [
28] was used.
The LQR gain matrix was determined based on dynamic equations, and it allows us to minimize the quadratic quality index (14):
The minimization process involves solving an algebraic Riccati equation, which can be determined using the lqr function in Matlab [
29].
Matrices
Q,
R are selected for two time intervals. The first time interval is the time when the launch motor is on (
) and the thrust is 4.3 kN. The second interval is when the main engine is on (
), the thrust of which is 280 N. The stability of the control system when switching between different gains K is ensured by appropriate software limitations. Appropriate limitations simulating the real movement of the actuators are introduced in the simulation. The first limitation concerns the deflection angles of the aerodynamic control surfaces (from −16 deg to 16 deg). The second limitation concerns the velocity of the actuators. The third is the limited forces generated by the control actuation systems. The operating stability of the control system used is ensured by the selection of LQR regulator settings, which are responsible for the determination of the
Q and
R weights matrices. As is well known, the LQR algorithm does not have a universal method for selecting the above parameters, and they are usually iteratively selected. In this paper, when selecting the initial values of
Q and
R matrices, the authors used the Bryson [
30] rule, which allowed for determining the selection of the following input parameters:
where:
i—another element of the state vector;
—the maximum values for individual elements of the state vector
x;
—the maximum control moments. The maximum operating parameters of the rocket missile were determined using the Lapunov method, and they are, respectively:
It was assumed that the linear system is in a state of equilibrium. The objective of control was to maintain control at a preset operating point, despite the disturbances affecting it. In this case, the ATGM moves, and the operating point is not constant. The state of equilibrium depends on the offset
. Optimal control takes the form:
After substituting Equations (18) to (11), we get:
The vector of set state variables was derived from data calculated based on the programmed trajectory.
The programmed trajectory of flight within the vertical and horizontal planes are third-degree polynomials
, used to calculate the set angle of control for the altitude and flight direction
:
In addition, the set variable vector was expanded with an angular correction of altitude and direction. The angular position correction results from trigonometric properties. It has the following form:
where:
—polynomial coefficients calculated according to the control algorithm [
23];
;
—programmed trajectory coordinates;
—real ATGM coordinates;
—projected distance necessary to correct the ATGM flight—assumed as:
m.
5. Final Conclusions, Direction of Further Studies
An algorithm of a modified LQR method utilizing a Jacobian was developed to control the hypothetical missile. The modified regulator calculates control signals (aerodynamic surface deflection angles and the rocket engine thrust vector resultant) based on data from pseudo-real equations of dynamics. The data (state variables) relating to the flight of the missile, such as flight speed, flight angles, angular velocities and spatial position, enable the correct operation of the proposed algorithm; i.e., they are signals required to determine the control signals, which enable the missile to follow the set trajectory.
Modified LQR system model linearization, i.e., calculating the new state matrix and control matrix values takes place at a lower frequency than that at which the regulator itself operates. The linearization frequency is 10 Hz, while the regulator operating frequency is 1000 Hz. This enables the autopilot computing unit to save a significant amount of power, while simultaneously preserving sufficient conformity of the dynamics model with the pseudo-real object in the area of a given operating point. The conducted simulations indicate that the discussed model works correctly. The ATGM follows a set trajectory and precisely hits a moving ground target.
The application of the time-varying elements of the J (t) and B (t) matrices allowed the linearization of the flight equations of dynamics in the course of subsequent integration steps. Thus, the dynamic nature of the flight equations was preserved. In practice, a Jacobian in the form of a matrix J is used—with the matrix containing partial derivatives. This results in the possibility of controlling a highly non-linear and non-stationary object via an LQR.
This article demonstrates that using a linear-quadratic regulator (LQR) for controlling an anti-tank guided missile is possible and enables the smooth maneuvering of the missile, until it reaches a ground target. In addition, this method will be applied in further research work on using a state observer (in the case of a lack of access to all state variables) or an extended Kalman filter, especially in the conditions of random ATGM disturbances.