Article

Robust Tracking as Constrained Optimization by Uncertain Dynamic Plant: Mirror Descent Method and ASG—Version of Integral Sliding Mode Control

by
Alexander Nazin
1,*,†,
Hussain Alazki
2,† and
Alexander Poznyak
3,†
1
V. A. Trapeznikov Institute of Control Sciences, Russian Academy of Sciences, Moscow 117997, Russia
2
Facultad de Ingeniería, Universidad Autónoma del Carmen (UNACAR), Playa del Carmen 24180, Mexico
3
Automatic Control Department, Centro de Investigacion y Estudios Avanzados del Instituto Politecnico Nacional, Ciudad de Mexico 07360, Mexico
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2023, 11(19), 4112; https://doi.org/10.3390/math11194112
Submission received: 18 July 2023 / Revised: 21 September 2023 / Accepted: 26 September 2023 / Published: 28 September 2023
(This article belongs to the Special Issue Dynamics and Control Theory with Applications)

Abstract

A class of controlled objects is considered whose dynamics are governed by a vector system of ordinary differential equations with a partially known right-hand side. It is presumed that the state variables and their velocities can be measured. The research goal is to design a robust tracking controller under constraints on the admissible state variables. This construction, which extends the results of the average subgradient technique (ASG) and updates the subgradient descent method (SDM) and the integral sliding mode (ISM) approach, is realized by using the Legendre–Fenchel transform. A two-link robot manipulator with three revolute joints, powered by individual PMDC motors, is presented as an illustrative example of the suggested approach.

1. Introduction

1.1. Brief Survey

Constrained optimization is the process of optimizing an objective function with respect to some variables, subject to constraints on those variables. The objective function is either a cost function or energy function to be minimized, or a reward function or utility function to be maximized. Constraints can be either hard constraints, which set conditions on the variables that must be satisfied, or soft constraints, which have some variable values that are penalized in the objective function if and depending on the extent to which the conditions on the variables are not satisfied (see, for example, [1,2,3,4,5,6]).
In most publications, control strategies of this kind, treated as Static Optimization Methods (SOM) in continuous time, may be represented in the following form:
$$
F(x_t) \xrightarrow[t \to \infty]{} F^{*} := \min_{x \in X_{\mathrm{adm}} \subseteq \mathbb{R}^n} F(x),
$$
where $F : \mathbb{R}^n \to \mathbb{R}$ is a convex (not necessarily strongly convex) mapping, $X_{\mathrm{adm}}$ is the admissible convex set of arguments, and the process $x_t$ is generated by the simple ordinary differential equation (ODE)
$$
\dot{x}_t = u_t, \qquad x_0 \text{ is fixed}, \qquad t \geq 0,
$$
with any initial condition $x_0 \in \mathbb{R}^n$. The relation (1) is referred to hereafter as a static plant. All known SOM procedures differ only in the design of the control action $u_t$ (i.e., of the optimization algorithm) as a function of the current state $x_t$ (Markov strategy) or of the whole available history, namely, $u_t = u\big(t, \{x_\tau\}_{\tau \in [0,t]}\big)$.
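As a minimal illustration of such an SOM (with names and numbers of our own choosing, not taken from the paper), the sketch below integrates the static plant $\dot{x}_t = u_t$ under the classical subgradient choice $u_t = -a(x_t)$, $a(x_t) \in \partial F(x_t)$, by explicit Euler steps:

```python
import numpy as np

# Hypothetical convex cost F(x) = ||x||_1 and one of its subgradients.
F = lambda x: np.sum(np.abs(x))
subgrad = lambda x: np.sign(x)          # a(x) in dF(x)

dt, T = 1e-3, 5.0                       # Euler step and horizon (assumed values)
x = np.array([1.0, -2.0, 0.5])          # x_0 (arbitrary)

for _ in range(int(T / dt)):
    u = -subgrad(x)                     # Markov strategy u_t = u(x_t)
    x = x + dt * u                      # static plant x_dot = u

print(F(x))                             # close to the minimal value F* = 0
```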
Here we consider a more general, and hence more complex, situation in which the process $x_t$ is generated by the dynamic plant
$$
\ddot{x}_t = f(t, x_t, \dot{x}_t) + u_t, \qquad x_0, \dot{x}_0 \text{ are fixed}, \qquad t \geq 0, \qquad x_t, u_t \in \mathbb{R}^n,
$$
where the vector function $f$ on the right-hand side is supposed to be unknown but to belong to some class $\mathcal{C}$ of nonlinearities.
This problem is closer to the so-called Extremum Seeking Problem [7,8,9,10], which includes first-order derivatives only. Thus, in [11], several optimization schemes are considered, and it is shown that under appropriate conditions these schemes reach the extremum point from an arbitrarily large domain of initial conditions if the controller parameters are appropriately adjusted. This approach was applied in [12] to two-level economic optimization of a plant. Many advanced process control systems use some form of model predictive control [13,14]. Article [15] describes a new algorithm for finding an extremum using stochastic online gradient estimation. Article [16] considers the constrained optimization problem in dynamic linear time-invariant (LTI) systems in which the dimension of the control vector is smaller than the dimension of the state vector; convergence in finite time to a neighborhood of order $\varepsilon$ of the optimal equilibrium point is proven. Ref. [17] presents variable structure convex programming control for a class of linear uncertain systems with accessible state.
It is also crucial to discuss the relationship between the problem under consideration and the Model Predictive Control (MPC) approach, commonly referred to as moving or receding horizon control; see the two well-known surveys [18,19] as well as some recent papers on Robust MPC (RMPC) [20,21,22,23]. MPC is a group of model-based control methods that anticipate system behavior using linear or nonlinear process models. The MPC control performance depends on the quality of the open-loop predictions, which in turn depends on the accuracy of the process models. The predicted trajectory may not match the behavior of the real plant. The mismatch between the plant and the model, usually referred to as model uncertainty, may make the performance of the control system sluggish or even unstable. Robust predictive controllers are those that explicitly take the process and model uncertainty into account while establishing the best control strategies. Similar to H-infinity controllers, the core principle behind these controllers is to minimize the worst possible disruption to the behavior of the process. This idea is very different from the one suggested in this paper:
  • The prediction process cannot be accomplished precisely, since the right-hand side of the ODE representing the object model is considered to be unknown (only the dimensions of the states and of the control are available);
  • Because the control action must be implemented online in real time using feedback (not open-loop control), it is difficult to repeat and test the produced trajectories for the various potential uncertainties.
In this paper, we consider a class of controlled plants whose dynamics are governed by a vector system of second-order ordinary differential equations (ODE) with an unknown right-hand side. All mechanical Lagrange models belong to this class. The state variables and their velocities are assumed to be measurable. We design a controller minimizing a loss function subject to a set of constraints on the state of the controlled plant. The designed control action is allowed to depend only on the current subgradients of the loss function and of the constraints, which are also supposed to be measurable online. The control is designed based on the SDM (subgradient descent method) version [24,25] of the integral sliding mode (ISM) concept [26,27,28], aimed at minimizing "on average" a given convex (not necessarily strongly convex) cost function of the current state under a set of given constraints. An optimization-type algorithm is developed and analyzed using ideas from the SDM technique [1]. We prove the reachability of the "desired regime" (a nonstationary analogue of the sliding surface) [27] from the very beginning of the process and obtain an explicit upper bound for the cost function decrement; that is, the convergence is proven and the rate of convergence is estimated as $O(t^{-1})$. This paper generalizes the approach suggested in [29] for unconstrained dynamic optimization to the constrained optimization problem realized by an uncertain second-order dynamic plant.

1.2. Main Contributions

  • The robust tracking problem is reformulated as a constrained optimization realized by a dynamic plant with an unknown (but bounded) right-hand side. By "robust tracking" we mean two distinct features connected with imperfect a priori knowledge: the exact plant model and the tracking trajectory are unavailable, yet the robust controller should nevertheless operate successfully. It is only necessary to measure the states and the corresponding velocities online.
  • The cost function as well as the constraints are only assumed to be convex, not necessarily strictly or strongly convex.
  • The mirror descent method (MDM) and the ASG version of sliding mode control are suggested and realized.
  • Convergence of the trajectories of the controlled uncertain plant to the admissible zone close to the minimal point is established.

2. Uncertain Plant Description and Admitted Dynamic Zone

2.1. Dynamic Model

The second-order dynamic model (2) can be represented in the following extended format
$$
\begin{pmatrix} \dot{x}_{1,t} \\ \dot{x}_{2,t} \end{pmatrix} = \begin{pmatrix} x_{2,t} \\ f(t, x_{1,t}, x_{2,t}) \end{pmatrix} + \begin{pmatrix} 0_{n\times n} \\ I_{n\times n} \end{pmatrix} u_t, \qquad x_{1,t_0} = \mathring{x}_1 \in \mathbb{R}^n, \quad x_{2,t_0} = \mathring{x}_2 \in \mathbb{R}^n, \quad u_t \in \mathbb{R}^n.
$$
Here the extended state variables $x_{1,t} = x_t$ and $x_{2,t} = \dot{x}_t$ are the current coordinates and velocities at time $t \geq 0$. The function $f(t, x_{1,t}, x_{2,t})$ is piecewise continuous in all arguments and is allowed to be unknown, but it is bounded as
$$
\| f(t, x_1, x_2) \| \leq k_x(x_1, x_2) := c_0 + c_1 \|x_1\| + c_2 \|x_2\|
$$
with finite positive constants $c_0$, $c_1$, and $c_2$. Hereafter the symbol $\|\cdot\|$ denotes the Euclidean norm.

2.2. Reference Trajectory, Tracking Error Dynamics, and Admissible Zone

The aim of the controller (to be formulated precisely below) is to make the state $x_t$ track a given reference trajectory $\{x^{*}_t\}_{t \geq 0}$. Define the tracking error $\delta_{1,t}$ as
$$
\delta_{1,t} := x_{1,t} - x^{*}_{1,t}, \qquad \delta_{2,t} = \dot{\delta}_{1,t} = x_{2,t} - x^{*}_{2,t},
$$
where $x^{*}_{1,t}$ is the continuously differentiable trajectory to be tracked, satisfying
$$
\dot{x}^{*}_{1,t} = x^{*}_{2,t} = \varphi(t, x^{*}_{1,t}), \qquad t \geq 0, \qquad x^{*}_{1,0} \text{ is known}.
$$
In view of that, the error tracking dynamics can be represented as follows
$$
\begin{pmatrix} \dot{\delta}_{1,t} \\ \dot{\delta}_{2,t} \end{pmatrix} = \begin{pmatrix} \delta_{2,t} \\ f_{\delta}(t, \delta_{1,t}, \delta_{2,t}) \end{pmatrix} + \begin{pmatrix} 0_{n\times n} \\ I_{n\times n} \end{pmatrix} u_t, \qquad f_{\delta}(t, \delta_{1,t}, \delta_{2,t}) := f\big(t, \delta_{1,t} + x^{*}_{1,t}, \delta_{2,t} + x^{*}_{2,t}\big) - \dot{x}^{*}_{2,t}.
$$
Let us require that, after some time $t_0 \geq 0$, the dynamics of $\delta_{1,t}$ evolve within a bounded admissible zone $D_{\mathrm{adm}}$.
This paper’s primary objective is to build a control that minimizes the tracking error $\delta_1$. This can be represented as the minimization of an assumed convex loss function $F(\delta_1)$. For example, the class of convex loss functions under consideration includes the following functions:
$$
1)\; F(\delta_1) = \sum_{i=1}^{n} |\delta_{1,i}|, \qquad 2)\; F(\delta_1) = \sum_{i=1}^{n} \big[\, |\delta_{1,i}| - \varepsilon \,\big]_{+}, \qquad \big[\, |z| - \varepsilon \,\big]_{+} := \begin{cases} z - \varepsilon & \text{if } z \geq \varepsilon, \\ -z - \varepsilon & \text{if } z \leq -\varepsilon, \\ 0 & \text{if } |z| < \varepsilon. \end{cases}
$$
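For concreteness, a minimal sketch of these two loss functions and of one valid subgradient for each (the function names are ours; this is only an illustration):

```python
import numpy as np

def F1(delta1):
    # 1) sum of absolute deviations
    return np.sum(np.abs(delta1))

def F2(delta1, eps):
    # 2) epsilon-insensitive ("dead zone") version: sum of [|z| - eps]_+
    return np.sum(np.maximum(np.abs(delta1) - eps, 0.0))

def subgrad_F1(delta1):
    # a(delta1) in dF1(delta1); at zero any value in [-1, 1] is admissible, 0 is chosen
    return np.sign(delta1)

def subgrad_F2(delta1, eps):
    # zero inside the dead zone |delta_1i| < eps, +/-1 outside
    return np.where(np.abs(delta1) > eps, np.sign(delta1), 0.0)
```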

2.3. Basic Assumptions

A1
The current states $x_t$, $\dot{x}_t$ of the plant (3) are supposed to be measurable (available) online for all $t \geq 0$.
A2
The function $f(t, x_t, \dot{x}_t)$, satisfying (4), is piecewise continuous in all arguments and is allowed to be unknown.
A3
The current states $x^{*}_t$, $\dot{x}^{*}_t$ of the reference trajectory are also supposed to be available online for any $t \geq 0$.
A4
Here we assume that the subgradient (recall that a vector $a(x) \in \mathbb{R}^n$ satisfying the inequality $F(x+y) \geq F(x) + a(x)^{\top} y$ for all $y \in \mathbb{R}^n$ is called a subgradient of the function $F(x)$ at the point $x \in \mathbb{R}^n$ and is denoted by $a(x) \in \partial F(x)$, where $\partial F(x)$ is the set of all subgradients of $F$ at the point $x$; if $F(x)$ is differentiable at a point $x$, then $a(x) = \nabla F(x)$; at the minimal point $x^{*}$ we have $0 \in \partial F(x^{*})$) of the loss function $F(\delta_{1,t})$ is available online at the current time $t \geq 0$, and the set of minimizers $\delta_1^{*}$ of $F(\cdot)$ on the set $D_{\mathrm{adm}}$ includes the origin $\delta_1 = 0$; that is,
$$
0 \in \operatorname*{Arg\,min}_{\delta_1 \in D_{\mathrm{adm}}} F(\delta_1).
$$
A5
The admissible set $D_{\mathrm{adm}}$ is a nonempty convex compact set, i.e., $D_{\mathrm{adm}} \neq \emptyset$.

3. Desired Dynamics

3.1. Mirror Descent Method in Continuous Time

Let us apply the mirror descent approach using the Legendre–Fenchel transformation [30] as follows. For any $\zeta \in \mathbb{R}^n$ define
$$
U^{*}(\zeta) = \max_{z \in D_{\mathrm{adm}}} \big( \zeta^{\top} z - U(z) \big), \qquad U(z) = \tfrac{1}{2}\|z\|^{2},
$$
so that (see, for instance, [31,32])
$$
\nabla U^{*}(\zeta) = \arg\max_{\delta_1 \in D_{\mathrm{adm}}} \big( \zeta^{\top}\delta_1 - U(\delta_1) \big).
$$
Define the dynamics of the vector function $\zeta_t \in \mathbb{R}^n$ as
$$
\dot{\zeta}_t = -a(\delta_{1,t}), \quad a(\delta_{1,t}) \in \partial F(\delta_{1,t}), \quad \zeta_{t_0} = 0; \qquad (t+\theta)\,\dot{\delta}_{1,t} + \delta_{1,t} = \nabla U^{*}(\zeta_t - \eta), \qquad t \geq t_0 \geq 0, \quad \eta \in \mathbb{R}^n.
$$
Remark 1. 
The second differential equation in (11) can be integrated as follows
$$
(t+\theta)\,\delta_{1,t} - (t_0+\theta)\,\delta_{1,t_0} = \int_{\tau=t_0}^{t} \nabla U^{*}(\zeta_\tau - \eta)\, d\tau,
$$
$$
\delta_{1,t} = \lambda_t\,\delta_{1,t_0} + (1-\lambda_t)\,\frac{1}{t-t_0}\int_{\tau=t_0}^{t} \nabla U^{*}(\zeta_\tau - \eta)\, d\tau \;\in\; D_{\mathrm{adm}}, \qquad \lambda_t := \frac{t_0+\theta}{t+\theta}.
$$
Therefore, $\delta_{1,t} \in D_{\mathrm{adm}}$ for all $t \geq t_0$ because of convexity and due to (9) and (10).
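A minimal numerical sketch of the desired dynamics (11), assuming the Euclidean $r$-ball as $D_{\mathrm{adm}}$ (the case worked out in Example 1 and Remark 3 below) and the loss $F(\delta_1) = \sum_i |\delta_{1,i}|$; all names and numbers here are our own illustrative choices:

```python
import numpy as np

r, theta, eta = 0.6, 1e-3, np.zeros(3)       # ball radius, regularizer, eta (assumed)
F = lambda d: np.sum(np.abs(d))
subgrad = lambda d: np.sign(d)                # a(delta_1) in dF(delta_1)

def grad_U_star(z, r):
    # For D_adm = {||d|| <= r}: grad U*(z) = z if ||z|| <= r, else r z / ||z||
    nz = np.linalg.norm(z)
    return z if nz <= r else (r / nz) * z

dt = 1e-4
zeta = np.zeros(3)
delta1 = np.array([0.4, -0.3, 0.2])           # delta_{1,t0} inside D_adm (assumed)
t = 0.0                                       # t0 = 0

for _ in range(int(2.0 / dt)):
    # zeta_dot = -a(delta_1);  (t + theta) delta1_dot + delta1 = grad U*(zeta - eta)
    zeta = zeta - dt * subgrad(delta1)
    delta1 = delta1 + dt * (grad_U_star(zeta - eta, r) - delta1) / (t + theta)
    t += dt

# final loss is small, consistent with the O(1/t) bound of Theorem 1,
# and delta1 remains inside D_adm
print(F(delta1), np.linalg.norm(delta1) <= r)
```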

3.2. Why the Dynamics $\delta_{1,t}$ Are Desired

The following theorem explains why the dynamics $\delta_{1,t}$ may be considered as desired.
Theorem 1. 
Under Assumptions A1–A5, the trajectories $\delta_{1,t}$ generated by (11) satisfy, for all $t \geq t_0 \geq 0$, the following property:
$$
F(\delta_{1,t}) \leq F(\delta_1^{*}(\eta)) + \frac{t_0+\theta}{t+\theta}\,\big[ F(\delta_{1,t_0}) - F(\delta_1^{*}(\eta)) \big],
$$
where
$$
\delta_1^{*}(\eta) = \arg\min_{\delta_1 \in D_{\mathrm{adm}}} \big( \eta^{\top}\delta_1 + U(\delta_1) \big).
$$
Proof. 
Defining $\mu_t := t + \theta$ and $\delta_1^{*} := \delta_1^{*}(\eta)$, we have from (11)
$$
\frac{d}{dt}\Big[ U^{*}(\zeta_t-\eta) - (\zeta_t-\eta)^{\top}\delta_1^{*} \Big] = \dot{\zeta}_t^{\top}\big[ \nabla U^{*}(\zeta_t-\eta) - \delta_1^{*} \big] = -a(\delta_{1,t})^{\top}\big[ \mu_t\,\dot{\delta}_{1,t} + \delta_{1,t} - \delta_1^{*} \big] = -a(\delta_{1,t})^{\top}\big( \delta_{1,t} - \delta_1^{*} \big) - \mu_t\, a(\delta_{1,t})^{\top}\dot{\delta}_{1,t}.
$$
Due to the convexity of $F(\delta_1)$, we have
$$
a(\delta_{1,t})^{\top}\big( \delta_{1,t} - \delta_1^{*} \big) \geq F(\delta_{1,t}) - F(\delta_1^{*}),
$$
and in view of the relation
$$
a(\delta_{1,t})^{\top}\dot{\delta}_{1,t} = \frac{d}{dt}\, F(\delta_{1,t}),
$$
it follows
$$
\frac{d}{dt}\Big[ U^{*}(\zeta_t-\eta) - (\zeta_t-\eta)^{\top}\delta_1^{*} \Big] = \dot{\zeta}_t^{\top}\big[ \nabla U^{*}(\zeta_t-\eta) - \delta_1^{*} \big] = -a(\delta_{1,t})^{\top}\big[ \mu_t\,\dot{\delta}_{1,t} + \delta_{1,t} - \delta_1^{*} \big] \leq -\big[ F(\delta_{1,t}) - F(\delta_1^{*}) \big] - \mu_t\, a(\delta_{1,t})^{\top}\dot{\delta}_{1,t},
$$
or equivalently,
$$
\frac{d}{dt}\Big[ U^{*}(\zeta_t-\eta) - (\zeta_t-\eta)^{\top}\delta_1^{*} \Big] \leq -\big[ F(\delta_{1,t}) - F(\delta_1^{*}) \big] - \mu_t\, \frac{d}{dt}\, F(\delta_{1,t}).
$$
After integration, we obtain
$$
\begin{aligned}
\int_{\tau=t_0}^{t} \big[ F(\delta_{1,\tau}) - F(\delta_1^{*}) \big]\, d\tau
&\leq -\Big[ U^{*}(\zeta_\tau-\eta) - (\zeta_\tau-\eta)^{\top}\delta_1^{*} \Big]\Big|_{\tau=t_0}^{\tau=t} - \int_{\tau=t_0}^{t} \mu_\tau\, \frac{d}{d\tau}\big[ F(\delta_{1,\tau}) - F(\delta_1^{*}) \big]\, d\tau \\
&= -\Big[ U^{*}(\zeta_t-\eta) - (\zeta_t-\eta)^{\top}\delta_1^{*} \Big] + U^{*}(-\eta) + \eta^{\top}\delta_1^{*} - \Big[ \mu_\tau \big( F(\delta_{1,\tau}) - F(\delta_1^{*}) \big) \Big]\Big|_{\tau=t_0}^{\tau=t} + \int_{\tau=t_0}^{t} \big[ F(\delta_{1,\tau}) - F(\delta_1^{*}) \big]\, d\tau,
\end{aligned}
$$
which implies
$$
\mu_t \big[ F(\delta_{1,t}) - F(\delta_1^{*}) \big] \leq -\Big[ U^{*}(\zeta_t-\eta) - (\zeta_t-\eta)^{\top}\delta_1^{*} \Big] + U^{*}(-\eta) + \eta^{\top}\delta_1^{*} + \mu_{t_0}\big[ F(\delta_{1,t_0}) - F(\delta_1^{*}) \big].
$$
Using (9), we obtain
$$
U^{*}(\zeta_t-\eta) \geq (\zeta_t-\eta)^{\top}\delta_1^{*} - U(\delta_1^{*}),
$$
$$
-\Big[ U^{*}(\zeta_t-\eta) - (\zeta_t-\eta)^{\top}\delta_1^{*} \Big] \leq U(\delta_1^{*}) = \tfrac{1}{2}\|\delta_1^{*}\|^{2},
$$
and
$$
\mu_t \big[ F(\delta_{1,t}) - F(\delta_1^{*}) \big] \leq \tfrac{1}{2}\|\delta_1^{*}\|^{2} + U^{*}(-\eta) + \eta^{\top}\delta_1^{*} + \mu_{t_0}\big[ F(\delta_{1,t_0}) - F(\delta_1^{*}) \big].
$$
Since by (9) and (10)
$$
\nabla U^{*}(-\eta) = \arg\max_{\delta_1 \in D_{\mathrm{adm}}} \big( -\eta^{\top}\delta_1 - U(\delta_1) \big), \qquad U(\delta_1) = \tfrac{1}{2}\|\delta_1\|^{2},
$$
and defining
$$
\delta_1^{*}(\eta) := \arg\max_{\delta_1 \in D_{\mathrm{adm}}} \big( -\eta^{\top}\delta_1 - U(\delta_1) \big) = \nabla U^{*}(-\eta),
$$
we obtain
$$
U^{*}(-\eta) + \eta^{\top}\delta_1^{*} = -U(\delta_1^{*}) = -\tfrac{1}{2}\|\delta_1^{*}\|^{2}.
$$
Therefore, we obtain
$$
\mu_t \big[ F(\delta_{1,t}) - F(\delta_1^{*}) \big] \leq \tfrac{1}{2}\|\delta_1^{*}\|^{2} - \tfrac{1}{2}\|\delta_1^{*}\|^{2} + \mu_{t_0}\big[ F(\delta_{1,t_0}) - F(\delta_1^{*}) \big],
$$
$$
F(\delta_{1,t}) \leq F(\delta_1^{*}(\eta)) + \frac{\mu_{t_0}}{\mu_t}\,\big[ F(\delta_{1,t_0}) - F(\delta_1^{*}(\eta)) \big]. \qquad \square
$$
Example 1. 
Assume that
$$
D_{\mathrm{adm}} := \big\{ \delta_1 \in \mathbb{R}^n : \|\delta_1\| \leq r \big\}.
$$
To calculate $\delta_1^{*}$, according to (13), it is sufficient to note that the solution of the problem
$$
2\eta^{\top}\delta_1 + \|\delta_1\|^{2} = \|\delta_1 + \eta\|^{2} - \|\eta\|^{2} \;\to\; \min_{\|\delta_1\| \leq r},
$$
is
$$
\delta_1^{*}(\eta) = \begin{cases} -\eta & \text{if } \|\eta\| \leq r, \\[4pt] -\dfrac{\eta}{\|\eta\|}\, r & \text{if } \|\eta\| > r. \end{cases}
$$

4. Robust Controller Design

4.1. Auxiliary Sliding Variable and Its Dynamics

Introduce a new auxiliary variable (sliding variable)
$$
s_t = (t+\theta)\,\delta_{2,t} + \delta_{1,t} - \nabla U^{*}(\zeta_t - \eta), \qquad t \geq t_0 \geq 0.
$$
Notice that the function $s_t$ is measurable online, and that the situation when
$$
s_t = 0 \quad \text{for all } t \geq t_0
$$
corresponds exactly to the desired regime (11), starting from the moment $t_0$. Then, for $V(s_t) = \tfrac{1}{2}\|s_t\|^{2}$, in view of (7) and the first equation in (11), we have
$$
\begin{aligned}
\frac{d}{dt} V(s_t) &= s_t^{\top}\dot{s}_t = s_t^{\top}\Big[ 2\dot{\delta}_{1,t} + (t+\theta)\,\dot{\delta}_{2,t} - \frac{d}{dt}\nabla U^{*}(\zeta_t-\eta) \Big] \\
&= s_t^{\top}\Big[ 2\delta_{2,t} + (t+\theta)\big( f(t, \delta_{1,t}+x^{*}_{1,t}, \delta_{2,t}+x^{*}_{2,t}) - \dot{x}^{*}_{2,t} + u_t \big) - \nabla^{2} U^{*}(\zeta_t-\eta)\,\dot{\zeta}_t \Big] \\
&= (t+\theta)\, s_t^{\top} f(t, \delta_{1,t}+x^{*}_{1,t}, \delta_{2,t}+x^{*}_{2,t}) \\
&\quad + (t+\theta)\, s_t^{\top}\Big[ \tfrac{2}{t+\theta}\,\delta_{2,t} - \dot{x}^{*}_{2,t} + u_t + \tfrac{1}{t+\theta}\,\nabla^{2} U^{*}(\zeta_t-\eta)\, a(\delta_{1,t}) + k_t\,\mathrm{Sign}(s_t) \Big] - (t+\theta)\, k_t\, s_t^{\top}\mathrm{Sign}(s_t) \\
&\leq (t+\theta)\, s_t^{\top} f(t, \delta_{1,t}+x^{*}_{1,t}, \delta_{2,t}+x^{*}_{2,t}) - (t+\theta)\, k_t\, s_t^{\top}\mathrm{Sign}(s_t) \\
&\leq (t+\theta)\,\|s_t\|\,\underbrace{\big( c_0 + c_1\|\delta_{1,t}+x^{*}_{1,t}\| + c_2\|\delta_{2,t}+x^{*}_{2,t}\| \big)}_{k_{x,t}\, :=\, k_x(\delta_{1,t}+x^{*}_{1,t},\; \delta_{2,t}+x^{*}_{2,t})} - (t+\theta)\, k_t\, s_t^{\top}\mathrm{Sign}(s_t),
\end{aligned}
$$
where the bracketed term vanishes under the robust control $u_t$ designed in Section 4.2 below, and the last inequality uses the bound (4).
Here
$$
\mathrm{Sign}(s_t) = \big( \operatorname{sign}(s_{1,t}), \ldots, \operatorname{sign}(s_{n,t}) \big)^{\top}, \qquad \operatorname{sign}(s_{i,t}) = \begin{cases} +1 & \text{if } s_{i,t} > 0, \\ -1 & \text{if } s_{i,t} < 0, \\ \in [-1, +1] & \text{if } s_{i,t} = 0. \end{cases}
$$

4.2. Robust Control Structure

Since
$$
s_t^{\top}\mathrm{Sign}(s_t) = \sum_{i=1}^{n} |s_{i,t}| \geq \|s_t\|
$$
and taking
$$
k_t = k_{x,t} + \rho, \qquad \rho > 0,
$$
we obtain
$$
\frac{d}{dt} V(s_t) \leq (t+\theta)\,\|s_t\|\,\big( k_{x,t} - k_t \big) = -(t+\theta)\,\rho\,\sqrt{2\,V(s_t)},
$$
which implies
$$
\frac{dV(s_t)}{\sqrt{V(s_t)}} \leq -(t+\theta)\,\sqrt{2}\,\rho\, dt, \qquad 2\Big[ \sqrt{V(s_t)} - \sqrt{V(s_{t_0})} \Big] \leq -\frac{\sqrt{2}\,\rho}{2}\Big[ (t+\theta)^{2} - (t_0+\theta)^{2} \Big],
$$
$$
0 \leq \sqrt{V(s_t)} \leq \sqrt{V(s_{t_0})} - \frac{\sqrt{2}\,\rho}{4}\Big[ (t+\theta)^{2} - (t_0+\theta)^{2} \Big].
$$
This means that $s_t = 0$ for all $t \geq t_{\mathrm{reach}}$, where
$$
t_{\mathrm{reach}} := \Big\{ t : \sqrt{V(s_{t_0})} - \frac{\sqrt{2}\,\rho}{4}\big[ (t+\theta)^{2} - (t_0+\theta)^{2} \big] = 0 \Big\} = \sqrt{\frac{2}{\rho}\,\|s_{t_0}\| + (t_0+\theta)^{2}} - \theta.
$$
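For instance, a quick numerical evaluation of this reaching-time formula under assumed values (our numbers, not from the paper):

```python
import numpy as np

rho, theta, t0 = 1.0, 1e-3, 0.0           # design parameters (assumed)
s_t0 = np.array([0.05, -0.02, 0.01])      # initial sliding variable (assumed)

t_reach = np.sqrt((2.0 / rho) * np.linalg.norm(s_t0) + (t0 + theta) ** 2) - theta
print(t_reach)   # about 0.33: the sliding regime s_t = 0 is reached by this time
```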
Finally, the robust control is
$$
u_t = -\frac{2}{t+\theta}\,\delta_{2,t} + \dot{x}^{*}_{2,t} - \frac{1}{t+\theta}\,\nabla^{2} U^{*}(\zeta_t - \eta)\, a(\delta_{1,t}) - k_t\,\mathrm{Sign}(s_t) = u_{\mathrm{comp},t} + u_{\mathrm{disc},t},
$$
where
$$
u_{\mathrm{comp},t} := -\frac{2}{t+\theta}\,\delta_{2,t} + \dot{x}^{*}_{2,t} - \frac{1}{t+\theta}\,\nabla^{2} U^{*}(\zeta_t - \eta)\, a(\delta_{1,t}), \qquad u_{\mathrm{disc},t} := -k_t\,\mathrm{Sign}(s_t).
$$
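A sketch (not from the paper) of how one evaluation of the control law $u_t = u_{\mathrm{comp},t} + u_{\mathrm{disc},t}$ could be organized; the function and parameter names, and the assumption that the bound constants $c_0, c_1, c_2$ of (4) are known, are ours:

```python
import numpy as np

def robust_control(t, x1, x2, x1_star, x2_star, x2_star_dot,
                   zeta, eta, theta, rho, c,
                   subgrad, grad_U_star, hess_U_star):
    """One evaluation of u_t = u_comp,t + u_disc,t (illustrative sketch).

    c = (c0, c1, c2): assumed constants of the bound (4) on the unknown f.
    grad_U_star / hess_U_star: callables implementing the conjugate U* of the
    chosen D_adm (for the Euclidean r-ball, see Remark 3).
    """
    delta1, delta2 = x1 - x1_star, x2 - x2_star
    a = subgrad(delta1)                          # a(delta_1) in dF(delta_1)
    z = zeta - eta

    # continuous (compensating) part
    u_comp = (-2.0 / (t + theta)) * delta2 + x2_star_dot \
             - hess_U_star(z) @ a / (t + theta)

    # sliding variable and relay gain k_t = k_{x,t} + rho
    s = (t + theta) * delta2 + delta1 - grad_U_star(z)
    k_x = c[0] + c[1] * np.linalg.norm(x1) + c[2] * np.linalg.norm(x2)
    u_disc = -(k_x + rho) * np.sign(s)

    return u_comp + u_disc
```

Note that the relay gain uses only the norm bound (4), not the unknown $f$ itself, which is what makes the controller robust to the unknown dynamics.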
Remark 2. 
If we wish to obtain $t_{\mathrm{reach}} = t_0 = 0$, we need to fulfill the identity
$$
s_0 = \theta\,\delta_{2,0} + \delta_{1,0} - \nabla U^{*}(-\eta) \overset{(14)}{=} \theta\,\delta_{2,0} + \delta_{1,0} - \delta_1^{*}(\eta) = 0.
$$
Since $\delta_1^{*}(\eta) \in D_{\mathrm{adm}}$, we may conclude that the parameters $\theta > 0$, $\eta$ and the initial conditions $\delta_{1,0}$, $\delta_{2,0}$ should be consistent in the sense that
$$
\theta\,\delta_{2,0} + \delta_{1,0} \in D_{\mathrm{adm}}.
$$
Remark 3. 
For example, with the Euclidean $r$-ball in $\mathbb{R}^n$ being the admissible set $D_{\mathrm{adm}}$, from (9) and (10) one has
$$
\nabla U^{*}(\zeta) = \arg\max_{\delta_1 \in D_{\mathrm{adm}}} \big( \zeta^{\top}\delta_1 - U(\delta_1) \big) = \begin{cases} \zeta & \text{if } \|\zeta\| \leq r, \\[4pt] \dfrac{r}{\|\zeta\|}\,\zeta & \text{if } \|\zeta\| > r, \end{cases}
$$
$$
\delta_1^{*}(\eta) = \arg\min_{\delta_1 \in D_{\mathrm{adm}}} \big( \eta^{\top}\delta_1 + U(\delta_1) \big) = \arg\min_{\delta_1 \in D_{\mathrm{adm}}} \big( \eta^{\top}\delta_1 + \tfrac{1}{2}\|\delta_1\|^{2} \big) = -\eta \quad \text{if } \|\eta\| \leq r.
$$
From (18) it follows that
$$
\theta\,\delta_{2,0} + \delta_{1,0} = -\eta, \qquad \|\eta\| \leq r,
$$
and
$$
\nabla^{2} U^{*}(\zeta) = \begin{cases} I_{n\times n} & \text{if } \|\zeta\| \leq r, \\[6pt] \dfrac{r}{\|\zeta\|}\Big( I_{n\times n} - \dfrac{\zeta\,\zeta^{\top}}{\|\zeta\|^{2}} \Big) & \text{if } \|\zeta\| > r. \end{cases}
$$
Notice that the function $\nabla U^{*}(\cdot)$ in (10) is nondifferentiable at the points of the $r$-sphere of the ball and continuously differentiable at all other points of $\mathbb{R}^n$. The formulas in (20) and (23) are presented as their continuous versions on the ball, including the $r$-sphere.
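For the $r$-ball case of Remark 3, the maps $\nabla U^{*}$ and $\nabla^{2} U^{*}$ can be coded directly from these formulas (a sketch; the function names are ours):

```python
import numpy as np

def grad_U_star(z, r):
    # grad U*(z): Euclidean projection of z onto the r-ball
    nz = np.linalg.norm(z)
    return z if nz <= r else (r / nz) * z

def hess_U_star(z, r):
    # Hessian of U*: identity inside the ball, tangential scaling outside
    n, nz = z.size, np.linalg.norm(z)
    if nz <= r:
        return np.eye(n)
    return (r / nz) * (np.eye(n) - np.outer(z, z) / nz**2)
```

In the controller sketch of Section 4.2 one would then pass, for example, `lambda z: grad_U_star(z, r)` and `lambda z: hess_U_star(z, r)`.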

4.3. Main Result

We are ready to formulate the main result.
Theorem 2. 
Under Assumptions A1–A5, the robust control (17) and (18) with parameter $\eta$ satisfying (19) provides the property
$$
F(\delta_{1,t}) \leq F(\delta_1^{*}(\eta)) + \frac{\theta}{t+\theta}\,\big[ F(\delta_{1,0}) - F(\delta_1^{*}(\eta)) \big]
$$
for all $t \geq 0$ and any regularizing parameter $\theta > 0$.
Proof. 
In view of the relation (19) between the parameter $\eta$ and the initial conditions $\delta_{1,0}$, $\dot{\delta}_{1,0}$, the auxiliary variable $s_t = 0$ for all $t \geq 0$, i.e., from the very beginning of the control process. Using Formula (12) with $t_0 = 0$, we obtain (24). □

5. Discussion

Equations (14) and (19) hold under $\theta > 0$, $\eta \in \mathbb{R}^n$ for the following cases:
  • Zero initial conditions $\delta_{1,0} = 0$, $\delta_{2,0} = 0$. Then $\eta = 0$ for arbitrary $\theta > 0$ (see, as an example, the first item in the loss function (8)).
  • Nonzero initial conditions $\delta_{1,0}$, $\delta_{2,0}$ that are collinear, oppositely directed vectors. Then there exist $\theta > 0$ and $\eta = 0$ (see, as an example, the first item in the loss function (8)).
  • Equation (22) holds for a nonzero vector $\eta$ with a sufficiently small norm $\|\eta\| \leq \epsilon$ and for some $\theta > 0$ (see, as an example, the second item in the loss function (8)).

6. Numerical Example

A two-link robot manipulator with three revolute joints powered by individual PMDC motors is presented below as an illustrative example of the suggested approach.

6.1. Model Description

A dynamic model of a Lagrangian mechanical system with n degrees of freedom in standard form driven by n independent Permanent Magnet Direct Current (PMDC) motors [33] is defined by the following system of differential equations:
$$
D(q_t)\,\ddot{q}_t + C(q_t, \dot{q}_t)\,\dot{q}_t + G(q_t) = \tau_t + \vartheta_t, \qquad \tau_t = W K_a I_{a,t}, \qquad L_a \dot{I}_{a,t} + R_a I_{a,t} + K_e W \dot{q}_t = v_{a,t},
$$
where $q_t, \dot{q}_t \in \mathbb{R}^n$ are the state and velocity vectors, $\tau_t \in \mathbb{R}^n$ is a vector of external torques, $I_{a,t} \in \mathbb{R}^n$ is the armature current vector, $W \in \mathbb{R}^{n\times n}$ is the matrix of electromotive force constants (possibly taking into account engine gear ratios), $K_a \in \mathbb{R}^{n\times n}$ is the matrix of constants of direct electromotive forces, $D(q_t) = M(q_t) + W J W$ is a positive definite inertia matrix, that is, $D(q_t) = D(q_t)^{\top} \geq d\, I_{n\times n}$ with $d > 0$, and therefore invertible for all $q_t$, $J = \operatorname{diag}(J_1, J_2, \ldots, J_n)$ is the rotor inertia matrix, $M(q_t)$ is the inertia matrix of the Lagrangian system in the original coordinates, $C(q_t, \dot{q}_t) \in \mathbb{R}^{n\times n}$ is the matrix corresponding to the generalized nonpotential forces $C(q_t, \dot{q}_t)\dot{q}_t$, which can describe friction, hysteresis, Coriolis, damping, centripetal effects, etc., $G(q_t) \in \mathbb{R}^n$ is a vector corresponding to the generalized potential forces, $K_e = \operatorname{diag}(K_{e1}, K_{e2}, \ldots, K_{en})$ is the matrix of reverse electromotive force constants, $L_a = \operatorname{diag}(L_{a1}, L_{a2}, \ldots, L_{an})$ and $R_a = \operatorname{diag}(R_{a1}, R_{a2}, \ldots, R_{an})$ are the positive definite armature inductance and resistance matrices, respectively, $\vartheta_t \in \mathbb{R}^n$ is the disturbance (or uncertainty) vector, and $v_{a,t} \in \mathbb{R}^n$ is the armature voltage vector, which is treated below as the control designed to achieve the desired behavior. In fact, the third equation in (25) describes the dynamics of the actuator implementing the applied control action $v_{a,t}$. Equation (25) assumes fully actuated control.
We assume that $q_t$, $\dot{q}_t$, and $I_{a,t}$ are available online. From (25) it follows that
$$
I_{a,t} - I_{a,t_0} = -L_a^{-1} R_a \int_{\tau=t_0}^{t} I_{a,\tau}\, d\tau - L_a^{-1} K_e W \big( q_t - q_{t_0} \big) + L_a^{-1} \int_{\tau=t_0}^{t} v_{a,\tau}\, d\tau
$$
($t_0 \geq 0$ is any fixed time), and selecting (neglecting the Joule effect, related to the dependence of the motor winding resistance on heating)
$$
v_{a,t} = v_{a,t}^{(1)} + v_{a,t}^{(2)},
$$
with
$$
v_{a,t}^{(1)} = R_a I_{a,t} + K_e W \dot{q}_t,
$$
the relation (26) becomes
$$
I_{a,t} = I_{a,t_0} + L_a^{-1} \int_{\tau=t_0}^{t} v_{a,\tau}^{(2)}\, d\tau.
$$
Substituting (29) into (25) gives
$$
D(q_t)\,\ddot{q}_t + C(q_t, \dot{q}_t)\,\dot{q}_t + G(q_t) = u_t + \tilde{\vartheta}_t,
$$
$$
u_t := W K_a L_a^{-1} \int_{\tau=t_0}^{t} v_{a,\tau}^{(2)}\, d\tau, \qquad \tilde{\vartheta}_t := W K_a I_{a,t_0} + \vartheta_t.
$$
Note that in the standard matrix format (3), with the new state vector $x_1 = q \in \mathbb{R}^n$ and velocity vector $x_2 = \dot{q} \in \mathbb{R}^n$, the Lagrange dynamics under consideration (30) take the following form:
$$
\dot{x}_t = \begin{pmatrix} \dot{x}_1(t) \\ \dot{x}_2(t) \end{pmatrix} = H\big( x_1(t), x_2(t) \big) \begin{pmatrix} x_1(t) \\ x_2(t) \end{pmatrix} + B\, u_t + \xi_t,
$$
$$
H(x_1, x_2) = \begin{pmatrix} 0 & I_{n\times n} \\ 0 & -D^{-1}(x_1)\, C(x_1, x_2) \end{pmatrix}, \qquad B = \begin{pmatrix} 0_{n\times n} \\ D^{-1}(x_1) \end{pmatrix}, \qquad \xi_t = \begin{pmatrix} 0_{n\times n} \\ D^{-1}(x_1(t))\,\big( \tilde{\vartheta}_t - G(x_1) \big) \end{pmatrix}.
$$
Thus, in this representation, the dimension of the control vector $u$ is $n$, and the extended state $x = (x_1^{\top}, x_2^{\top})^{\top}$ has dimension $2n$.
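As a sketch of how this extended state-space form could be assembled in code, with the model callables $D$, $C$, $G$ treated as user-supplied placeholders (they are not specified here; only the structure of the representation is illustrated):

```python
import numpy as np

def extended_dynamics(x1, x2, u, vartheta_tilde, D, C, G):
    """Right-hand side of the extended form: x_dot = H(x1, x2) x + B u + xi.

    D(x1), C(x1, x2), G(x1) are hypothetical model callables (placeholders).
    """
    n = x1.size
    Dinv = np.linalg.inv(D(x1))
    H = np.block([[np.zeros((n, n)), np.eye(n)],
                  [np.zeros((n, n)), -Dinv @ C(x1, x2)]])
    B = np.vstack([np.zeros((n, n)), Dinv])
    xi = np.concatenate([np.zeros(n), Dinv @ (vartheta_tilde - G(x1))])
    x = np.concatenate([x1, x2])
    return H @ x + B @ u + xi
```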

6.2. Intended Moving Point

The considered mechanical construction is depicted in Figure 1.
The challenge is to move the robot’s cueing point so that it follows the intended moving point $x^{*}_{1,t} = (x^{*}_t, y^{*}_t, z^{*}_t)^{\top} \in \mathbb{R}^3$, maintaining a small distance between them. Here, we suppose that the point $x^{*}_{1,t}$ travels along a certain ellipse $El(\mathring{x}_1, r_0)$ at a speed of constant magnitude $v$:
$$
El(\mathring{x}_1, r_0) := \Big\{ x_1^{*} = (x^{*}, y^{*}, z^{*}) :\; x^{*}_t = \mathring{x} + r_0 \sin(\omega t),\; y^{*}_t = \mathring{y} + r_0 \cos(\omega t),\; z^{*}_t = \mathring{z},\; \|\dot{x}^{*}_t\| = v = r_0\,\omega \Big\}.
$$
The corresponding cost function $F$ (8) to be minimized is a function of the tracking error $\delta_{1,t} = \big( x_t - x^{*}_t,\; y_t - y^{*}_t,\; z_t - z^{*}_t \big)^{\top}$ and is selected as
$$
F(\delta_{1,t}) = F_1(x_t - x^{*}_t) + F_2(y_t - y^{*}_t) + F_3(z_t - z^{*}_t), \qquad F_1(x_t - x^{*}_t) = |x_t - x^{*}_t|, \quad F_2(y_t - y^{*}_t) = |y_t - y^{*}_t|, \quad F_3(z_t - z^{*}_t) = |z_t - z^{*}_t|,
$$
$$
a(\delta_{1,t}) = \big( \operatorname{sign}(x_t - x^{*}_t),\; \operatorname{sign}(y_t - y^{*}_t),\; \operatorname{sign}(z_t - z^{*}_t) \big)^{\top} \in \partial F(\delta_{1,t}).
$$

6.3. Relation between Cartesian and Angular Coordinates

The relation between the vectors $x_1$ and $q$,
$$
x_1 = F(q), \qquad x_1 := \begin{pmatrix} x \\ y \\ z \end{pmatrix}, \qquad q := \begin{pmatrix} \varphi_1 \\ \varphi_2 \\ \varphi_3 \end{pmatrix},
$$
is as follows:
$$
x_t = l_1 \cos\varphi_2 \cos\varphi_1 + l_2 \cos(\varphi_2 + \varphi_3)\cos\varphi_1, \qquad y_t = l_1 \cos\varphi_2 \sin\varphi_1 + l_2 \cos(\varphi_2 + \varphi_3)\sin\varphi_1, \qquad z_t = l_1 \sin\varphi_2 + l_2 \sin(\varphi_2 + \varphi_3).
$$
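This forward kinematics map translates directly into a short routine (a sketch; the function name is ours):

```python
import numpy as np

def forward_kinematics(phi, l1, l2):
    # phi = (phi1, phi2, phi3): joint angles; l1, l2: link lengths
    phi1, phi2, phi3 = phi
    x = l1 * np.cos(phi2) * np.cos(phi1) + l2 * np.cos(phi2 + phi3) * np.cos(phi1)
    y = l1 * np.cos(phi2) * np.sin(phi1) + l2 * np.cos(phi2 + phi3) * np.sin(phi1)
    z = l1 * np.sin(phi2) + l2 * np.sin(phi2 + phi3)
    return np.array([x, y, z])
```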
To implement the simulation, the immeasurable nonpotential forces (friction, hysteresis, Coriolis, damping, centripetal effects, and others) were modeled in the following form:
$$
C(q_t, \dot{q}_t)\,\dot{q}_t = k_{\mathrm{res}}\,\big( \dot{q}_t^{\top}\,\mathrm{Sign}(\dot{q}_t) \big)\,\dot{q}_t, \qquad k_{\mathrm{res}} > 0.
$$

6.4. Applied Robust Controller Structure

In this particular case, the suggested robust controller (17) and (18) is as follows:
$$
u_t = u_{\mathrm{comp},t} + u_{\mathrm{disc},t},
$$
with the compensating control part $u_{\mathrm{comp},t}$ equal to
$$
u_{\mathrm{comp},t} := -\frac{2}{t+\theta}\,\delta_{2,t} + \dot{x}^{*}_{2,t} - \frac{1}{t+\theta}\,\nabla^{2} U^{*}(\zeta_t - \eta)\, a(\delta_{1,t}),
$$
where $a(\delta_{1,t})$ is defined in (34), and
$$
\delta_{2,t} = \dot{\delta}_{1,t} = \dot{x}_{1,t} - \dot{x}^{*}_{1,t}, \qquad \dot{x}^{*}_{2,t} = \ddot{x}^{*}_{1,t} = -Q\,(x^{*}_t - \mathring{x}), \qquad \zeta_t = -\int_{\tau=0}^{t} a(\delta_{1,\tau})\, d\tau, \qquad \eta = -\big( \theta\,\delta_{2,0} + \delta_{1,0} \big), \quad \|\eta\| \leq r,
$$
$$
\nabla^{2} U^{*}(\zeta_t - \eta) = \begin{cases} I_{n\times n} & \text{if } \|\zeta_t - \eta\| \leq r, \\[6pt] \dfrac{r}{\|\zeta_t - \eta\|}\Big( I_{n\times n} - \dfrac{(\zeta_t - \eta)(\zeta_t - \eta)^{\top}}{\|\zeta_t - \eta\|^{2}} \Big) & \text{if } \|\zeta_t - \eta\| > r. \end{cases}
$$
The discontinuous control $u_{\mathrm{disc},t}$ is designed as
$$
u_{\mathrm{disc},t} := -k_t\,\mathrm{Sign}(s_t), \qquad s_t = (t+\theta)\,\delta_{2,t} + \delta_{1,t} - \nabla U^{*}(\zeta_t - \eta), \qquad \nabla U^{*}(\zeta_t - \eta) = \begin{cases} \zeta_t - \eta & \text{if } \|\zeta_t - \eta\| \leq r, \\[6pt] \dfrac{r}{\|\zeta_t - \eta\|}\,(\zeta_t - \eta) & \text{if } \|\zeta_t - \eta\| > r. \end{cases}
$$

6.5. Parameters of Simulation

The following are the computer simulation parameters:
Parameter | Numerical Value | Description
$k_{\mathrm{res}}$ | $1.1 \times 10^{-6}$ | environmental (air) resistance
$g$ | 9.81 m/s$^2$ | gravitational acceleration
$m_1$, $m_2$ | 1 kg | mass
$l_1$, $l_2$ | 0.35 m, 0.67 m | length
$$
W = \begin{pmatrix} 0.05 & 0.04 & 0.06 \\ 0.04 & 0.044 & 0.022 \\ 0.06 & 0.022 & 0.057 \end{pmatrix}, \quad L_a = \begin{pmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{pmatrix}, \quad J = \begin{pmatrix} 0.021 & 0.0006 & 0 \\ 0.0006 & 0.021 & 0 \\ 0 & 0 & 0.0407 \end{pmatrix}, \quad K_a = \begin{pmatrix} 0.02 & 0.04 & 0.06 \\ 0.04 & 0.08 & 0.1 \\ 0.06 & 0.1 & 0.04 \end{pmatrix}, \quad r_0 = 0.4, \quad \omega = 1,
$$
$\vartheta_t$ is generated by uniformly distributed random numbers on the interval $[-0.8, 0.8]$, and
$$
K_e = \operatorname{diag}(70, 140, 151), \quad \theta = 10^{-3}, \quad Q = \operatorname{diag}(0.5, 0.9, 2), \quad \mathring{x} = (0.35,\; 2.35,\; 0.8)^{\top}, \quad r = 0.6,
$$
$$
\delta_{1,0} = \begin{pmatrix} 0.4001 \\ 0.7576 \\ 0.5236 \end{pmatrix}, \qquad \delta_{2,0} = \begin{pmatrix} 0.3990 \\ 0.1802 \\ 0.0012 \end{pmatrix}, \qquad \eta = \begin{pmatrix} 0.3996 \\ 0.7570 \\ 0.6283 \end{pmatrix} \text{ calculated from } \eta = -\big( \theta\,\delta_{2,0} + \delta_{1,0} \big).
$$

6.6. Results of Numerical Simulations

The simulation results are shown in Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6.
As one can see, the suggested method demonstrates successful performance in the presence of essential model uncertainties and external perturbations.

7. Conclusions

- The constrained optimization problem is addressed in this study using a second-order differential controlled plant with an unknown (but bounded) right-hand side of the model.
- The desired dynamics in the tracking error variables are designed based on the mirror descent method.
- The continuous-time convergence to the set of minimizing points is established, and the associated rate of convergence is analytically evaluated.
- The robust controller, containing both a continuous (compensating) part $u_{\mathrm{comp}}$ and a discontinuous part $u_{\mathrm{disc}}$, is proposed using the ASG version of the integral sliding mode approach.
- The suggested controller, under special relations between its parameters and the initial conditions, is proved to provide the desired regime from the very beginning of the control process.
- This method may have several applications in the development of robust control in mechanical systems, including soft robotics and moving dynamic plants.

Author Contributions

Conceptualization, A.N. and A.P.; methodology, H.A.; formal analysis, A.N. and A.P.; data curation: H.A.; resources: H.A.; software: H.A.; validation: A.N. and A.P.; visualization: A.N., H.A. and A.P.; writing—original draft preparation, A.N. and A.P.; writing—review and editing, A.N. and A.P.; supervision, A.P.; project administration, A.P.; funding acquisition, A.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ASG: Average subgradient
SDM: Subgradient descent method
ISM: Integral sliding mode
SOM: Static optimization methods
ODE: Ordinary differential equation

References

  1. Bertsekas, D.P. Constrained Optimization and Lagrange Multiplier Methods; Academic Press: New York, NY, USA, 1982; ISBN 0-12-093480-9.
  2. Dechter, R. Constraint Processing; Morgan Kaufmann: Burlington, MA, USA, 2003; ISBN 1-55860-890-7.
  3. Leader, J.J. Numerical Analysis and Scientific Computation; Addison Wesley: Boston, MA, USA, 2003; ISBN 0-201-73499-0.
  4. Posser, M.; Posser, M.J. Basic Mathematics for Economists; Routledge: New York, NY, USA, 1993; ISBN 0-415-08424-5.
  5. Rossi, F.; van Beek, P.; Walsh, T. Chapter 1—Introduction. In Foundations of Artificial Intelligence: Handbook of Constraint Programming; Rossi, F., van Beek, P., Walsh, T., Eds.; Elsevier: Amsterdam, The Netherlands, 2006; Volume 2, pp. 3–12.
  6. Sun, W.; Yuan, Y.-X. Optimization Theory and Methods: Nonlinear Programming; Springer: New York, NY, USA, 2010; ISBN 978-1441937650.
  7. Rastrigin, L.A. Systems of Extremal Control; Nauka: Moscow, Russia, 1974. (In Russian)
  8. Krstic, M.; Wang, H.H. Stability of extremum seeking feedback for general nonlinear dynamic systems. Automatica 2000, 36, 595–601.
  9. Ariyur, K.B.; Krstic, M. Real-Time Optimization by Extremum-Seeking Control; John Wiley & Sons: Hoboken, NJ, USA, 2003.
  10. Tan, Y.; Moase, W.H.; Manzie, C.; Nešić, D.; Mareels, I.M.Y. Extremum seeking from 1922 to 2010. In Proceedings of the 29th Chinese Control Conference, Beijing, China, 29–31 July 2010; pp. 14–26.
  11. Tan, Y.; Nešić, D.; Mareels, I. On non-local stability properties of extremum seeking control. Automatica 2006, 42, 889–903.
  12. Rawlings, J.B.; Amrit, R. Optimizing process economic performance using model predictive control. In Nonlinear Model Predictive Control; Magni, L., Raimondo, D.M., Allgöwer, F., Eds.; Lecture Notes in Control and Information Sciences; Springer: Berlin/Heidelberg, Germany, 2009; Volume 384, pp. 119–138.
  13. Dehaan, D.; Guay, M. Extremum-seeking control of state-constrained nonlinear systems. Automatica 2005, 41, 1567–1574.
  14. Chunlei, Z.; Ordóñez, R. Robust and adaptive design of numerical optimization-based extremum seeking control. Automatica 2009, 45, 634–646.
  15. Solis, C.U.; Clempner, J.B.; Poznyak, A.S. Extremum seeking by a dynamic plant using mixed integral sliding mode controller with synchronous detection gradient estimation. Int. J. Robust Nonlinear Control 2018, 29, 702–714.
  16. Ferrara, A.; Utkin, V.I. Sliding Mode Optimization in Dynamic LTI Systems. J. Optim. Theory Appl. 2002, 115, 727–740.
  17. Ferrara, A. A variable structure convex programming based control approach for a class of uncertain linear systems. Syst. Control Lett. 2005, 54, 529–538.
  18. Bemporad, A.; Morari, M. Robust model predictive control: A survey. In Robustness in Identification and Control; Garulli, A., Tesi, A., Eds.; Lecture Notes in Control and Information Sciences; Springer: London, UK, 1999; Volume 245, pp. 207–226.
  19. Jalali, A.A.; Nadimi, V. A Survey on Robust Model Predictive Control from 1999–2006. In Proceedings of the 2006 International Conference on Computational Intelligence for Modelling Control and Automation and International Conference on Intelligent Agents Web Technologies and International Commerce (CIMCA'06), Sydney, NSW, Australia, 28 November–1 December 2006; p. 207.
  20. Li, H.; Wang, S.; Shi, H.; Su, C.; Li, P. Two-Dimensional Iterative Learning Robust Asynchronous Switching Predictive Control for Multiphase Batch Processes with Time-Varying Delays. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 6488–6502.
  21. Wang, L.; Zhang, W.; Zhang, Q.; Shi, H.; Zhang, R.; Gao, F. Terminal constrained robust hybrid iterative learning model predictive control for complex time-delayed batch processes. Nonlinear Anal. Hybrid Syst. 2023, 47, 101276.
  22. Liu, X.; Ma, L.; Kong, X.; Lee, K.Y. An efficient iterative learning predictive functional control for nonlinear batch processes. IEEE Trans. Cybern. 2020, 52, 4147–4160.
  23. Shi, H.; Li, P.; Cao, J.; Su, C.; Yu, J. Robust fuzzy predictive control for discrete-time systems with interval time-varying delays and unknown disturbances. IEEE Trans. Fuzzy Syst. 2019, 28, 1504–1516.
  24. Simpson-Porco, J.W. Input/output analysis of primal-dual gradient algorithms. In Proceedings of the 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 27–30 September 2016; pp. 219–224.
  25. Nazin, A.V. Algorithms of Inertial Mirror Descent in Convex Problems of Stochastic Optimization. Autom. Remote Control 2018, 79, 78–88.
  26. Utkin, V. Sliding Modes in Control and Optimization; Springer: Berlin/Heidelberg, Germany, 1992.
  27. Fridman, L.; Poznyak, A.; Bejarano, F.J. Robust Output LQ Optimal Control via Integral Sliding Modes; Birkhäuser: Basel, Switzerland; Springer Science and Business Media: New York, NY, USA, 2014.
  28. Utkin, V.; Poznyak, A.; Orlov, Y.V.; Polyakov, A. Conclusions. In Road Map for Sliding Mode Control Design; SpringerBriefs in Mathematics; Springer International Publishing: Cham, Switzerland, 2020; pp. 125–127.
  29. Poznyak, A.S.; Nazin, A.V.; Alazki, H. Integral Sliding Mode Convex Optimization in Uncertain Lagrangian Systems Driven by PMDC Motors: Averaged Subgradient Approach. IEEE Trans. Autom. Control 2021, 66, 4267–4273.
  30. Rockafellar, R.T. Convex Analysis; Princeton University Press: Princeton, NJ, USA, 1970.
  31. Ben-Tal, A.; Nemirovski, A. The Conjugate Barrier Mirror Descent Method for Non-Smooth Convex Optimization; Minerva Optimization Center, Technion Institute of Technology: Haifa, Israel, 1999.
  32. Juditsky, A.B.; Nazin, A.V.; Tsybakov, A.B.; Vayatis, N. Recursive aggregation of estimators by the mirror descent algorithm with averaging. Probl. Inf. Transm. 2005, 41, 368–384.
  33. Patel, D.K. Mathematical modeling of open loop PMDC motor using MATLAB/Simulink. Int. J. Eng. Develop. Res. 2015, 3, 495–500.
Figure 1. Robot with two links.
Figure 2. Tracking ellipse.
Figure 3. $x_t$ tracking of $x^{*}_t$.
Figure 4. Tracking of $y^{*}_t$ with $y_t$.
Figure 5. Tracking of $z^{*}_t$ with $z_t$.
Figure 6. Cost function components.