A Multilevel Heterogeneous ADMM Algorithm for Elliptic Optimal Control Problems with L1-Control Cost

Chen, Xiaotong; Song, Xiaoliang; Chen, Zixuan; Xu, Lijun

doi:10.3390/math11030570

Open AccessArticle

A Multilevel Heterogeneous ADMM Algorithm for Elliptic Optimal Control Problems with L¹-Control Cost

¹

School of Science, Dalian Maritime University, Dalian 116026, China

²

School of Mathematical Sciences, Dalian University of Technology, Dalian 116024, China

³

College of Sciences, Northeastern University, Shenyang 110819, China

^*

Author to whom correspondence should be addressed.

Mathematics 2023, 11(3), 570; https://doi.org/10.3390/math11030570

Submission received: 31 December 2022 / Revised: 18 January 2023 / Accepted: 19 January 2023 / Published: 21 January 2023

(This article belongs to the Section Computational and Applied Mathematics)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, elliptic optimal control problems with

L^{1}

-control cost and box constraints on the control are considered. To numerically solve the optimal control problems, we use the First optimize, then discretize approach. We focus on the inexact alternating direction method of multipliers (iADMM) and employ the standard piecewise linear finite element approach to discretize the subproblems in each iteration. However, in general, solving the subproblems is expensive, especially when the discretization is at a fine level. Motivated by the efficiency of the multigrid method for solving large-scale problems, we combine the multigrid strategy with the iADMM algorithm. Instead of fixing the mesh size before the computation process, we propose the strategy of gradually refining the grid. Moreover, to overcome the difficulty whereby the

L^{1}

-norm does not have a decoupled form, we apply nodal quadrature formulas to approximately discretize the

L^{1}

-norm and

L^{2}

-norm. Based on these strategies, an efficient multilevel heterogeneous ADMM (mhADMM) algorithm is proposed. The total error of the mhADMM consists of two parts: the discretization error resulting from the finite-element discretization and the iteration error resulting from solving the discretized subproblems. Both errors can be regarded as the error of inexactly solving infinite-dimensional subproblems. Thus, the mhADMM can be regarded as the iADMM in function space. Furthermore, theoretical results on the global convergence, as well as the iteration complexity results

o (1 / k)

for the mhADMM, are given. Numerical results show the efficiency of the mhADMM algorithm.

Keywords:

optimal control problems; ADMM; sparse regularization; multilevel

MSC:

49M41; 49M25; 65K10; 65M32

1. Introduction

Sparse optimal control problems are widespread in many areas, such as the placement of actuators [1], quantum spin systems [2], etc. [3,4]. It is well-known that adding the

L^{1}

-norm penalty can lead to sparse optimal control problems, i.e., the infinite dimensional control variable is localized in its domain of action. In this paper, we consider the elliptic optimal control problem with

L^{1}

-control cost and box constraints on the control:

\{\begin{matrix} {min}_{(y, u) \in Y \times U}^{} & J (y, u) = \frac{1}{2} ∥ y - y_{d} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} {∥ u ∥}_{L^{2} (Ω)}^{2} + β {∥ u ∥}_{L^{1} (Ω)} \\ s . t . & L y = u + y_{r} in Ω, \\ y = 0 on \partial Ω, \\ u \in U_{a d} = {v (x) | a \leq v (x) \leq b, a . e on Ω} \subseteq U, \end{matrix}

(1)

where

Y : = H_{0}^{1} (Ω), U : = L^{2} (Ω), Ω \subseteq R^{n} (n = 2, 3)

is a convex, open and bounded domain with

C^{1, 1}

- or polygonal boundary; the desired state

y_{d} \in H^{1} (Ω)

and the source term

y_{r} \in H^{1} (Ω)

are given; parameters

α, β > 0

,

- \infty < a < 0 < b < + \infty

; L is the uniformly elliptic differential operator given by

L y : = - \sum_{i, j = 1}^{n} \partial_{x_{j}} (a_{i j} y_{x_{i}}) + c_{0} y,

(2)

where

a_{i j}, c_{0} \in L^{\infty} (Ω), c_{0} ⩾ 0, a_{i j} = a_{j i}

and there is a constant

θ > 0

, such that

\sum_{i, j = 1}^{n} a_{i j} (x) ξ_{i} ξ_{j} ⩾ θ {∥ ξ ∥}^{2}, a . a . x \in Ω, \forall ξ \in R^{n} .

(3)

Due to the

L^{1}

-control cost, the objective function

J (y, u)

is non-smooth. This means that the structure of the control significantly differs from that of the smooth one. To solve elliptic optimal control problems with

L^{1}

-control cost, Stadler et al. proposed a semi-smooth Newton method to solve elliptic optimal control problems with

L^{1}

-control cost [1]. Porcelli et al. proposed a semi-smooth Newton method with a robust preconditioner for different formulations of the Newton equation [5]. However, solving Newton equations is expensive, especially when the discretization is at a fine level. Thus, some efficient first-order algorithms have received much attention in recent years. In [6], Schindele and Borzì proposed an inexact accelerated proximal gradient (APG) method in function space to solve elliptic optimal control problems with

L^{1}

-control cost. However, the efficiency of the APG method relies closely on the step length. The backtracking approach is applied to obtain the appropriate step length, but this greatly increases the computational cost. In [7], Song et al. proposed an inexact heterogeneous ADMM (ihADMM) algorithm. Different from the classical ADMM, two different weighted inner products are utilized to define the augmented Lagrangian function for two subproblems. Moreover, the ihADMM was applied to solve PDE-constrained optimization problems with

L^{2}

-control cost [8] and elliptic optimal control problems with pointwise box constraints on the state [9]. For more applications of ADMM-type algorithms to solve PDE-constrained optimization problems, see [10,11,12,13]. Inspired by the efficiency of ADMM-type algorithms on PDE-constrained optimal control problems, we consider using ADMM-type algorithms to solve (1).

In classical finite-element-based algorithms, problems are always discretized and computed at a fixed grid level. As the mesh becomes finer and finer, the scale of the discretized problems will be larger and the computation cost will increase, causing a bottleneck. Thus, it is essential to develop new approaches to solve optimal control problems in an accurate and computationally efficient way. The multigrid method is a modern field of research, which started in the early 1970s. It is well-known that the multigrid method is the optimal solution to many discretized partial differential equations [14]. Different from the classical multigrid method, Deuflhard proposed a cascadic multigrid method for elliptic partial differential equations, which solves problems from the coarse grid to the fine grid [15]. In [16], the multigrid method was applied to tackle infinite dimensional non-linear partial differential equations using Newton methods. Due to the efficiency of the multigrid method, it is a natural idea to construct the multigrid type numerical method for PDE-constrained optimization problems. The classical multigrid method for the optimal control problem is designed to solve the linear algebraic systems formulated on each step of the optimization algorithm; refer to [17] for an overview. In [18], Gong et al. proposed an adaptive multilevel correction method, which solves the optimal control problems from the coarsest mesh to the finest mesh. Chen et al. proposed a strategy of gradually refining the grid, and proposed a multilevel ADMM (mADMM) algorithm for PDE-constrained optimization problems with a

L^{2}

-control cost in [19]. The authors proved the global convergence and the iteration complexity results

o (1 / k)

for the mADMM.

In this paper, we focus on the first optimize, then discretize [20] approach. This approach provides the freedom to discretize subproblems using different discretization schemes. Motivated by the success of the multigrid method, we combined the multigrid method and the ihADMM algorithm to numerically solve (1). At the early stage of the whole iteration process of the algorithm, using a coarse grid will not make the precision worse, but will reduce the computational cost. Thus, we propose a strategy of gradually refining the grid. Specifically, we first introduced the iterative scheme of the iADMM in function space. Then, nodal quadrature formulas were used to approximately discretize the

L^{1}

-norm and

L^{2}

-norm to ensure that discretized subproblems have decoupled forms. Finally, appropriate methods, such as Krylov-based methods, are used to solve the discretized subproblems.

The main contribution of this paper can be summarized as follows:

We propose an efficient multilevel heterogeneous ADMM (mADMM) algorithm for solving sparse optimal control problems with $L^{1}$ -control cost. Specifically, we first apply the iADMM in the function space as an outer optimization method. Then, the subproblems in each iteration are discretized by the finite element method. Instead of fixing the grid size before the computation process, we apply the strategy of gradually refining the grid. Moreover, we apply nodal quadrature formulas to approximately $L^{1}$ -norm and $L^{2}$ -norm to overcome the difficulty that $L^{1}$ -norm does not have a decoupled form and ensure z-subproblems have closed form solutions. Finally, we use appropriate methods to inexactly solve the subproblems in each iteration. The global convergence and the iteration complexity results for the mhADMM algorithm are also established.
We present numerical experiments to illustrate the effectiveness of the mhADMM algorithm. In addition, we compare the performance of the mhADMM with two benchmark methods: the ihADMM and the classical ADMM. Compared to the two methods, the mhADMM has an evident advantage in terms of computational time. Moreover, the numerical results regarding iterations illustrate the mesh-independent performance of the mhADMM.

The paper is organized as follows. In Section 2, we first briefly review the iteration format of the inexact ADMM algorithm in function space, then introduce the finite element approximation. A strategy of gradually refining the grid is introduced and the mhADMM algorithm is proposed. Numerical computation of the subproblems is also discussed. In Section 3, we present the convergence results of the mhADMM algorithm. Numerical results are given in Section 4, and concluding remarks are drawn in Section 5.

2. An Multilevel Heterogeneous ADMM Algorithm

In this section, we propose an efficient convergent multilevel alternating direction method of multipliers (mhADMM). Moreover, we introduce the numerical computation of the subproblems in the mhADMM algorithm.

To apply ADMM to solve (1), we introduce an artificial variable z; then, we can rewrite (1) in an equivalent form:

\{\begin{matrix} {min}_{(y, u, z) \in Y \times U \times U}^{} & J (y, u, z) = \frac{1}{2} ∥ y - y_{d} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} {∥ u ∥}_{L^{2} (Ω)}^{2} + β {∥ z ∥}_{L^{1} (Ω)} \\ s . t . & L y = u + y_{r} in Ω, \\ y = 0 on \partial Ω, \\ u = z, \\ z \in U_{a d} = {v (x) | a \leq v (x) \leq b, a . e on Ω} \subseteq U . \end{matrix}

(4)

For the existence and uniqueness of the solution of the elliptic PDE involved in (1), where L is defined by (2), the following proposition holds.

Proposition 1.

For every

u \in L^{2} (Ω)

and

y_{r} \in H^{1} (Ω)

, the elliptic PDE involved in (1):

\begin{matrix} L y & = u + y_{r} in Ω, \\ y & = 0 on \partial Ω, \end{matrix}

(5)

has a unique weak solution

y = y (u) : = S (u + y_{r})

, where

S : L^{2} (Ω) \to H_{0}^{1} (Ω)

denotes the solution operator. Moreover, S is a well-defined continuous linear injective operator. The adjoint operator

S^{*} : H^{- 1} (Ω) \to H_{0}^{1} (Ω)

is also a continuous linear operator.

Proof.

The weak formulation of (5) is given by the following:

Find y \in H_{0}^{1} (Ω) : a (y, v) = {(u + y_{r}, v)}_{L^{2} (Ω)}, \forall v \in H_{0}^{1} (Ω),

where

a : V \times V \to R

is a bilinear form:

a (y, v) : = \int_{Ω} (\sum_{i, j = 1}^{n} a_{i j} y_{x_{i}} v_{x_{j}} + c_{0} y v) d x .

We know from the assumption (3) that

a (\cdot, \cdot)

is symmetric, i.e.,

a (y, v) = a (v, y)

for all

y, v \in H_{0}^{1} (Ω)

. Thus,

(y, v) : = a (y, v)

defines a new inner product on

H_{0}^{1} (Ω)

. Then, the existence of a unique solution of (5) directly follows from the Riesz representation theorem. Moreover,

\begin{matrix} {θ ∥ y ∥}_{H_{0}^{1} (Ω)}^{2} \leq a (y, y) = {(u + y_{r}, y)}_{L^{2} (Ω)} \leq ∥ u + y_{r} ∥_{L^{2} (Ω)} {∥ y ∥}_{L^{2} (Ω)} \leq ∥ u + y_{r} ∥_{L^{2} (Ω)} {∥ y ∥}_{H_{0}^{1} (Ω)}, \end{matrix}

where

θ

is a constant, depending only on

Ω

, and we use the equivalence of norms in

H_{0}^{1} (Ω)

in the first inequality. Then, we have

{∥ y ∥}_{H_{0}^{1} (Ω)}^{2} \leq \frac{1}{θ} {∥ u + y_{r} ∥}_{L^{2} (Ω)} .

Thus, the solution operator S is well-defined and called the control-to-state mapping, which is a continuous linear injective operator. Since

H_{0}^{1}

is a Hilbert space, the adjoint operator

S^{*} : H^{- 1} (Ω) \to H_{0}^{1} (Ω)

is also a continuous linear operator. □

Since (4) is continuous and strongly convex, the solution of (4) exists and is unique. The following Karush–Kuhn–Tucker (KKT) conditions hold at the optimal solution of (4).

Theorem 1.

(First-order optimality condition)

(y^{*}, u^{*}, z^{*})

is the optimal solution of (4), if, and only if, adjoint state

p^{*} \in H_{0}^{1} (Ω)

and Lagrange multiplier

λ^{*} \in L^{2} (Ω)

exists, such that the following conditions hold in the weak sense:

\begin{matrix} y^{*} = S (u^{*} + y_{r}), \\ p^{*} = S^{*} (y^{*} - y_{d}), \\ α u^{*} - p^{*} + λ^{*} = 0, \\ u^{*} = z^{*}, \\ z^{*} \in U_{a d}, \\ {〈- λ^{*}, \tilde{z} - z^{*}〉}_{L^{2} (Ω)} + β (∥ \tilde{z} ∥_{L^{1} (Ω)} - {∥z^{*}∥}_{L^{1} (Ω)}) \geq 0, \forall \tilde{z} \in U_{a d} . \end{matrix}

Moreover, we have

u^{*} : = Π_{U_{a d}} (\frac{1}{α} soft (p^{*}, β)),

where the projection operator

Π_{U_{a d}} (\cdot)

and the soft thresholding operator

soft (\cdot)

, respectively, are defined as follows:

\begin{matrix} Π_{U_{a d}} (v (x)) & : = max {a, min {v (x), b}}, \\ soft (v (x), β) & : = sgn (v (x)) \circ max (| v (x) | - β, 0) . \end{matrix}

Using the solution operator S, (4) can be equivalently rewritten as the following reduced form:

\{\begin{matrix} {min}_{(u, z) \in U \times U}^{} & f (u) + g (z) \\ s . t . & u = z, \end{matrix}

(6)

where

\begin{matrix} f (u) : = \frac{1}{2} ∥ S (u + y_{r}) - y_{d} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} {∥ u ∥}_{L^{2} (Ω)}^{2}, \\ g (z) : = {β ∥ z ∥}_{L^{1} (Ω)} + δ_{U_{a d}} (z), \end{matrix}

δ_{U_{a d}}

denotes the indicator function of

U_{a d}

, i.e.,

δ_{U_{a d}} (z) = \{\begin{matrix} 0, & z \in U_{a d}, \\ \infty, & z \notin U_{a d} . \end{matrix}

Notice that (6) is a two-block separable convex optimization problem with linear equality constraints; the ADMM algorithm can be used to solve (6). The augmented Lagrangian function of (6) is defined as follows:

L_{σ} (u, z, λ; σ) = f (u) + g (z) + {〈 λ, u - z 〉}_{L^{2} (Ω)} + \frac{σ}{2} {∥ u - z ∥}_{L^{2} (Ω)}^{2},

where

λ \in L^{2} (Ω)

denotes the Lagrangian multiplier;

σ > 0

is the penalty parameter. Given the initial point

(u^{0}, z^{0}, λ^{0}) \in L^{2} (Ω) \times dom (δ_{U_{a d}} (\cdot)) \times L^{2} (Ω)

, parameters

σ > 0

,

τ \in (0, \frac{\sqrt{5} + 1}{2})

, the iteration format of ADMM in function space is as follows:

\{\begin{matrix} Step 1 : {\bar{u}}^{k + 1} = \underset{u}{arg min} f (u) + {〈 {\bar{λ}}^{k}, u - {\bar{z}}^{k} 〉}_{L^{2} (Ω)} + \frac{σ}{2} {∥ u - {\bar{z}}^{k} ∥}_{L^{2} (Ω)}^{2}, \\ Step 2 : {\bar{z}}^{k + 1} = \underset{z}{arg min} g (z) + {〈 {\bar{λ}}^{k}, {\bar{u}}^{k + 1} - z 〉}_{L^{2} (Ω)} + \frac{σ}{2} {∥ {\bar{u}}^{k + 1} - z ∥}_{L^{2} (Ω)}^{2}, \\ Step 3 : {\bar{λ}}^{k + 1} = {\bar{λ}}^{k} + τ σ ({\bar{u}}^{k + 1} - {\bar{z}}^{k + 1}) . \end{matrix}

(7)

However, computing the exact solution of each subproblem is usually expensive and unnecessary. In [7], Song et al. propose an inexact ADMM (iADMM) in function space for (6). Krylov-based methods are used to solve the subproblems, which are equivalent to large-scale linear systems. We show the iterative scheme of the iADMM algorithm in Algorithm 1.

It is easy to see that the z-subproblem has a closed-form solution

\begin{matrix} z^{k + 1} & = \underset{z}{arg min} \frac{σ}{2} ∥ z - \frac{1}{σ} (σ u^{k + 1} + λ^{k}) ∥_{L^{2} (Ω)}^{2} + β {∥ z ∥}_{L^{1} (Ω)} + δ_{U_{a d}} (z) \\ = Π_{U_{a d}} (\frac{1}{σ} soft (σ u^{k + 1} + λ^{k}, β)) . \end{matrix}

For the global convergence results and the iteration complexity for Algorithm 1, we have the following theorem.

Algorithm 1 Inexact ADMM (iADMM) algorithm for (6)

Input: Choose the initial point $(u^{0}, z^{0}, λ^{0}) \in L^{2} (Ω) \times dom (δ_{U_{a d}} (\cdot)) \times L^{2} (Ω)$ , parameters $σ > 0$ , $τ \in (0, \frac{\sqrt{5} + 1}{2})$ . Let ${ϵ_{k + 1}}_{k = 0}^{\infty}$ be a sequence satisfying ${ϵ_{k + 1}}_{k = 0}^{\infty} \subseteq [0, + \infty)$ and $\sum_{k = 0}^{\infty} ϵ_{k} < \infty$ . Set $k = 0$ .
Output: $u^{k}, z^{k}, λ^{k}$ .
Step 1 Compute $u^{k + 1}$ as an approximation solution of

$\begin{matrix} min_{u} f (u) + {〈 λ^{k}, u - z^{k} 〉}_{L^{2} (Ω)} + \frac{σ}{2} {∥ u - z^{k} ∥}_{L^{2} (Ω)}^{2} \end{matrix}$

such that the residual $δ_{u}^{k + 1} : = \nabla f (u^{k + 1}) + λ^{k} + σ (u^{k + 1} - z^{k})$ satisfies
$∥ δ_{u}^{k + 1} ∥_{L^{2} (Ω)} \leq ϵ_{k + 1} .$
Step 2 Compute $z^{k + 1}$ as follows:

$\begin{matrix} z^{k + 1} = \underset{z}{arg min} g (z) + {〈 λ^{k}, u^{k + 1} - z 〉}_{L^{2} (Ω)} + \frac{σ}{2} {∥ u^{k + 1} - z ∥}_{L^{2} (Ω)}^{2} . \end{matrix}$
Step 3 Compute

$λ^{k + 1} = λ^{k} + τ σ (u^{k + 1} - z^{k + 1}) .$
Step 4 If a termination criterion is met, stop; otherwise, set $k : = k + 1$ and go to Step 1.

Theorem 2.

([7] Theorem 3) Let

(y^{*}, u^{*}, z^{*}, p^{*}, λ^{*})

be the KKT point of (4), the sequence

{(u^{k}, z^{k}, λ^{k})}_{k = 0}^{\infty}

is generated by Algorithm 1 with the associated state

{y^{k}}_{k = 0}^{\infty}

and adjoint state

{p^{k}}_{k = 0}^{\infty}

; then, we have

\begin{matrix} lim_{k \to \infty} {∥ u^{k} - u^{*} ∥_{L^{2} (Ω)} + ∥ z^{k} - z^{*} ∥_{L^{2} (Ω)} + ∥ λ^{k} - λ^{*} ∥_{L^{2} (Ω)}} = 0, \\ lim_{k \to \infty} {∥ y^{k} - y^{*} ∥_{H_{0}^{1} (Ω)} + ∥ p^{k} - p^{*} ∥_{H_{0}^{1} (Ω)}} = 0 . \end{matrix}

Moreover, a constant

C_{0}

only depends on the initial point

(u^{0}, z^{0}, λ^{0})

and the optimal solution

(u^{*}, z^{*}, λ^{*})

, such that, for

k \geq 1

,

min_{1 \leq i \leq k} R (u^{i}, z^{i}, λ^{i}) \leq \frac{C_{0}}{k}, lim_{k \to \infty} (k \cdot min_{1 \leq i \leq k} R (u^{i}, z^{i}, λ^{i})) = 0,

where the function

R : (u, z, λ) \to [0, \infty)

is defined as

R (u, z, λ) : = {∥ \nabla f (u) + λ ∥}_{L^{2} (Ω)}^{2} + {dist}^{2} (0, - λ + \partial g (z)) + {∥ u - z ∥}_{L^{2} (Ω)}^{2} .

2.1. The mhADMM Algorithm

To numerically solve (6), we consider the full discretization, in which both the state y and the control u are discretized by piecewise, linear, globally continuous finite elements. We introduce a family of regular and quasi-uniform triangulations

{T_{h}}

of

\bar{Ω}

, i.e.,

\bar{Ω} = ⋃_{T \in T_{h}} \bar{T}

. With each element

T \in T_{h}

, we define the diameter of the set T by

ρ_{T} : = diam T

and let

σ_{T}

denote the diameter of the largest ball contained in T. The grid size is defined by

h : = \max_{T \in T_{h}} ρ_{T}

. We suppose the following standard assumption holds (see [20]).

Assumption 1.

(Regular and quasi-uniform triangulations) There are two positive constants κ and τ such that

\frac{ρ_{T}}{σ_{T}} \leq κ, \frac{h}{ρ_{T}} \leq τ

hold for all

T \in T_{h}

and all

h > 0

. Moreover, let us define

{\bar{Ω}}_{h} = ⋃_{T \in T_{h}} \bar{T}

and let

Ω_{h} \subseteq Ω

and

Γ_{h}

denote its interior and boundary, respectively. In the case that Ω is a convex polyhedral domain, we have

Ω = Ω_{h}

. In the case that Ω is a domain with a

C^{1, 1}

- boundary Γ, we assume that

{\bar{Ω}}_{h}

is convex and all boundary vertices of

{\bar{Ω}}_{h}

are contained in Γ, such that

| Ω ∖ Ω_{h} | \leq c^{*} h^{2},

where

| \cdot |

denotes the measure of the set, and

c^{*} > 0

is a constant.

Due to the homogeneous boundary condition of the state equation, we use

\begin{matrix} Y_{h} : = {y_{h} \in C (\bar{Ω}) | y_{h | T} \in P_{1}, \forall T \in T_{h}, y_{h} = 0 in \bar{Ω} ∖ Ω_{h}}, \\ U_{h} : = {u_{h} \in C (\bar{Ω}) | u_{h | T} \in P_{1}, \forall T \in T_{h}, u_{h} = 0 in \bar{Ω} ∖ Ω_{h}}, \end{matrix}

as the discretized state space and the discretized control space, respectively.

P_{1}

denotes the space of polynomials of degree less than or equal to 1. For the given regular and quasi-uniform triangulation

T_{h}

with nodes

{x_{i}}_{i = 1}^{N_{h}}

, let

{ϕ_{i} (x)}_{i = 1}^{N_{h}}

be a basis of

Y_{h}

,

U_{h}

, which satisfies the following properties:

ϕ_{i} (x) ⩾ 0, {∥ ϕ_{i} (x) ∥}_{\infty} = 1 \forall i = 1, . . ., N_{h}, \sum_{i = 1}^{N_{h}} ϕ_{i} (x) = 1 .

Then,

u_{h} \in U_{h}

,

y_{h} \in Y_{h}

can be represented in the following forms, respectively,

y_{h} = \sum_{i = 1}^{N_{h}} y_{i} ϕ_{i}, u_{h} = \sum_{i = 1}^{N_{h}} u_{i} ϕ_{i},

where

y_{i} : = y_{h} (x_{i})

,

u_{i} : = u_{h} (x_{i})

. Moreover,

y_{h}

can be expressed by

y_{h} (u) = S_{h} (u + y_{r})

, where

S_{h}

denotes the discretized version of the solution operator S. Let

U_{a d, h}

denotes the discretized feasible set, which is defined by

U_{a d, h} : = U_{h} \cap U_{a d} = \{z_{h} = \sum_{i = 1}^{N_{h}} z_{i} ϕ_{i} | a \leq z_{i} \leq b, \forall i = 1, \dots, N_{h}\} \subset U_{a d} .

To overcome the difficulty that the discretized

L^{1}

-norm does not have a decoupled form, we choose the nodal quadrature formulas introduced in [21] to approximately discretized the

L^{1}

-norm:

{∥z_{h}∥}_{L_{h}^{1} (Ω_{h})} = \sum_{i = 1}^{n} |z_{i}| \int_{Ω_{h}} ϕ_{i} (x) d x .

Moreover, in order to obtain a closed form solution for the z-subproblem, a similar quadrature formulae introduced in [8] is also used to discretize the

L^{2}

-norm in the z-subproblem:

{∥z_{h}∥}_{L_{h}^{2} (Ω_{h})} = {(\sum_{i = 1}^{n} {(z_{i})}^{2} \int_{Ω_{h}} ϕ_{i} (x) d x)}^{\frac{1}{2}} .

For the given

y_{r} \in H^{1} (Ω)

and

u \in L^{2} (Ω)

, the unique discretized state

y_{h}

associated with u is can be expressed by

y_{h} (u) = S_{h} (u + y_{r})

, where

S_{h}

is the discretized version of the solution operator S. Then, we have the well-known error estimates:

Lemma 1

([22], Theorem 4.4.6). For a given

u \in L^{2} (Ω)

, let y be the unique weak solution of the state Equation (5) and

y_{h}

be the unique discretized state. Then, there exists a constant

c > 0

independent of h, u and

y_{r}

, such that

∥ y - y_{h} ∥_{L^{2} (Ω)} + h ∥ \nabla y - \nabla y_{h} ∥_{L^{2} (Ω)} \leq c h^{2} {(∥ u ∥}_{L^{2} (Ω)} + ∥ y_{r} ∥_{L^{2} (Ω)}) .

In particular, this implies

∥ S - S_{h} ∥_{L (L^{2}, L^{2})} \leq c h^{2}

and

∥ S - S_{h} ∥_{L (L^{2}, H^{1})} \leq c h

.

To project the solution obtained on the coarser grid to the finer grid, we introduce the definition of the node interpolation operator

I_{h}

.

Definition 1.

For a given regular and quasi-uniform triangulation

T_{h}

of Ω with nodes

{x_{i}}_{i = 1}^{N_{h}}

, let

{ϕ_{i} (x)}_{i = 1}^{N_{h}}

denotes the standard set of nodal basis functions. The interpolation operator

I_{h}

is defined as

I_{h} w (x) : = \sum_{i = 1}^{N_{h}} w (x_{i}) ϕ_{i} (x) for any w \in C^{0} (Ω) \cap H^{1} (Ω) .

For the interpolation error estimate, we have the following Theorem 3.

Theorem 3

([19] Theorem 2). For all

w \in C^{0} (Ω) \cap H^{1} (Ω)

, we have

∥ w - I_{h} {w ∥}_{L^{2} (Ω)} \leq c_{I} h {∥ w ∥}_{H^{1} (Ω)},

where

c_{I}

is a constant independent of h.

At the early stage of the whole process, computing on the coarser grid can reduce the computation cost without making the precision worse. While as the iteration process proceeds, the iteration precision is supposed to increase. In this case, it is necessary to use the finer grid at the late stage. Thus, we apply the strategy of gradually refining the grid. In the initial iteration, we obtained a solution on the coarse grid, then projected the obtained solution to the finer grid. For the convenience of representing subproblems on different grids, we define

\begin{matrix} f_{h_{k + 1}} (u) : = \frac{1}{2} ∥ S_{h_{k + 1}} (u + I_{h_{k + 1}} y_{r}) - I_{h_{k + 1}} y_{d} ∥_{L^{2} (Ω_{h_{k + 1}})}^{2} + \frac{α}{2} {∥ u ∥}_{L^{2} (Ω_{h_{k + 1}})}^{2}, \\ g_{h_{k + 1}} (z) : = β {∥ z ∥}_{L_{h_{k + 1}}^{1} (Ω_{h_{k + 1}})} + δ_{U_{a d}, h_{k + 1}} (z) . \end{matrix}

Moreover, let

I_{h} y_{r} : = \sum_{i = 1}^{N_{h}} y_{r}^{i} ϕ_{i}, I_{h} y_{d} : = \sum_{i = 1}^{N_{h}} y_{d}^{i} ϕ_{i}

denotes the

L^{2}

-projection of

y_{r}

and

y_{d}

onto

Y_{h}

, respectively. Then, we show the iterative scheme of the multilevel heterogeneous ADMM alternating direction method of multipliers (mhADMM) in Algorithm 2.

Algorithm 2 Multilevel heterogeneous ADMM (mhADMM) algorithm for (6)

Input: Choose the initial point $(u_{h_{1}}^{0}, z_{h_{1}}^{0}, λ_{h_{1}}^{0}) \in H^{1} (Ω) \times H^{1} (Ω) \times H^{1} (Ω)$ , parameters $σ > 0$ , $τ \in (0, \frac{\sqrt{5} + 1}{2})$ . Let ${ϵ_{k + 1}}_{k = 0}^{\infty}$ be a sequence satisfying ${ϵ_{k + 1}}_{k = 0}^{\infty} \subseteq [0, + \infty)$ and $\sum_{k = 0}^{\infty} ϵ_{k + 1} < \infty$ , mesh sizes ${h_{k + 1}}_{k = 0}^{\infty}$ of each level satisfying $\sum_{k = 0}^{\infty} h_{k + 1} < \infty .$ Set $k = 0$ .
Output: $u_{h_{k}}^{k}, z_{h_{k}}^{k}, λ_{h_{k}}^{k}$ .
Step 1 Compute $u_{h_{k + 1}}^{k + 1}$ as an approximation solution of

$\begin{matrix} min_{u} f_{h_{k + 1}} (u) + {〈 λ_{h_{k + 1}}^{k}, u - z_{h_{k + 1}}^{k} 〉}_{L^{2} (Ω_{h_{k + 1}})} + \frac{σ}{2} {∥ u - z_{h_{k + 1}}^{k} ∥}_{L^{2} (Ω_{h_{k + 1}})}^{2} \end{matrix}$

such that the residual $δ_{u, h_{k + 1}}^{k + 1} : = \nabla f_{h_{k + 1}} (u_{h_{k + 1}}^{k + 1}) + λ_{h_{k + 1}}^{k} + σ (u_{h_{k + 1}}^{k + 1} - z_{h_{k + 1}}^{k})$ satisfies $∥ δ_{u, h_{k + 1}}^{k + 1} ∥_{L^{2} (Ω_{h_{k + 1}})} \leq ϵ_{k + 1} .$
Step 2 Compute $z_{h_{k + 1}}^{k + 1}$ as follows:

$\begin{matrix} z_{h_{k + 1}}^{k + 1} & = \underset{z}{arg min} g_{h_{k + 1}} (z) + {〈 λ_{h_{k + 1}}^{k}, u_{h_{k + 1}}^{k + 1} - z 〉}_{L^{2} (Ω_{h_{k + 1}})} + \frac{σ}{2} {∥ u_{h_{k + 1}}^{k + 1} - z ∥}_{L_{h_{k + 1}}^{2} (Ω_{h_{k + 1}})}^{2} . \end{matrix}$
Step 3 Compute

$λ_{h_{k + 1}}^{k + 1} = λ_{h_{k + 1}}^{k} + τ σ (u_{h_{k + 1}}^{k + 1} - z_{h_{k + 1}}^{k + 1}) .$
Step 4 If a termination criterion is met, stop; otherwise, set $k : = k + 1$ and go to Step 1.

It is easy to see that the z-subproblem has a closed-form solution:

\begin{matrix} z_{h_{k + 1}}^{k + 1} & = \underset{z}{arg min} \frac{σ}{2} ∥ z - \frac{1}{σ} (σ u_{h_{k + 1}}^{k + 1} + λ_{h_{k + 1}}^{k}) ∥_{L^{2} (Ω_{h_{k + 1}})}^{2} + β {∥ z ∥}_{L_{h_{k + 1}}^{1} (Ω_{h_{k + 1}})} + δ_{U_{a d}, h_{k + 1}} (z) \\ = Π_{U_{a d}, h_{k + 1}} (\frac{1}{σ} soft (σ u_{h_{k + 1}}^{k + 1} + λ_{h_{k + 1}}^{k}, β)) . \end{matrix}

2.2. Numerical Computation of the Subproblems in Algorithm 2

To rewrite the subproblems into matrix-vector forms, we define the following matrices

\begin{matrix} K_{h} : = {(a (ϕ_{i}, ϕ_{j}))}_{i, j = 1}^{N_{h}}, \\ M_{h} : = {(\int_{Ω_{h}} ϕ_{i} ϕ_{j} d x)}_{i, j = 1}^{N_{h}}, \\ W_{h} : = diag {(\int_{Ω_{h}} ϕ_{i} (x) d x)}_{i, j = 1}^{N_{h}}, \end{matrix}

where

K_{h}

,

M_{h}

and

W_{h}

denote the finite element stiffness matrix, mass matrix and lump mass matrix, respectively.

For

u_{h} = \sum_{i = 1}^{N_{h}} u_{i} ϕ_{i} \in U_{h}

,

y_{h} = \sum_{i = 1}^{N_{h}} y_{i} ϕ_{i} \in Y_{h}

, let

u_{h} = (u_{1}, . . ., u_{N_{h}}), y_{h} = (y_{1}, . . ., y_{N_{h}})

be the relative coefficient vectors, respectively. For

I_{h} y_{r} = \sum_{i = 1}^{N_{h}} y_{r}^{i} ϕ_{i}

,

I_{h} y_{d} = \sum_{i = 1}^{N_{h}} y_{d}^{i} ϕ_{i}

, let

y_{r, h} = (y_{r}^{1}, y_{r}^{2}, . . ., y_{r}^{N_{h}}), y_{d, h} = (y_{d}^{1}, y_{d}^{2}, . . ., y_{d}^{N_{h}})

be the coefficient vectors, respectively. Let

I_{h}

denotes the vector version of the interpolation operator. Moreover, we define

\begin{matrix} f (u) & : = \frac{1}{2} ∥ K_{h}^{- 1} M_{h} (u + y_{r, h}) - y_{d, h} ∥_{M_{h}}^{2} + \frac{α}{2} {∥ u ∥}_{M_{h}}^{2}, \\ g (z) & : = β ∥ W_{h} {z ∥}_{1}^{2} + δ_{{[a, b]}^{N_{h}}} (z) . \end{matrix}

Then, the matrix-vector form of Algorithm 2 is given in Algorithm 3.

Algorithm 3 Matrix-vector form of the mhADMM algorithm

Input: $(u^{0}, z^{0}, λ^{0}) \in R^{N_{h}} \times {[a, b]}^{N_{h}} \times R^{N_{h}}$ , parameters $σ > 0$ , $τ \in (0, \frac{\sqrt{5} + 1}{2})$ . Let ${ϵ_{k + 1}}_{k = 0}^{\infty}$ be a sequence satisfying ${ϵ_{k + 1}}_{k = 0}^{\infty} \subseteq [0, + \infty)$ and $\sum_{k = 0}^{\infty} \frac{ϵ_{k + 1}}{\sqrt{∥ M_{h_{k + 1}} ∥_{2}}} < \infty$ , mesh sizes ${h_{k + 1}}_{k = 0}^{\infty}$ of each iteration satisfy $\sum_{k = 0}^{\infty} h_{k + 1} < \infty .$ Set $k = 0$ .
Output: $u_{h_{k}}^{k}, z_{h_{k}}^{k}, λ_{h_{k}}^{k}$ .
Step 1: Compute $u_{h_{k + 1}}^{k + 1}$ as an approximation solution of

$min_{u} f (u) + 〈 M_{h_{k + 1}} λ_{h_{k + 1}}^{k}, u - z_{h_{k + 1}}^{k} 〉 + \frac{σ}{2} {∥ u - z_{h_{k + 1}}^{k} ∥}_{M_{h_{k + 1}}}^{2}$

such that the residual $δ_{u, h_{k + 1}}^{k + 1} : = \nabla f (u_{h_{k + 1}}^{k + 1}) + M_{h_{k + 1}} λ_{h_{k + 1}}^{k} + σ M_{h_{k + 1}} (u_{h_{k + 1}}^{k + 1} - z_{h_{k + 1}}^{k})$
satisfies $∥ δ_{u, h_{k + 1}}^{k + 1} ∥ \leq \frac{ϵ_{k + 1}}{\sqrt{∥ M_{h_{k + 1}} ∥_{2}}}$ .
Step 2: Compute $z_{h_{k + 1}}^{k + 1}$ as follows:

$z_{h_{k + 1}}^{k + 1} = \underset{z}{arg min} g (z) + 〈 M_{h} λ_{h_{k + 1}}^{k}, u_{h_{k + 1}}^{k + 1} - z 〉 + \frac{σ}{2} {∥ u_{h_{k + 1}}^{k + 1} - z ∥}_{W_{h}}^{2} .$
Step 3: Compute

$λ_{h_{k + 1}}^{k + 1} = I_{h_{k + 1}} λ_{h_{k}}^{k} + τ σ (u_{h_{k + 1}}^{k + 1} - z_{h_{k + 1}}^{k + 1}) .$
Step 4: If a termination criterion is met, stop; otherwise, set $k : = k + 1$ and go to Step 1.

The u-subproblem at the kth iteration is equivalent to the following linear system:

M_{h_{k}} K_{h_{k}}^{- 1} M_{h_{k}} (K_{h_{k}}^{- 1} M_{h_{k}} (u_{h_{k}}^{k} + y_{r, h_{k}}) - y_{d, h_{k}}) + α M_{h_{k}} u_{h_{k}}^{k} + λ_{h_{k}}^{k} + σ (u_{h_{k}}^{k} - z_{h_{k}}^{k}) = 0,

(8)

where

y_{h_{k}}^{k} : = K_{h_{k}}^{- 1} M_{h_{k}} (u_{h_{k}}^{k} + y_{r, h_{k}}), p_{h_{k}}^{k} : = K_{h_{k}}^{- 1} M_{h_{k}} (y_{d, h_{k}} - y_{h_{k}}^{k})

denote the discretized state and the discretized adjoint state, respectively. Then (8) can be rewritten as:

[\begin{matrix} M_{h_{k}} & 0 & K_{h_{k}} \\ 0 & (α + σ) M_{h_{k}} & - M_{h_{k}} \\ K_{h_{k}} & - M_{h_{k}} & 0 \end{matrix}] \begin{matrix}  [\begin{matrix} y_{h_{k}}^{k} \\ u_{h_{k}}^{k} \\ p_{h_{k}}^{k} \end{matrix}] \end{matrix} =  [\begin{matrix} M_{h_{k}} y_{d, h_{k}} \\ M_{h_{k}} (σ I_{h_{k}} z_{h_{k - 1}}^{k - 1} - I_{h_{k}} λ_{h_{k - 1}}^{k - 1}) \\ M_{h_{k}} y_{r, h_{k}} \end{matrix}] .

(9)

We know from (9) that

p_{h_{k}}^{k} = (α + σ) u_{h_{k}}^{k} - σ I_{h_{k}} z_{h_{k - 1}}^{k - 1} + I_{h_{k}} λ_{h_{k - 1}}^{k - 1}

. By eliminating the variable

p_{h_{k}}^{k}

, (9) can be rewritten in the following reduced form without any additional computational cost:

[\begin{matrix} M_{h_{k}} & (α + σ) K_{h_{k}} \\ - K_{h_{k}} & M_{h_{k}} \end{matrix}] \begin{matrix}  [\begin{matrix} y_{h_{k}}^{k} \\ u_{h_{k}}^{k} \end{matrix}] \end{matrix} =  [\begin{matrix} M_{h_{k}} y_{d, h_{k}} + K_{h_{k}} (σ I_{h_{k}} z_{h_{k - 1}}^{k - 1} - I_{h_{k}} λ_{h_{k - 1}}^{k - 1}) \\ - M_{h_{k}} y_{r, h_{k}} \end{matrix}] .

(10)

The equation system (10) can be solved by the generalized minimal residual (GMRES) with the preconditioned variant of modified hermitian and skew-hermitian splitting (PMHSS) preconditioner [23,24].

For the z-subproblem, there is a closed-form solution

z_{h_{k + 1}}^{k + 1} = Π_{{[a, b]}^{N_{h_{k + 1}}}} (\frac{1}{σ} soft (σ u_{h_{k + 1}}^{k + 1} + W_{h}^{- 1} M_{h} λ_{h_{k + 1}}^{k}, β)) .

3. Convergence Analysis

In this section, we establish the global convergence and the iteration complexity results in non-ergodic sense for the sequence generated by Algorithm 2. Before giving the convergence analysis, we first introduce the exact multi-level heterogeneous ADMM (mhADMM) algorithm. Each subproblem of the exact mhADMM algorithm is solved exactly. Given the initial point

(u^{0}, z^{0}, λ^{0}) \in H^{1} (Ω) \times H^{1} (Ω) \times H^{1} (Ω)

, parameters

σ > 0

,

τ \in (0, \frac{\sqrt{5} + 1}{2})

. The mesh sizes

{h_{k + 1}}_{k = 0}^{\infty}

of each iteration satisfy

\sum_{k = 0}^{\infty} h_{k + 1} < \infty .

Then, each iteration of the exact mhADMM has three main steps:

\{\begin{matrix} Step 1 : {\bar{u}}_{h_{k + 1}}^{k + 1} = \underset{u}{arg min} f_{h_{k + 1}} (u) + {〈 {\bar{λ}}_{h_{k + 1}}^{k}, u - {\bar{z}}_{h_{k + 1}}^{k} 〉}_{L^{2} (Ω_{h_{k + 1}})} + \frac{σ}{2} {∥ u - {\bar{z}}_{h_{k + 1}}^{k} ∥}_{L^{2} (Ω_{h_{k + 1}})}, \\ Step 2 : {\bar{z}}_{h_{k + 1}}^{k + 1} = \underset{z}{arg min} g_{h_{k + 1}} (z) + {〈 {\bar{λ}}_{h_{k + 1}}^{k}, {\bar{u}}_{h_{k + 1}}^{k + 1} - z 〉}_{L^{2} (Ω_{h_{k + 1}})} + \frac{σ}{2} {∥ {\bar{u}}_{h_{k + 1}}^{k + 1} - z ∥}_{L_{h_{k + 1}}^{2} (Ω_{h_{k + 1}})}^{2}, \\ Step 3 : {\bar{λ}}_{h_{k + 1}}^{k + 1} = {\bar{λ}}_{h_{k + 1}}^{k} + τ σ ({\bar{u}}_{h_{k + 1}}^{k + 1} - {\bar{z}}_{h_{k + 1}}^{k + 1}), \end{matrix}

(11)

where

{\bar{λ}}_{h_{k + 1}}^{k} : = I_{h_{k + 1}} {\bar{λ}}_{h_{k}}^{k}

,

{\bar{z}}_{h_{k + 1}}^{k} : = I_{h_{k + 1}} {\bar{z}}_{h_{k}}^{k}

. Then, we use the following lemma to measure the gap between the solution sequence obtained by the ADMM in function space and the exact mhADMM algorithm in finite dimensional space.

Lemma 2.

Let the initial point be

(z^{0}, λ^{0}) \in H^{1} (Ω) \times H^{1} (Ω)

. Let

{({\bar{u}}^{k}, {\bar{z}}^{k}, {\bar{λ}}^{k})}_{k = 0}^{\infty}

defined in (7) be the sequence generated by the ADMM in function space and

{({\bar{u}}_{h_{k}}^{k}, {\bar{z}}_{h_{k}}^{k}, {\bar{λ}}_{h_{k}}^{k})}_{k = 0}^{\infty}

defined in (11) be the sequence generated by the exact mhADMM algorithm. Then, for all

k ⩾ 1

, we have

\begin{matrix} ∥ {\bar{u}}^{k} - {\bar{u}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} \leq C_{u, k} h_{k}, \\ ∥ {\bar{z}}^{k} - {\bar{z}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} \leq C_{z, k} h_{k}, \\ ∥ {\bar{λ}}^{k - 1} - {\bar{λ}}_{h_{k}}^{k - 1} ∥_{L^{2} (Ω_{h_{k}})} \leq C_{λ, k} h_{k}, \end{matrix}

where

C_{u, k}, C_{z, k}, C_{λ, k}

are constants independent of

h_{k}

and there is a constant C such that

C_{u, k} \leq C

for any

k \geq 1

. Thus, we have

\sum_{k = 1}^{\infty} {∥ {\bar{u}}^{k} - {\bar{u}}_{h_{k}}^{k} ∥}_{L^{2} (Ω_{h_{k}})} \leq C \sum_{k = 1}^{\infty} h_{k} .

Proof.

We employ the mathematical induction to prove the conclusion. For

k = 1

, we know from Theorem 3 that

\begin{matrix} ∥ {\bar{λ}}^{0} - {\bar{λ}}_{h_{1}}^{0} ∥_{L^{2} (Ω_{h_{1}})} & = ∥ λ^{0} - I_{h_{1}} λ^{0} ∥_{L^{2} (Ω_{h_{1}})} \\ \leq c_{I} h_{1} {∥ λ^{0} ∥}_{H^{1} (Ω_{h_{1}})} \\ \leq C_{λ, 1} h_{1}, \end{matrix}

where

C_{λ, 1} : = c_{I} {∥ {\bar{λ}}^{0} ∥}_{H^{1} (Ω_{h_{1}})}

is a constant independent of h.

For u-subproblems,

{\bar{u}}^{1}

and

{\bar{u}}_{h_{1}}^{1}

satisfy the following optimality conditions, respectively,

\begin{matrix} S^{*} [S ({\bar{u}}^{1} + y_{r}) - y_{d}] + α {\bar{u}}^{1} + λ^{0} + σ ({\bar{u}}^{1} - z^{0}) = 0, \\ S_{h_{1}}^{*} [S_{h_{1}} ({\bar{u}}_{h_{1}}^{1} + I_{h_{1}} y_{r}) - I_{h_{1}} y_{d}] + α {\bar{u}}_{h_{1}}^{1} + I_{h_{1}} λ^{0} + σ ({\bar{u}}_{h_{1}}^{1} - I_{h_{1}} z^{0}) = 0 . \end{matrix}

By subtracting the two equalities above, we have

\begin{matrix}  [- (α + σ) I - S_{h_{1}}^{*} S_{h_{1}}] ({\bar{u}}^{1} - {\bar{u}}_{h_{1}}^{1}) \\ = & S^{*} S {\bar{u}}^{1} - S_{h_{1}}^{*} S_{h_{1}} {\bar{u}}^{1} + S^{*} S y_{r} - S_{h_{1}}^{*} S_{h_{1}} I_{h_{1}} y_{r} - S^{*} y_{d} + S_{h_{1}}^{*} I_{h_{1}} y_{d} + λ^{0} - I_{h_{1}} λ^{0} + σ I_{h_{1}} z^{0} - σ z^{0} . \end{matrix}

Then, we know from Lemma 2 in [19] that

∥ {\bar{u}}^{1} - {\bar{u}}_{h_{1}}^{1} ∥_{L^{2} (Ω_{h_{1}})} \leq C_{u, 1} h_{1}

, where

C_{u, 1}

is a constant independent of

h_{1}

.

For z-subproblems,

{\bar{z}}^{1}

and

{\bar{z}}_{h_{1}}^{1}

satisfy

\begin{matrix} {\bar{z}}^{1} & = Π_{U_{a d}} (\frac{1}{σ} soft (σ {\bar{u}}^{1} + λ^{0}, β)), \\ {\bar{z}}_{h_{1}}^{1} & = Π_{U_{a d}, h_{1}} (\frac{1}{σ} soft (σ u_{h_{1}}^{1} + λ_{h_{1}}^{0}, β)), \end{matrix}

respectively. Then, we know from the projection operator

Π

and the soft thresholding operator

soft (\cdot)

are nonexpansive, such that

\begin{matrix} ∥ {\bar{z}}^{1} - {\bar{z}}_{h_{1}}^{1} ∥_{L^{2} (Ω_{h_{1}})} & = ∥ Π_{U_{a d}} (\frac{1}{σ} soft (σ {\bar{u}}^{1} + λ^{0}, β)) - Π_{U_{a d}, h_{1}} (\frac{1}{σ} soft (σ {\bar{u}}_{h_{1}}^{1} + λ_{h_{1}}^{0}, β)) ∥_{L^{2} (Ω_{h_{1}})} \\ \leq \frac{1}{σ} {∥ σ {\bar{u}}^{1} - σ {\bar{u}}_{h_{1}}^{1} + λ^{0} - λ_{h_{1}}^{0} ∥}_{L^{2} (Ω_{h_{1}})} \\ \leq ∥ {\bar{u}}^{1} - {\bar{u}}_{h_{1}}^{1} ∥_{L^{2} (Ω_{h_{1}})} + \frac{1}{σ} {∥ λ^{0} - λ_{h_{1}}^{0} ∥}_{L^{2} (Ω_{h_{1}})} \\ \leq C_{u, 1} h_{1} + \frac{1}{σ} C_{λ, 1} h_{1} \\ = C_{z, 1} h_{1}, \end{matrix}

where

C_{z, 1} : = C_{u, 1} + \frac{1}{σ} C_{λ, 1}

. Hence, the statement is true for

k = 1

.

For

k > 1

, we assume the statement is true for

\forall j \leq k

. Then, for

j = k + 1

, we have

\begin{matrix} ∥ {\bar{λ}}^{k} - {\bar{λ}}_{h_{k + 1}}^{k} ∥_{L^{2} (Ω_{h_{k + 1}})} = & ∥ {\bar{λ}}^{k} - {\bar{λ}}_{h_{k}}^{k} + {\bar{λ}}_{h_{k}}^{k} - I_{h_{k + 1}} {\bar{λ}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k + 1}})} \\ \leq & ∥ {\bar{λ}}^{k} - {\bar{λ}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} + {∥ {\bar{λ}}_{h_{k}}^{k} - I_{h_{k + 1}} {\bar{λ}}_{h_{k}}^{k} ∥}_{L^{2} (Ω_{h_{k + 1}})} \\ \leq & ∥ {\bar{λ}}^{k - 1} - {\bar{λ}}_{h_{k}}^{k - 1} + τ σ ({\bar{u}}^{k} - {\bar{u}}_{h_{k}}^{k}) - τ σ ({\bar{z}}^{k} - {\bar{z}}_{h_{k}}^{k}) ∥_{L^{2} (Ω_{h_{k}})} \\ + ∥ {\bar{λ}}_{h_{k}}^{k} - I_{h_{k + 1}} {\bar{λ}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k + 1}})} \\ \leq & ∥ {\bar{λ}}^{k - 1} - {\bar{λ}}_{h_{k}}^{k - 1} ∥_{L^{2} (Ω_{h_{k}})} + τ σ (∥ {\bar{u}}^{k} - {\bar{u}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} + {∥ {\bar{z}}^{k} - {\bar{z}}_{h_{k}}^{k} ∥}_{L^{2} (Ω_{h_{k}})}) \\ + ∥ {\bar{λ}}_{h_{k}}^{k} - I_{h_{k + 1}} {\bar{λ}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k + 1}})} \\ \leq & (C_{λ, k} + τ σ C_{u, k} + τ σ C_{z, k}) h_{k} + c_{I} h_{k + 1} {∥ {\bar{λ}}_{h_{k}}^{k} ∥}_{H^{1} (Ω_{h_{k + 1}})} \\ \leq & C_{λ, k + 1} h_{k + 1}, \end{matrix}

where

C_{λ, k + 1} : = C_{k + 1} (C_{λ, k} h_{k} + τ σ C_{u, k} + τ σ C_{z, k}) + c_{I} {∥ {\bar{λ}}_{h_{k}}^{k} ∥}_{H^{1} (Ω_{h_{k + 1}})}

is a constant independent of

h_{k + 1}

. In the last equality, we use the property

\sum_{k = 0}^{\infty} h_{k} < \infty

; thus, there exists a constant

C_{k + 1}

such that

h_{k} < C_{k + 1} h_{k + 1}

.

For u-subproblems,

{\bar{u}}^{k + 1}

and

{\bar{u}}_{h_{k + 1}}^{k + 1}

satisfy the following optimality conditions respectively,

\begin{matrix} S^{*} [S ({\bar{u}}^{k} + y_{r}) - y_{d}] + α {\bar{u}}^{k + 1} + {\bar{λ}}^{k} + σ ({\bar{u}}^{k + 1} - {\bar{z}}^{k}) = 0, \\ S_{h_{k + 1}}^{*} [S_{h_{k + 1}} ({\bar{u}}_{h_{k + 1}}^{k + 1} + I_{h_{k + 1}} y_{r}) - I_{h_{k + 1}} y_{d}] + α {\bar{u}}_{h_{k + 1}}^{k + 1} + {\bar{λ}}_{h_{k + 1}}^{k} + σ ({\bar{u}}_{h_{k + 1}}^{k + 1} - {\bar{z}}_{h_{k + 1}}^{k}) = 0 . \end{matrix}

By subtracting the two equalities above, we have

\begin{matrix} -  [(α + σ) I + S_{h_{k + 1}}^{*} S_{h_{k + 1}}] ({\bar{u}}^{k + 1} - {\bar{u}}_{h_{k + 1}}^{k + 1}) \\ = & S^{*} S {\bar{u}}^{k + 1} - S_{h_{k + 1}}^{*} S_{h_{k + 1}} {\bar{u}}^{k + 1} + S^{*} S y_{r} - S_{h_{k + 1}}^{*} S_{h_{k + 1}} I_{h_{k + 1}} y_{r} - S^{*} y_{d} + S_{h_{k + 1}}^{*} I_{h_{k + 1}} y_{d} \\ + {\bar{λ}}^{k} - {\bar{λ}}_{h_{k + 1}}^{k} - σ ({\bar{z}}^{k} - {\bar{z}}_{h_{k + 1}}^{k}) . \end{matrix}

Then, we know from Lemma 2 in [19] that

∥ {\bar{u}}^{k + 1} - {\bar{u}}_{h_{k + 1}}^{k + 1} ∥_{L^{2} (Ω_{h_{k + 1}})} \leq C_{u, k + 1} h_{k + 1},

where

C_{u, k + 1}

is a constant independent of

h_{k + 1}

.

For z-subproblems,

{\bar{z}}^{k + 1}

and

{\bar{z}}_{h_{k + 1}}^{k + 1}

satisfy

\begin{matrix} {\bar{z}}^{k + 1} & = Π_{U_{a d}} (\frac{1}{σ} soft (σ {\bar{u}}^{k + 1} + λ^{k}, β)), \\ {\bar{z}}_{h_{k + 1}}^{k + 1} & = Π_{U_{a d}, h_{1}} (\frac{1}{σ} soft (σ {\bar{u}}_{h_{k + 1}}^{k + 1} + λ_{h_{k + 1}}^{k}, β)), \end{matrix}

respectively. We know the projection operator

Π

and the soft thresholding operator

soft (\cdot)

are nonexpansive, such that

\begin{matrix} ∥ {\bar{z}}^{k + 1} - {\bar{z}}_{h_{k + 1}}^{k + 1} ∥_{L^{2} (Ω_{h_{k + 1}})} & \leq \frac{1}{σ} {∥ σ {\bar{u}}^{k + 1} - σ {\bar{u}}_{h_{k + 1}}^{k + 1} + λ^{k} - λ_{h_{k + 1}}^{k} ∥}_{L^{2} (Ω_{h_{k + 1}})} \\ \leq ∥ {\bar{u}}^{k + 1} - {\bar{u}}_{h_{k + 1}}^{k + 1} ∥_{L^{2} (Ω_{h_{k + 1}})} + \frac{1}{σ} {∥ λ^{k} - λ_{h_{k + 1}}^{k} ∥}_{L^{2} (Ω_{h_{k + 1}})} \\ \leq C_{u, k + 1} h_{k + 1} + \frac{1}{σ} C_{λ, k + 1} h_{k + 1} \\ = C_{z, k + 1} h_{k + 1}, \end{matrix}

where

C_{z, k + 1} : = C_{u, k + 1} + \frac{1}{σ} C_{λ, k + 1}

. Hence, the statement is true for

j = k + 1

. We complete the whole proof of Lemma 2. □

Similarly, we have the following lemma. Lemma 3 shows the gap between the sequence

(u^{k}, z^{k}, λ^{k})

generated by Algorithm 1 and the sequence

(u_{h_{k}}^{k}, z_{h_{k}}^{k}, λ_{h_{k}}^{k})

generated by Algorithm 2.

Lemma 3.

Let the initial point be

(u^{0}, z^{0}, λ^{0}) \in H^{1} (Ω) \times H^{1} (Ω) \times H^{1} (Ω)

. Let

{(u^{k}, z^{k}, λ^{k})}_{k = 0}^{\infty}

be the sequence generated by Algorithm 1 and

{(u_{h_{k}}^{k}, z_{h_{k}}^{k}, λ_{h_{k}}^{k})}_{k = 0}^{\infty}

be the sequence generated by Algorithm 2. Then, for all

k ⩾ 1

, we have

\begin{matrix} ∥ u^{k} - u_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} \leq {\hat{C}}_{u, k} (h_{k} + ∥ δ_{u, h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})}), \\ ∥ z^{k} - z_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} \leq {\hat{C}}_{z, k} (h_{k} + ∥ δ_{u, h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})}), \\ ∥ λ^{k - 1} - λ_{h_{k}}^{k - 1} ∥_{L^{2} (Ω_{h_{k}})} \leq {\hat{C}}_{λ, k} (h_{k} + ∥ δ_{u, h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})}), \end{matrix}

where

{\hat{C}}_{u, k}, {\hat{C}}_{z, k}, {\hat{C}}_{λ, k}

are constants independent of

h_{k}

, and there exists a constant

\hat{C}

such that

{\hat{C}}_{u, k} \leq \hat{C}

for any

k \geq 1

. Thus we have

\sum_{k = 1}^{\infty} {∥ u^{k} - u_{h_{k}}^{k} ∥}_{L^{2} (Ω)} \leq \hat{C} \sum_{k = 1}^{\infty} (h_{k} + {∥ δ_{u, h_{k}}^{k} ∥}_{L^{2} (Ω_{h_{k}})}) .

Proof.

We employ the mathematical induction to prove the conclusion. The proof is similar to Lemma 2. We do not discuss this in detail here. □

The total error of utilizing numerical methods to solve PDE-constrained optimal control problem consists of two parts: the discretization error and the iteration error. These two kinds of errors can be regarded as the error of inexactly solving infinite-dimensional subproblems. Thus, the mhADMM algorithm can be regarded as the iADMM algorithm in function space. Inspired by the results of Theorem 2, we have the following convergence results.

Theorem 4.

Let

(y^{*}, u^{*}, z^{*}, p^{*}, λ^{*})

be the KKT point of (1),

{(u_{h_{k}}^{k}, z_{h_{k}}^{k}, λ_{h_{k}}^{k})}_{k = 0}^{\infty}

be the sequence generated by Algorithm 2 with the associated state

{y_{h_{k}}^{k}}_{k = 0}^{\infty}

and the adjoint state

{p_{h_{k}}^{k}}_{k = 0}^{\infty}

. Then we have

\begin{matrix} lim_{k \to \infty} {∥ u_{h_{k}}^{k} - u^{*} ∥_{L^{2} (Ω_{h_{k}})} + ∥ z_{h_{k}}^{k} - z^{*} ∥_{L^{2} (Ω_{h_{k}})} + ∥ λ_{h_{k}}^{k} - λ^{*} ∥_{L^{2} (Ω_{h_{k}})}} = 0, \\ lim_{k \to \infty} {∥ y_{h_{k}}^{k} - y^{*} ∥_{H_{0}^{1} (Ω_{h_{k}})} + ∥ p_{h_{k}}^{k} - p^{*} ∥_{H_{0}^{1} (Ω_{h_{k}})}} = 0 . \end{matrix}

Moreover, there exists a constant

\bar{C}

that only depends on the initial point

(u^{0}, z^{0}, λ^{0})

and the optimal solution

(u^{*}, z^{*}, λ^{*})

such that, for

k \geq 1

,

min_{1 \leq i \leq k} R_{h_{i}} (u_{h_{i}}^{i}, z_{h_{i}}^{i}, λ_{h_{i}}^{i}) \leq \frac{\bar{C}}{k}, lim_{k \to \infty} (k \cdot min_{1 \leq i \leq k} R_{h_{i}} (u_{h_{i}}^{i}, z_{h_{i}}^{i}, λ_{h_{i}}^{i})) = 0,

where

R_{h_{i}} : (u_{h_{i}}^{i}, z_{h_{i}}^{i}, λ_{h_{i}}^{i}) \to [0, \infty)

is defined as

R_{h} (u_{h_{i}}^{i}, z_{h_{i}}^{i}, λ_{h_{i}}^{i}) : = ∥ \nabla f_{h} (u_{h_{i}}^{i}) + λ_{h_{i}}^{i - 1} ∥_{L^{2} (Ω_{h_{i}})}^{2} + {dist}^{2} (0, - λ_{h_{i}}^{i - 1} + \partial g_{h} (z_{h_{i}}^{i})) + {∥ u_{h_{i}}^{i} - z_{h_{i}}^{i} ∥}_{L^{2} (Ω_{h_{i}})}^{2} .

Proof.

Note that

(u_{h_{k}}^{k}, z_{h_{k}}^{k}, λ_{h_{k}}^{k})

can be regarded as the inexact solution obtained by Algorithm 1. The total error

δ_{u}^{k}

consists of two parts, the discretization error from gradually refining the grid and the iteration error from the inexactly solving the subproblems. Then, we know from the optimality conditions of the u-subproblem in Algorithm 1 that

S^{*} [S (u_{h_{k}}^{k} + y_{r}) - y_{d}] + α u_{h_{k}}^{k} + λ_{h_{k - 1}}^{k - 1} + σ (u_{h_{k}}^{k} - z_{h_{k - 1}}^{k - 1}) = δ_{u}^{k} .

(12)

Moreover, we know from the optimality condition of the u-subproblem in ADMM in function space, the optimality conditions of the u-subproblem in Algorithm 2 and the exact multi-level ADMM that

S^{*} [S ({\bar{u}}^{k} + y_{r}) - y_{d}] + α {\bar{u}}^{k} + {\bar{λ}}^{k - 1} + σ ({\bar{u}}^{k} - {\bar{z}}^{k - 1}) = 0,

(13)

S_{h_{k}}^{*} [S_{h_{k}} (u_{h_{k}}^{k} + I_{h_{k}} y_{r}) - I_{h_{k}} y_{d}] + α u_{h_{k}}^{k} + λ_{h_{k}}^{k - 1} + σ (u_{h_{k}}^{k} - z_{h_{k}}^{k - 1}) = δ_{u, h_{k}}^{k},

(14)

S_{h_{k}}^{*} [S_{h_{k}} ({\bar{u}}_{h_{k}}^{k} + I_{h_{k}} y_{r}) - I_{h_{k}} y_{d}] + α {\bar{u}}_{h_{k}}^{k} + {\bar{λ}}_{h_{k}}^{k - 1} + σ ({\bar{u}}_{h_{k}}^{k} - {\bar{z}}_{h_{k}}^{k - 1}) = 0 .

(15)

Then, we know from (12)–(15) that

\begin{matrix} δ_{u}^{k} = & δ_{u}^{k} - δ_{u, h_{k}}^{k} + δ_{u, h_{k}}^{k} \\ = & δ_{u, h_{k}}^{k} + S^{*} S (u_{h_{k}}^{k} - {\bar{u}}^{k}) + (α + σ) (u_{h_{k}}^{k} - {\bar{u}}^{k}) + (λ^{k - 1} - {\bar{λ}}^{k - 1}) - σ (z^{k - 1} - {\bar{z}}^{k - 1}) \\ + S_{h_{k}}^{*} S_{h_{k}} ({\bar{u}}_{h_{k}}^{k} - u_{h_{k}}^{k}) + (α + σ) ({\bar{u}}_{h_{k}}^{k} - u_{h_{k}}^{k}) + ({\bar{λ}}_{h_{k}}^{k - 1} - λ_{h_{k}}^{k - 1}) - σ ({\bar{z}}_{h_{k}}^{k - 1} - z_{h_{k}}^{k - 1}) \\ = & δ_{u, h_{k}}^{k} + \underset{I_{1}}{\underset{︸}{[(α + σ) I + S^{*} S] ({\bar{u}}_{h_{k}}^{k} - {\bar{u}}^{k})}} + \underset{I_{2}}{\underset{︸}{(λ^{k - 1} - λ_{h_{k}}^{k - 1})}} - \underset{I_{3}}{\underset{︸}{σ (z^{k - 1} - z_{h_{k}}^{k - 1})}} \\ + \underset{I_{4}}{\underset{︸}{({\bar{λ}}_{h_{k}}^{k - 1} - {\bar{λ}}^{k - 1})}} - \underset{I_{5}}{\underset{︸}{σ ({\bar{z}}_{h_{k}}^{k - 1} - {\bar{z}}^{k - 1})}} + \underset{I_{6}}{\underset{︸}{(S^{*} S - S_{h_{k}}^{*} S_{h_{k}}) (u_{h_{k}}^{k} - {\bar{u}}_{h_{k}}^{k})}} . \end{matrix}

(16)

For the term

I_{1}

and

I_{4}

, we know from Lemma 2 that

\begin{matrix} ∥ I_{1} ∥_{L^{2} (Ω_{h_{k}})} & \leq (α + σ + ∥ S^{*} ∥ ∥ S ∥) C_{u, k} h_{k}, \end{matrix}

(17)

\begin{matrix} ∥ I_{4} ∥_{L^{2} (Ω_{h_{k}})} & \leq C_{λ, k} h_{k} . \end{matrix}

(18)

For the term

I_{5}

, we know from Theorem 3 and Lemma 2 that

\begin{matrix} ∥ I_{5} ∥_{L^{2} (Ω_{h_{k}})} & = σ ∥ {\bar{z}}_{h_{k}}^{k - 1} - {\bar{z}}_{h_{k - 1}}^{k - 1} + {\bar{z}}_{h_{k - 1}}^{k - 1} - {\bar{z}}^{k - 1} ∥_{L^{2} (Ω_{h_{k}})} \\ \leq σ ∥ {\bar{z}}_{h_{k}}^{k - 1} - {\bar{z}}_{h_{k - 1}}^{k - 1} ∥_{L^{2} (Ω_{h_{k}})} + σ {∥ {\bar{z}}_{h_{k - 1}}^{k - 1} - {\bar{z}}^{k - 1} ∥}_{L^{2} (Ω_{h_{k}})} \\ \leq σ c_{I} h_{k} {∥ {\bar{z}}_{h_{k - 1}}^{k - 1} ∥}_{H^{1} (Ω_{h_{k}})} + σ C_{z, k - 1} h_{k - 1} \\ \leq σ c_{I} h_{k} {∥ {\bar{z}}_{h_{k - 1}}^{k - 1} ∥}_{H^{1} (Ω_{h_{k}})} + σ C_{z, k - 1} C_{k} h_{k} \\ \leq c_{5} h_{k}, \end{matrix}

(19)

where

c_{5} : = σ c_{I} {∥ {\bar{z}}_{h_{k - 1}}^{k - 1} ∥}_{H^{1} (Ω_{h_{k}})} + σ C_{z, k - 1} C_{k}

is a constant,

h_{k - 1} \leq C_{k} h_{k}

.

For the term

I_{2}

, we know from Lemma 3 that

∥ I_{2} ∥_{L^{2} (Ω_{h_{k}})} \leq {\hat{C}}_{λ, k} (h_{k} + ∥ δ_{u, h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})}) .

(20)

For the term

I_{3}

, we know from Theorem 3 and Lemma 2 that

∥ I_{3} ∥_{L^{2} (Ω_{h_{k}})} \leq c_{3} (h_{k} + ∥ δ_{u, h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})}),

(21)

where

c_{3}

is a constant.

Finally, for the term

I_{6}

, we make use of the decomposition

\begin{matrix} ∥ u_{h_{k}}^{k} - {\bar{u}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} = & ∥ u_{h_{k}}^{k} - u^{k} + u^{k} - u^{*} + u^{*} - {\bar{u}}^{k} + {\bar{u}}^{k} - {\bar{u}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} \\ \leq & ∥ u_{h_{k}}^{k} - u^{k} ∥_{L^{2} (Ω_{h_{k}})} + ∥ u^{k} - u^{*} ∥_{L^{2} (Ω_{h_{k}})} + ∥ u^{*} - {\bar{u}}^{k} ∥_{L^{2} (Ω_{h_{k}})} + {∥ {\bar{u}}^{k} - {\bar{u}}_{h_{k}}^{k} ∥}_{L^{2} (Ω_{h_{k}})} \\ \leq & {\hat{C}}_{u, k} (h_{k} + ∥ δ_{u, h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})}) + C^{*}, \end{matrix}

(22)

where

{\hat{C}}_{u, k}, C^{*}

are constants in dependent of

h_{k}

. In the last equality, we used Lemma 2, Lemma 3, the convergence property of ADMM in function space and the inexact ADMM in function space. Then, we know from Proposition 1 and Lemma 1 that

\begin{matrix} ∥ I_{6} ∥_{L^{2} (Ω_{h_{k}})} = & ∥ (S^{*} S - S^{*} S_{h_{k}} + S^{*} S_{h_{k}} - S_{h_{k}}^{*} S_{h_{k}}) (u_{h_{k}}^{k} - {\bar{u}}_{h_{k}}^{k}) ∥_{L^{2} (Ω_{h_{k}})} \\ \leq & ∥ (S^{*} S - S^{*} S_{h_{k}}) (u_{h_{k}}^{k} - {\bar{u}}_{h_{k}}^{k}) ∥_{L^{2} (Ω_{h_{k}})} + {∥ (S^{*} S_{h_{k}} - S_{h_{k}}^{*} S_{h_{k}}) (u_{h_{k}}^{k} - {\bar{u}}_{h_{k}}^{k}) ∥}_{L^{2} (Ω_{h_{k}})} \\ \leq & ∥ S^{*} ∥ ∥ S - S_{h_{k}} ∥ ∥ u_{h_{k}}^{k} - {\bar{u}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} + ∥ S^{*} - S_{h_{k}}^{*} ∥ ∥ S_{h_{k}} ∥ ∥ u_{h_{k}}^{k} - {\bar{u}}_{h_{k}}^{k} ∥_{L^{2} (Ω_{h_{k}})} \\ \leq & c_{6} h_{k}, \end{matrix}

(23)

where

c_{6}

is a constant.

Then, we know from (16)–(23) that there are constants

C_{1}^{*}

and

C_{2}^{*}

such that

\begin{matrix} \sum_{k = 1}^{\infty} ∥ δ_{u}^{k} ∥_{L^{2} (Ω)} \leq C_{1}^{*} \sum_{k = 1}^{\infty} {∥ δ_{u, h_{k}}^{k} ∥}_{L^{2} (Ω_{h_{k}})} + C_{2}^{*} \sum_{k = 1}^{\infty} h_{k} . \end{matrix}

Moreover, the mesh sizes

{h_{k + 1}}_{k = 0}^{\infty}

of each mhADMM iteration satisfy

\sum_{k = 0}^{\infty} h_{k + 1} < \infty

, the residuals of each mhADMM iteration satisfy

\sum_{k = 0}^{\infty} {∥ δ_{u, h_{k + 1}}^{k + 1} ∥}_{L^{2} (Ω_{h_{k + 1}})} \leq \sum_{k = 0}^{\infty} ϵ_{k + 1} < \infty

, thus we have

\sum_{k = 1}^{\infty} {∥ δ_{u}^{k} ∥}_{L^{2} (Ω)} < \infty .

Then, we know from Theorem 2 that the global convergence and the iteration complexity results

o (1 / k)

for Algorithm 2 are guaranteed. □

4. Numerical Experiments

In this section, we illustrate the numerical performance of the mhADMM algorithm for the elliptic PDE-constrained optimization problems with

L^{1}

-control cost. For our numerical experiment, we used MATLAB R2021b with the FEM package iFEM [25] on a Thinkpad laptop with 2.8 GHz Intel Core i7 processor with 16GB of RAM.

In the mhADMM algorithm, the accuracy of a numerical solution is measured by the following residual. Let

ϵ

be a given accuracy tolerance, we terminate the algorithm when

η < ϵ

, where

η : = \max {η_{1}, η_{2}, η_{3}, η_{4}, η_{5}},

where

\begin{matrix} η_{1} & : = \frac{∥ K_{h} y - M_{h} u - M_{h} y_{r} ∥}{1 + ∥ M_{h} y_{r} ∥}, η_{2} : = \frac{∥ M_{h} (u - z) ∥}{1 + ∥ u ∥}, η_{3} : = \frac{∥ M_{h} (y - y_{d}) + K_{h} p ∥}{1 + ∥ M_{h} y_{d} ∥}, \\ η_{4} & : = \frac{∥ α M_{h} u - M_{h} p + M_{h} λ ∥}{1 + ∥ u ∥}, η_{5} : = \frac{∥ z - Π_{{[a, b]}^{N_{h}}} (soft (W_{h}^{- 1} M_{h} λ, β)) ∥}{1 + ∥ z ∥} . \end{matrix}

To present the finite element error estimates’ results, we introduce the experimental order of convergence, a brief EOC defined by

EOC : = \frac{log E (h_{1}) - log E (h_{2})}{log h_{1} - log h_{2}},

where

h_{1}, h_{2} > 0, h_{1} \neq h_{2}

denotes different grid sizes, E denotes the positive error functional

E (h) : = {∥u - u_{h}∥}_{L^{2} (Ω)} .

We note that if

E (h) = O (h^{γ})

holds, then

EOC \approx γ

.

As shown in Section 2.1, instead of using the standard piecewise linear and continuous finite elements, nodal quadrature formulas are used to approximately discretize the

L^{1}

-norm and

L^{2}

-norm in Algorithm 2. In both examples, the mhADMM algorithm, the ihADMM algorithm and the classical ADMM algorithm are employed to obtain numerical solutions of different grid sizes. For both numerical examples and all algorithms, we chose

(u^{0}, z^{0}, λ^{0}) = (0, 0, 0)

as the initial values. The penalty parameter

σ

was chosen as

σ = α

. For the step length

τ

, we chose

τ = 1.618

. We terminate the algorithms when the residual

η < 10^{- 6}

with the maximum number of iterations set to 500.

In numerical experiments, we show the numerical results for different final mesh sizes. In Table 1 and Table 2, h denotes the final mesh size, ‘#dofs’ denotes the dimension of the control variable on each grid level, and ‘iter’ represents the times of iteration. To guarantee the sequence

{ϵ_{k + 1}}_{k = 0}^{\infty} \subseteq [0, + \infty)

satisfies

\sum_{k = 0}^{\infty} \frac{ϵ_{k + 1}}{\sqrt{∥ M_{h_{k + 1}} ∥_{2}^{2}}} < \infty

, and the mesh sizes

{h_{k}}_{k = 0}^{\infty} \subseteq [0, + \infty)

of each iteration satisfy

\sum_{k = 0}^{\infty} h_{k + 1} \leq \infty

, we choose

ϵ_{k + 1} = \frac{C}{{(k + 1)}^{2}}

, where C is a constant and

h_{k} = \frac{\sqrt{2}}{2^{k + 3}}, k \in Z, k \geq 1

in both examples. Moreover, we would like to point out that, in the iteration of mhADMM algorithm, once the mesh size

h_{k}

reaches the final mesh size h, we continue the iteration in the final mesh until the stopping criterion above is satisfied.

Before providing examples, we first introduce the following algorithm, which can help us formulate sparse optimal control problems.

According to the first-order optimality condition given in Theorem 1, it is easy to see that Algorithm 4 provides a construction strategy for problems with known optimal solutions

(y^{*}, u^{*})

.

Algorithm 4 Construct the optimal control problem

Step 1 Choose $y^{*} \in H_{0}^{1} (Ω)$ and $p^{*} \in H_{0}^{1} (Ω)$ arbitrarily.
Step 2

$\begin{matrix} u^{*} : = & Π_{U_{a d}} (\frac{1}{α} soft (p^{*}, β)) \\ = \{\begin{matrix} min \{\frac{p^{*} - β}{α}, b\}, on x \in Ω : p^{*} (x) > β, \\ \max \{\frac{p^{*} + β}{α}, a\}, on x \in Ω : p^{*} (x) < - β \\ 0, elsewhere . \end{matrix} \end{matrix}$
Step 3 Set $y_{r} = S^{- 1} y^{*} - u^{*}$ and $y_{d} = y^{*} - {(S^{*})}^{- 1} p$ .

Example 1.

Consider

\{\begin{matrix} {min}_{(y, u) \in H_{0}^{1} (Ω) \times L^{2} (Ω)}^{} J (y, u) & = \frac{1}{2} ∥ y - y_{d} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} {∥ u ∥}_{L^{2} (Ω)}^{2} + β {∥ u ∥}_{L^{1} (Ω)} \\ s . t . - Δ y & = u + y_{r} in Ω, \\ y & = 0 on \partial Ω, \\ u & \in U_{a d} = {v (x) | a \leq v (x) \leq b, a . e on Ω}, \end{matrix}

where

Ω = {(0, 1)}^{2}

, the parameters

α = 0.5

,

β = 0.5

,

a = - 0.5

,

b = 0.5

. As this is a constructed problem, we set

y^{*} = sin (π x_{1}) sin (π x_{2})

and

p^{*} = 2 β sin (2 π x_{1}) exp (0.5 x_{1}) sin (4 π x_{2})

. Then, through Algorithm 4, we can obtain the optimal control solution

u^{*} = Π_{U_{a d}} (\frac{1}{α} soft (p^{*}, β))

, the source term

y_{r} = S^{- 1} y^{*} - u^{*}

and the desired state

y_{d} = y^{*} - {(S^{*})}^{- 1} p^{*}

. Thus, we construct the example for which we know the exact solution.

We then test the mhADMM, the ihADMM and the classical ADMM for Example 1. The exact optimal control u and an example for the numerical optimal control obtained by mhADMM on the grid with

h = \frac{\sqrt{2}}{2^{7}}

are shown in Figure 1.

In Table 1, we report the dimension of the control variable on each grid level, the error E of the control u, the EOC, the residual

η

, the computational time and the number of iterations obtained by the mhADMM, the ihADMM and the classical ADMM. As can be seen from the fourth and fifth column of Table 1, a high-subfigures are correct.precision solution does not improve the accuracy of the discretization error and the EOC. The computational time on the seventh, eighth and ninth columns show that the mhADMM is much faster than the ihADMM and the classical ADMM, especially when the discretization is at a fine level. The mhADMM algorithm can significantly reduce the computational cost and make the algorithm faster. This is mainly because the mhADMM adopts the strategy of gradually refining the grid, while the ihADMM and the classical ADMM compute the problem on a fixed grid size, which is their computational bottleneck. Moreover, the seventh column illustrates the mesh-independent performance of mhADMM; that is, the number of iteration of the mhADMM is independent of the discretization level. Above all, we can see that the mhADMM is much more efficient than the ihADMM and the classical ADMM.

Example 2.

Consider

\{\begin{matrix} {min}_{(y, u) \in H_{0}^{1} (Ω) \times L^{2} (Ω)}^{} J (y, u) & = \frac{1}{2} ∥ y - y_{d} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} {∥ u ∥}_{L^{2} (Ω)}^{2} + β {∥ u ∥}_{L^{1} (Ω)} \\ s . t . - Δ y & = u in Ω, \\ y & = 0 on \partial Ω, \\ u & \in U_{a d} = {v (x) | a \leq v (x) \leq b, a . e on Ω}, \end{matrix}

where

Ω = {(0, 1)}^{2}

, the parameters

α = 10^{- 4}

,

β = 10^{- 3}

,

a = - 10

,

b = 10

. The exact sparse solution of this problem is not known in advance. Instead, we use the numerical solutions computed on the grid with the grid size

h = \frac{\sqrt{2}}{2^{9}}

as reference solutions.

As an example, Figure 2 presents the numerical optimal control for Example 2 on the grid with

h = \frac{\sqrt{2}}{2^{7}}

.

Table 2 shows the dimension of the control variable at each grid level, the error E of the control u, the EOC, the residual

η

, the computational time and the number of iterations for three methods. Similar to Example 1, the mhADMM still outperforms the ihADMM and the classical ADMM in terms of the computational time. The fourth and fifth column of Table 2 clearly show that a high-precision solution does not improve the accuracy of the discretization error and the EOC. Furthermore, the numerical results in the seventh column also illustrate the mesh-independent performance of our mhADMM algorithm. These results demonstrate that the mhADMM is more efficient than the ihADMM and the classical ADMM.

5. Conclusions

In this paper, we propose a new, efficient, multilevel, heterogeneous ADMM (mhADMM) algorithm for solving sparse elliptic PDE-constrained optimal control problems with

L^{1}

-control cost and box constraints on the control. Specifically, the inexact ADMM is first applied in the function space. Then, we propose the strategy of gradually refining the grid and employ the standard piecewise linear finite element to discretize the related subproblems appearing in each iteration of the inexact ADMM algorithm. Moreover, nodal quadrature formulas are utilized to approximately discretize the

L^{1}

-norm and

L^{2}

-norm to overcome the difficulty that the

L^{1}

-norm does not have a decoupled form. Finally, subproblems are solved by appropriate numerical methods. Theoretical results regarding the global convergence and iteration complexity are presented. In our numerical experiments, we show that the proposed mhADMM is superior to the ihADMM and the classical ADMM in terms of the efficiency.

Author Contributions

Conceptualization, X.C. and Z.C.; methodology, X.S.; software, X.C.; validation, X.C. and L.X.; formal analysis, X.C. and X.S.; writing—original draft preparation, X.C.; writing—review and editing, L.X. and Z.C.; visualization, X.C.; supervision, X.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (No. 11971092), the National Natural Science Foundation of China (No. 42274166), the China Postdoctoral Science Foundation of China (No. 2020M670717), the Fundamental Research Funds for the Central Universities (No. N2105019), the Fundamental Research Funds for the Central Universities (No. 3132022200) and the Fundamental Research Funds for the Central Universities.

Data Availability Statement

The data presented in this study are available on request from the first author and corresponding author.

Acknowledgments

We will thank the reviewers for taking time off their busy schedule to review this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Stadler, G. Elliptic optimal control problems with L¹-control cost and applications for the placement of control devices. Comp. Optim. Appls. 2009, 44, 159–181. [Google Scholar] [CrossRef]
Ciaramella, G.; Borzì, A. A LONE code for the sparse control of quantum systems. Comput. Phys. Commun. 2016, 200, 312–323. [Google Scholar] [CrossRef]
Garcke, H.; Lam, K.F.; Signori, A. Sparse optimal control of a phase field tumor model with mechanical effects. SIAM J. Control Optim. 2021, 59, 1555–1580. [Google Scholar] [CrossRef]
Casas, E.; Tröltzsch, F. Sparse optimal control for a semilinear heat equation with mixed control-state constraints-regularity of Lagrange multipliers. ESAIM Control Optim. Calc. Var. 2021, 27, 2. [Google Scholar] [CrossRef]
Porcelli, M.; Simoncini, V.; Stoll, M. Preconditioning PDE-constrained optimization with L¹- sparsity and control constraints. Comput. Math. Appl. 2017, 74, 1059–1075. [Google Scholar] [CrossRef]
Schindele, A.; Borzì, A. Proximal methods for elliptic optimal control problems with sparsity cost functional. Appl. Math. 2016, 7, 967–992. [Google Scholar] [CrossRef] [Green Version]
Song, X.; Yu, B.; Wang, Y.; Zhang, X. An FE-inexact heterogeneous ADMM for elliptic optimal control problems with L¹-control cost. J. Syst. Sci. Complex. 2018, 31, 1659–1697. [Google Scholar] [CrossRef] [Green Version]
Song, X.; Yu, B. A two-phase strategy for control constrained elliptic optimal control problem. Numer. Linear Algebra Appl. 2018, 25, e2138. [Google Scholar] [CrossRef] [Green Version]
Chen, Z.; Song, X.; Zhang, X.; Yu, B. A FE-ADMM algorithm for Lavrentiev-regularized state-constrained elliptic control problem. ESAIM Control Optim. Calc. Var. 2019, 25, 5. [Google Scholar] [CrossRef] [Green Version]
Zhang, K.; Li, J.; Song, Y.; Wang, X. An alternating direction method of multipliers for elliptic equation constrained optimization problem. Sci. Chin. Math. 2017, 60, 361–378. [Google Scholar] [CrossRef]
Li, J.; Wang, X.; Zhang, K. An efficient alternating direction method of multipliers for optimal control problems constrained by random Helmholtz equations. Numer. Algorithms 2018, 78, 161–191. [Google Scholar] [CrossRef]
Glowinski, R.; Song, Y.; Yuan, X. An ADMM numerical approach to linear parabolic state constrained optimal control problems. Numer. Math. 2020, 144, 931–966. [Google Scholar] [CrossRef]
Glowinski, R.; Song, Y.; Yuan, X.; Yue, H. Application of the alternating direction method of multipliers to control constrained parabolic optimal control problems and beyond. Ann. Appl. Math. 2022, 38, 115–158. [Google Scholar] [CrossRef]
Shaidurov, V.V. Multigrid Methods for Finite Elements; Kluwer Academic Publics: Dordrecht, The Netherlands, 1995. [Google Scholar]
Bornemann, F.A.; Deuflhard, P. The cascadic multigrid method for elliptic problems. Numer. Math. 1996, 75, 135–152. [Google Scholar] [CrossRef]
Deuflhard, P. Newton Methods for Nonlinear Problems: Affine Invariance and Adaptive Algorithms; Springer: Berlin/Heildeberg, Germany, 2011. [Google Scholar]
Borzì, A.; Schulz, V. Multigrid methods for PDE optimization. SIAM Rev. 2009, 51, 361–395. [Google Scholar] [CrossRef]
Gong, W.; Xie, H.; Yan, N. Adaptive multilevel correction method for finite element approximations of elliptic optimal control problems. J. Sci. Comput. 2017, 72, 820–841. [Google Scholar] [CrossRef]
Chen, X.; Song, X.; Chen, Z.; Yu, B. A multilevel ADMM algorithm for elliptic PDE-constrained optimization problems. Comp. Appl. Math. 2020, 39, 331. [Google Scholar] [CrossRef]
Hinze, M.; Pinnau, R.; Ulbrich, M.; Ulbrich, S. Optimization with PDE Constraints; Springer: Berlin/Heildeberg, Germany, 2009. [Google Scholar]
Wachsmuth, G.; Wachsmuth, D. Convergence and regularization results for optimal control problems with sparsity functional. ESAIM Control Optim. Calc. Var. 2011, 17, 858–886. [Google Scholar] [CrossRef] [Green Version]
Ciarlet, P.G. The Finite Element Method for Elliptic Problems; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2002. [Google Scholar]
Bai, Z.; Benzi, M.; Chen, F.; Wang, Z. Preconditioned MHSS iteration methods for a class of block two-by-two linear systems with applications to distributed control problems. IMA J. Numer. Anal. 2013, 33, 343–369. [Google Scholar] [CrossRef]
Cao, S.; Wang, Z. PMHSS iteration method and preconditioners for Stokes control PDE-constrained optimization problems. Numer. Algorithms 2021, 87, 365–380. [Google Scholar] [CrossRef]
Chen, L. iFEM: An Integrated Finite Element Methods Package in MATLAB; Technical Report; University of California at Irvine: Irvine, CA, USA, 2008. [Google Scholar]

Figure 1. (a) The exact optimal control on the grid with

h = \frac{\sqrt{2}}{2^{7}}

for Example 1. (b) The numerical optimal control obtained by the mhADMM on the grid with

h = \frac{\sqrt{2}}{2^{7}}

for Example 1.

Figure 1. (a) The exact optimal control on the grid with

h = \frac{\sqrt{2}}{2^{7}}

for Example 1. (b) The numerical optimal control obtained by the mhADMM on the grid with

h = \frac{\sqrt{2}}{2^{7}}

for Example 1.

Figure 2. The numerical optimal control for Example 2 on the grid with

h = \frac{\sqrt{2}}{2^{7}}

.

Figure 2. The numerical optimal control for Example 2 on the grid with

h = \frac{\sqrt{2}}{2^{7}}

.

Table 1. The convergence behavior of the mhADMM, the ihADMM and the classical ADMM for Example 1.

Case	h	#dofs	E	EOC	Index	mhADMM	ihADMM	ADMM
1	$\frac{\sqrt{2}}{2^{4}}$	225	9.66 × 10 $^{- 2}$	0.99	residual $η$	8.93 × 10 $^{- 7}$	6.69 × 10 $^{- 7}$	6.46 × 10 $^{- 7}$
					time (s)	0.12	0.13	0.13
					#iter	20	16	18
2	$\frac{\sqrt{2}}{2^{5}}$	961	4.46 × 10 $^{- 2}$	1.05	residual $η$	9.44 × 10 $^{- 7}$	6.60 × 10 $^{- 7}$	8.63 × 10 $^{- 7}$
					time (s)	0.40	0.42	0.53
					#iter	20	18	24
3	$\frac{\sqrt{2}}{2^{6}}$	3969	1.49 × 10 $^{- 2}$	1.23	residual $η$	3.30 × 10 $^{- 7}$	7.37 × 10 $^{- 7}$	9.40 × 10 $^{- 7}$
					time (s)	2.16	2.67	8.51
					#iter	22	21	49
4	$\frac{\sqrt{2}}{2^{7}}$	16,129	4.92 × 10 $^{- 3}$	1.32	residual $η$	5.57 × 10 $^{- 7}$	5.54 × 10 $^{- 7}$	8.87 × 10 $^{- 7}$
					time (s)	14.86	18.35	262.56
					#iter	21	23	120
5	$\frac{\sqrt{2}}{2^{8}}$	65,025	1.65 × 10 $^{- 3}$	1.37	residual $η$	7.02 × 10 $^{- 7}$	4.61 × 10 $^{- 7}$	9.97 × 10 $^{- 7}$
					time (s)	114.33	200.43	7576.14
					#iter	20	25	257
6	$\frac{\sqrt{2}}{2^{9}}$	261,121	5.83 × 10 $^{- 4}$	1.39	residual $η$	6.91 × 10 $^{- 7}$	2.82 × 10 $^{- 7}$	1.17 × 10 $^{- 5}$
					time (s)	2457.43	3850.03	279,881.28
					#iter	20	27	500

Table 2. The convergence behavior of the mhADMM, the ihADMM and the classical ADMM for Example 2.

Case	h	#dofs	E	EOC	Index	mhADMM	ihADMM	ADMM
1	$\frac{\sqrt{2}}{2^{4}}$	225	9.69 × 10 $^{- 1}$	1.19	residual $η$	7.12 × 10 $^{- 7}$	7.12 × 10 $^{- 7}$	8.04 × 10 $^{- 7}$
					time (s)	0.25	0.30	0.33
					#iter	22	22	35
2	$\frac{\sqrt{2}}{2^{5}}$	961	5.72 × 10 $^{- 1}$	0.97	residual $η$	7.71 × 10 $^{- 7}$	7.77 × 10 $^{- 7}$	9.37 × 10 $^{- 7}$
					time (s)	0.95	1.10	1.66
					#iter	19	19	88
3	$\frac{\sqrt{2}}{2^{6}}$	3969	1.64 × 10 $^{- 1}$	1.25	residual $η$	6.03 × 10 $^{- 7}$	6.02 × 10 $^{- 7}$	7.63 × 10 $^{- 7}$
					time (s)	5.03	5.57	35.73
					#iter	20	20	198
4	$\frac{\sqrt{2}}{2^{7}}$	16,129	4.44 × 10 $^{- 2}$	1.41	residual $η$	8.48 × 10 $^{- 7}$	8.25 × 10 $^{- 7}$	8.10 × 10 $^{- 7}$
					time (s)	28.13	37.70	851.48
					#iter	20	20	454
5	$\frac{\sqrt{2}}{2^{8}}$	65,025	1.72 × 10 $^{- 2}$	1.40	residual $η$	7.15 × 10 $^{- 7}$	7.74 × 10 $^{- 7}$	1.66 × 10 $^{- 5}$
					time (s)	134.46	196.07	8659.08
					#iter	21	21	500
6	$\frac{\sqrt{2}}{2^{9}}$	261,121	-	-	residual $η$	9.09 × 10 $^{- 7}$	9.01 × 10 $^{- 7}$	6.17 × 10 $^{- 5}$
					time (s)	1052.59	1972.46	151,623.99
					#iter	21	21	500

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, X.; Song, X.; Chen, Z.; Xu, L. A Multilevel Heterogeneous ADMM Algorithm for Elliptic Optimal Control Problems with L¹-Control Cost. Mathematics 2023, 11, 570. https://doi.org/10.3390/math11030570

AMA Style

Chen X, Song X, Chen Z, Xu L. A Multilevel Heterogeneous ADMM Algorithm for Elliptic Optimal Control Problems with L¹-Control Cost. Mathematics. 2023; 11(3):570. https://doi.org/10.3390/math11030570

Chicago/Turabian Style

Chen, Xiaotong, Xiaoliang Song, Zixuan Chen, and Lijun Xu. 2023. "A Multilevel Heterogeneous ADMM Algorithm for Elliptic Optimal Control Problems with L¹-Control Cost" Mathematics 11, no. 3: 570. https://doi.org/10.3390/math11030570

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Multilevel Heterogeneous ADMM Algorithm for Elliptic Optimal Control Problems with L¹-Control Cost

Abstract

1. Introduction

2. An Multilevel Heterogeneous ADMM Algorithm

2.1. The mhADMM Algorithm

2.2. Numerical Computation of the Subproblems in Algorithm 2

3. Convergence Analysis

4. Numerical Experiments

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI