A Comparison Study of the Classical and Modern Results of Semi-Local Convergence of Newton–Kantorovich Iterations-II

Regmi, Samundra; Argyros, Ioannis K.; George, Santhosh; Argyros, Michael I.

doi:10.3390/math10111839

Open AccessArticle

A Comparison Study of the Classical and Modern Results of Semi-Local Convergence of Newton–Kantorovich Iterations-II

¹

Department of Mathematics, University of Houston, Houston, TX 77204, USA

²

Department of Mathematical Sciences, Cameron University, Lawton, OK 73505, USA

³

Department of Mathematical and Computational Sciences, National Institute of Technology Karnataka, Karnataka 575 025, India

⁴

Department of Computer Science, University of Oklahoma, Norman, OK 73019, USA

^*

Author to whom correspondence should be addressed.

Mathematics 2022, 10(11), 1839; https://doi.org/10.3390/math10111839

Submission received: 29 April 2022 / Revised: 24 May 2022 / Accepted: 25 May 2022 / Published: 27 May 2022

(This article belongs to the Special Issue Mathematics: 10th Anniversary)

Download Versions Notes

Abstract

:

This article is an independently written continuation of an earlier study with the same title [Mathematics, 2022, 10, 1225] on the Newton Process (NP). This process is applied to solve nonlinear equations. The complementing features are: the smallness of the initial approximation is expressed explicitly in turns of the Lipschitz or Hölder constants and the convergence order

1 + p

is shown for

p \in (0, 1] .

The first feature becomes attainable by further simplifying proofs of convergence criteria. The second feature is possible by choosing a bit larger upper bound on the smallness of the initial approximation. This way, the completed convergence analysis is finer and can replace the classical one by Kantorovich and others. A two-point boundary value problem (TPBVP) is solved to complement this article.

Keywords:

iterative processes; Banach space; semi-local convergence

MSC:

49M15; 65J15; 65G99

1. Introduction

Let

X_{1}

and

X_{2}

be Banach spaces, and let

Ω

be a nonempty convex subset of

X_{1} .

In addition,

F : Ω \subset X_{1} ⟶ X_{2}

is a Fréchet differentiable mapping between the Banach spaces

X_{1}

abd

X_{2} .

Let also

L (X_{1}, X_{2})

denote the space of bounded linear operators from

X_{1}

into

X_{2} .

The nonlinear equation

F (x) = 0,

(1)

plays an important role due to the fact that many applications can be brought to look like it. The celebrated Newton Process (NP) in the following form

x_{0} \in Ω, F^{'} (x_{n}) s_{n} = - F (x_{n}), x_{n + 1} = x_{n} + s_{n},

(2)

for

n = 0, 1, 2, \dots

is widely used to solve Equation (1) iteratively. This set up is motivated by the solution of corresponding differential equations (see also the Numerical Section 4).

Kantorovich initiated the semi-local convergence (SLC) analysis of (NP) by using the contraction mapping principle due to Banach [1,2]. He presented two different proofs based on majorant functions and recurrent relations [2,3]. The Newton–Kantorovich Theorem contains the (SLC) for (NP). A plethora of researchers utilized this result, in applications, and also as a theoretical tool.

An elementary scalar equation given in [1,2,4,5,6,7,8,9] shows that convergence criteria may not be satisfied. However, (NP) may converge. That is why these criteria are weakened in [6] without new conditions. However, only linear convergence was obtained for (NP) with the techniques employed in [6].

In the present study, by employing different and more precise techniques, the elusive convergence order

1 + p

is obtained for

p \in (0, 1] .

This is achieved by choosing a bit smaller upper bound on

∥ F^{'} {(x_{0})}^{- 1} F (x_{0}) ∥ .

Another new feature involves an explicit upper bound on the smallness of the initial approximation not given in [6]. Notice also that the present study is written completely independently of the corresponding one in [6]. The former reference is only mentioned to stretch the differences and the benefits of the new approach. Consequently, new results can always replace corresponding ones by Kantorovich [2] and others [7,8,9,10,11], as preceding results imply the one in this study but not necessarily vice versa. The method in this study uses smaller Lipschitz or Hölder parameters to achieve these extensions, which are specializations of earlier ones. That is, no additional effort is needed. The generality of this idea allows its application to other processes [3,5,6,7,9,10,11]. This will be the topic of future work.

Majorization of (NP) is presented in Section 2. The (SLC) of (NP) can be found in Section 3. Section 4 contains a Boundary Value Problem (BVP) as an application. Conclusions complete this study in Section 5.

2. Majorization

Let

K_{0}, L_{0}, K, L

be positive parameters and

η \geq 0 .

The sequence generated for

p \in (0, 1]

and for

\forall n = 0, 1, 2, \dots

by

v_{0} = 0, v_{1} = η

\begin{matrix} v_{2} & = & η + \frac{K η^{1 + p}}{(1 + p) (1 - K_{0} η^{p})}, \\ v_{n + 2} & = & v_{n + 1} + \frac{L {(v_{n + 1} - v_{n})}^{1 + p}}{(1 + p) (1 - L_{0} v_{n + 1}^{p})}, \end{matrix}

(3)

plays a critical role as a majorizing sequence for (NP) in the Lipschitz case (

p = 1

) as well as the Hölder one (

p \in (0, 1)

).

Two convergence results for sequence

{v_{n}}

are developed.

Lemma 1.

Suppose

K_{0} η^{p} < 1

and

L_{0} v_{n + 1}^{p} < 1

\forall n = 0, 1, 2, \dots .

Then, the following assertions hold

0 \leq v_{n} \leq v_{n + 1}

and

lim_{n ⟶ \infty} v_{n} = t^{*} \leq min \{v, {(\frac{1}{L_{0}})}^{\frac{1}{p}}\},

where

v = {(\frac{1}{K_{0}})}^{\frac{1}{p}} .

Proof.

The assertions follow by the definition of the sequence

{v_{n}}

and the conditions of Lemma 1. □

Another convergence result follows.

It is convenient to develop parameters,

d = v_{2} - v_{1}

,

δ_{0} = \frac{K d^{p}}{(1 + p) (1 - K_{0} η^{p})}, δ_{1} = 1 - \frac{d}{{(\frac{1}{L_{0}})}^{p} - η}

and set

S = [0, v) .

Moreover, introduce functions depending on parameter p and defined on the interval S for

n = 1, 2, \dots

by

h_{n, p} (t) = L η^{p} t^{n p} + (1 + p) L_{0} t {(η + \frac{1 - t^{n + 1}}{1 - t} d)}^{p} - (1 + p) t

and

g_{n, p} (t) = L t^{p} - L + (1 + p) L_{0} t [{(t^{- n} + \dots + 1 + t)}^{p} - {(t^{- n} + \dots + 1)}^{p}] .

Next, some properties for these functions are presented.

Lemma 2.

The following assertions hold

h_{n + 1, p} (t) = h_{n, p} (t) + g_{n, p} (t) t^{n p} η^{p},

\begin{matrix} g_{n + 1, p} (t) - g_{n, p} (t) & = & (1 + p) L_{0} t [{(t^{- n + 1} + \dots + t)}^{p} - {(t^{- n + 1} + \dots + t + 1)}^{p} \\ - ({(t^{- n} + \dots + 1 + t)}^{p} - {(t^{- n} + \dots + 1)}^{p})], \end{matrix}

g_{n, 1} (t) = L t - L + (1 + p) L_{0} t^{1 + p},

g_{n + 1, 1} (t) - g_{n, 1} (t) = 0 \forall t \in S

and

g_{n + 1, p} (t) - g_{n, p} (t) \leq 0 \forall t \in S, p \in (0, 1) .

Define the parameter δ by

δ = \{\begin{matrix} \frac{2 L}{L + \sqrt{L^{2} + 8 L_{0} L}}, & if p = 1 \\ the smallest zero in S - {0} of the function g_{1, p}, & if p \in (0, 1) . \end{matrix}

Moreover, suppose

(I): $0 \leq δ_{0} \leq δ \leq δ_{1}, i f p = 1$ or
(II): $η \leq \frac{1}{2}, η_{2}^{- 1} : = min {v, η_{0}, η_{1}},$ if $p \in (0, 1),$ where the parameter $η_{0}$ is the smallest zero in $(0, v)$ of the function

φ (t) = K {[\frac{K t^{1 + p}}{(1 + p) (1 - K_{0} t^{p})}]}^{p} - δ (1 + p) (1 - K_{0} t^{p})

and

η_{1} = {(\frac{(1 + p) δ}{L δ^{p} + (1 + p) L_{0} δ {(1 + δ + δ^{2})}^{p}})}^{\frac{1}{p}} .

Then, the sequence

{v_{n}}

is such that

0 \leq v_{n + 1} - v_{n} \leq δ (v_{n} - v_{n - 1}),

0 \leq v_{n} \leq v_{n + 1} \leq η + (1 + δ_{0} \frac{1 - δ^{n}}{1 - δ} d) \leq η + \frac{δ_{0} d}{1 - δ} = : t^{* *},

and

lim_{n ⟶ \infty} v_{n} = t^{*} \in [0, t^{* *}],

where

δ_{0} = \frac{L {(v_{2} - v_{1})}^{p}}{(1 + p) (1 - L_{0} t^{p})} .

Proof.

The assertions on functions

h_{n, p}

and

g_{n, p}

follow immediately by their definitions. If

p = 1

by using the quadratic formula the parameter

δ \in (0, 1)

is obtained. Then, the definition of the function

g_{1, p}

for

p \in (0, 1)

implies

g_{1, p} (0) < 0

and

g_{1, p} (1) > 0 .

Let

δ

again stand for the smallest zero of the function

g_{1, p}

in

S - {0}

assured to exist by the (IVT) (intermediate Value Theorem). Similarly the definition of parameter

η_{0}

is assured by (IVT), since

φ (0) = - δ (1 + p) < 0

and

φ (t) ⟶ \infty

as

t ⟶ v^{-} .

Notice also that under (I)

h_{n, 1} (t) \leq h_{n + 1, 1} (t) \leq h_{\infty} (t) \leq 0 \forall t \in [0, δ],

whereas under condition (II)

g_{1, p} (t) \leq 0 \forall t \in [0, η_{0}],

h_{1, p} (t) \leq 0 \forall t \in [0, η_{1}]

and

h_{n + 1, p} (t) \leq h_{n, p} (t) \forall t \in [0, δ],

where

h_{\infty, p} (t) = t [L_{0} {(η + \frac{d}{1 - t})}^{p} - 1] .

Hence, the sequence

{v_{n}}

is bounded from above by

t^{* *}

and non-decreasing. □

Next, we show that condition (I) can be solved in terms of

η

as in case (II).

Define the real quadratic polynomials

q, q_{1}, q_{2}

by

q (t) = L_{0} (K - 2 K_{0}) t^{2} + 2 L_{0} t - 1,

q_{1} (t) = (L K + 2 δ L_{0} (K - 2 K_{0})) t^{2} + 4 δ (L_{0} + K_{0}) t - 4 δ,

and

q_{2} (t) = L_{0} (K - 2 (1 - δ) K_{0}) t^{2} + 2 (1 - δ) (L_{0} + K_{0}) t - 2 (1 - δ) .

The discriminants

▵, ▵_{1}, ▵_{2}

of these polynomials can be written as

▵ = 4 L_{0} (L_{0} + K - 2 K_{0}) > 0,

▵_{1} = 16 δ (δ {(L_{0} - K_{0})}^{2} + (L + 2 δ L_{0}) K) > 0

and

▵_{2} = 4 (1 - δ) ((1 - δ) {(L_{0} - K_{0})}^{2} + 2 L_{0} K) > 0,

respectively. It follows by the definition of

δ, q_{1}

and

q_{2}

that

L = \frac{2 L_{0} δ^{2}}{1 - δ}, L K + 2 δ L_{0} (K - 2 K_{0}) = \frac{2 L_{0} δ}{1 - δ} (K - 2 (1 - δ) K_{0}),

and so

q_{1} (t) = \frac{2 L_{0} δ}{1 - δ} q_{2} (t) .

That is the polynomials

q_{1}

and

q_{2}

have the same roots. Denote by

\frac{1}{2 r_{1}}

the unique positive root of polynomial

q .

This root is given by the quadratic formula and can be written as

\frac{1}{2 r_{1}} = \frac{1}{L_{0} + \sqrt{L^{2} + L_{0} (K - 2 K_{0})}} .

Moreover, denote by

\frac{1}{2 r_{2}}

the common positive root of the polynomials

q_{1}

and

q_{2} .

This root can be written as

\frac{1}{2 r_{2}} = \frac{2}{δ (L_{0} + K_{0}) + \sqrt{{(δ (L_{0} + K_{0}))}^{2} + δ (K L + 2 δ L_{0} (K - 2 K_{0}))}} .

Define the parameter

η_{3}

by

η_{3}^{- 1} = min \{\frac{1}{2 r_{1}}, \frac{1}{2 r_{2}}\} .

Suppose that the nonnegative number

η

is such that

η_{3} η \leq \frac{1}{2} .

(4)

It is worth noticing that criterion (4) is written this way to make it looks like the usual Kantorovich criterion for Newton’s method in the Lipschitz case [2,10,11].

By the choice of the parameters

r_{1}

and

r_{2}

the polynomials

q, q_{1}, q_{2}

and the condition (4) we get follows that

L_{0} v_{2} < 1,

since

q (η) < 0

and

K_{0} η < 1 .

We infer that

q_{1} (η) \leq 0

and

q_{2} (η) \leq 0 .

Furthermore, the following estimate holds

δ_{0} \leq δ \leq 1 - \frac{L_{0} (v_{2} - v_{1})}{1 - L_{0} v_{1}} .

(5)

Indeed, the left hand side inequality reduces to showing

q_{1} (η) \leq 0

and the right hand side to showing

q_{2} (η) \leq 0 .

Conditions (4) provides the smallness of

η

to force convergence of the sequence

{v_{n}} .

By choosing

\frac{1}{2 η_{3}}

to be a little bit larger the convergence

1 + p

is recovered as follows. Let

ϵ \geq 0 .

Set

b = \frac{L}{1 + p} (1 + ϵ)

and

c = b^{- \frac{1}{p}} .

Define function

ψ_{\infty, p}

on interval S by

ψ_{\infty, p} (t) = (1 + ϵ) L_{0} {(t + \frac{d (t)}{(1 - t)})}^{p} - ϵ,

where

d (t) = \frac{K t^{p}}{(1 + p) (1 - K_{0} t^{p})} .

These definitions imply

ψ_{\infty, p} (0) = - ϵ < 0

and

ψ_{\infty, p} (t) ⟶ \infty

as

t ⟶ v^{-} .

Denote by

\frac{1}{η_{4}}

the smallest zero of the function

ψ_{\infty, p}

on the interval

(0, v) .

Define

η_{5} = max \{\begin{matrix} {η_{3}, \frac{1}{2} c, \frac{1}{2 η_{4}}}, & i f p = 1 \\ {η_{2}, \frac{1}{2} c, \frac{1}{2 η_{4}}}, & i f p \in (0, 1) . \end{matrix}

Let the sequence

{v_{n}}

be defined as in the Formula (3). Then its convergence is of order

1 + p .

Lemma 3.

Let

η \geq 0

be such that

η_{5} η < \frac{1}{2} .

(6)

Then, the following assertions hold

0 \leq v_{n + 1} - v_{n} \leq \frac{1}{c} {(c η)}^{{(1 + p)}^{n}}

and

t^{*} - v_{n} \leq \frac{1}{c (1 - c η)} {(c η)}^{{(1 + p)}^{n}} .

The convergence order of the sequence

{v_{n}}

is

1 + p .

Proof.

Induction is used to show

0 \leq \frac{L}{(1 + p) (1 - L_{0} v_{n + 1}^{p})} \leq b,

(7)

where

b_{0}^{1 + p} = {sup}_{n \geq 1} \frac{L^{p}}{{(1 + p)}^{p} {(1 - L_{0} v_{n}^{p})}^{p}}, v_{0} = η

and

b \geq b_{0} .

Then, this assertion holds for

n = 1

by the choice of

η_{0} .

Then, the assertion (7) holds if using Lemma 2

(1 + ϵ) L_{0} [η + {(1 + (1 + t + \dots + t^{n - 1}) d]}^{p} - ϵ \leq 0 .

Define the functions

ψ_{n, p} (t) = (1 + ϵ) L_{0} (η + {(1 + (1 + t + \dots + t^{n - 1}) d)}^{p} - ϵ \leq 0 .

It suffices to show

ψ_{n, p} (t) \leq 0 a t t = δ .

The definitions of the functions

{ψ_{n, p}}

yield

\begin{matrix} ψ_{n + 1, p} (t) - ψ_{n, p} (t) & = & (1 + ϵ) L_{0} {[η + {(1 + (1 + t + \dots + t^{n}) d]}^{p} \\ - [η + {(1 + (1 + t + \dots + t^{n - 1}) d]}^{p}} \geq 0 . \end{matrix}

Define the function

ψ_{\infty, p}

on the interval S by

ψ_{\infty, p} (t) = lim_{n ⟶ \infty} ψ_{n, p} (t) .

By the definition of the functions

ψ_{\infty, p},

it suffices to show

ψ_{\infty, p} (t) \leq 0,

which is true by the choice of

η_{4} .

The induction is completed. It follows by the sequence

{v_{n}}

and Lemma 2

\begin{matrix} b^{p} (v_{n + 1} - v_{n}) & \leq & {(b (v_{n} - v_{n - 1}))}^{1 + p} \\ \leq & b^{1 + p} {(b {(v_{n - 1} - v_{n - 2})}^{1 + p})}^{1 + p} \\ = & b^{1 + p} b^{1 + p} {(v_{n - 1} - v_{n - 2})}^{{(1 + p)}^{2}} \\ \leq & \dots \\ \leq & b^{(1 + p) + (1 + p) + \dots + {(1 + p)}^{n - 1}} η^{{(1 + p)}^{n}}, \end{matrix}

so

\begin{matrix} v_{n + 1} - v_{n} & \leq & b^{1 + (1 + p) + \dots + {(1 + p)}^{n - 1}} η^{{(1 + p)}^{n}} \\ = & b^{\frac{{(1 + p)}^{n - 1}}{p}} η^{{(1 + p)}^{n}} \\ = & \frac{1}{c} {(c η)}^{{(1 + p)}^{n}}, \end{matrix}

which shows the first assertion. Moreover, if

k = 1, 2, \dots

\begin{matrix} v_{n + k} - v_{n} & \leq & v_{n + k} - v_{n + k - 1} + \dots + v_{n + 1} - v_{n} \\ \leq & \frac{1}{c} [{(c η)}^{{(1 + p)}^{n + k - 1}} + \dots + {(c η)}^{{(1 + p)}^{n}}] \\ \leq & \frac{1}{c} {(c η)}^{{(1 + p)}^{n}} \frac{1 - {(c η)}^{{(1 + p)}^{n}}}{1 - c η} . \end{matrix}

The second assertion follows if

k ⟶ \infty

in the preceding calculation. □

It is worth noticing that Lemma 3 is used to provide weak convergence conditions for (NP). Then, the upper bounds on the iterates

v_{n + 1}

make the proof of Lemma 3 possible.

Next, the Banach lemma on the invertible operators is recalled.

Lemma 4

([1,2]). If Q is a bounded linear operator in

X_{1}, Q^{- 1}

exists if and only if there is a bounded linear operator P in

X_{1}

such that

P^{- 1}

exists and

∥ I - P Q ∥ \leq 1 .

If

Q^{- 1}

exists, then

Q^{- 1} = \sum_{n = 0}^{\infty} {(I - P T)}^{n} P

and

∥ Q^{- 1} ∥ \leq \frac{∥ P ∥}{1 - ∥ I - P Q ∥} .

3. Convergence of (NP)

The notation

U (w, ρ), U [w, ρ]

means the open and closed balls with radius

ρ > 0

and center

w \in X_{1},

respectively. The parameters

K_{0}, L_{0}, K, L

, and

η

are connected with the operator F as follows. Consider conditions (A):

Suppose

(A1): There exist $x_{0} \in Ω, η \geq 0$ such that $F^{'} {(x_{0})}^{- 1} \in L (X_{2}, X_{1}), x_{1} = x_{0} - F^{'} {(x_{0})}^{- 1} F (x_{0})$

$∥ F^{'} {(x_{0})}^{- 1} F (x_{0}) ∥ \leq η,$

$∥ F^{'} {(x_{0})}^{- 1} (F^{'} (x_{1}) - F^{'} (x_{0})) ∥ \leq K_{0} {∥ x_{1} - x_{0} ∥}^{p}$

and

$∥ F^{'} {(x_{0})}^{- 1} (F^{'} (x_{0} + ξ (x_{1} - x_{0})) - F^{'} (x_{0})) ∥ \leq K ∥ ξ (x_{1} - x_{0}) ∥^{p} .$
(A2): $∥ F^{'} {(x_{0})}^{- 1} (F^{'} (x) - F^{'} (x_{0})) ∥ \leq L_{0} {∥ x - x_{0} ∥}^{p}$ for $\forall x \in Ω .$
Set $B_{1} = U (x_{0}, {(\frac{1}{L_{0}})}^{\frac{1}{p}}) \cap Ω .$
(A3): $∥ F^{'} {(x_{0})}^{- 1} (F^{'} (x + ξ (y - x)) - F^{'} (x)) ∥ \leq L ∥ ξ (y - x) ∥^{p}$ for $\forall x, y \in B_{1}$ and for $\forall ξ \in [0, 1) .$
(A4): The conditions of the preceding Lemma 1 or Lemma 2 or Lemma 3 hold.
(A5): $U [x_{0}, t^{*}] \subset Ω .$

Notice that

K_{0} \leq K \leq L_{0} .

Next, conditions (A) are applied to show the main convergence result for (NP).

Theorem 1.

Under the conditions in (A) any (NP) sequence

{x_{n}}

is convergent to a solution

x^{*} \in U [x_{0}, t^{*}]

of the equation

F (x^{*}) = 0 .

Moreover, upper bounds of the form

∥ x^{*} - x_{n} ∥ \leq t^{*} - v_{n}

(8)

hold for all

n = 0, 1, 2, \dots .

Proof.

The assertions

∥ x_{i + 1} - x_{i} ∥ \leq v_{i + 1} - v_{i},

(9)

and

U [x_{i + 1}, t^{*} - v_{i + 1}] \subseteq U [x_{i}, t^{*} - v_{i}],

(10)

are shown by induction

\forall i = 0, 1, 2, \dots .

Let

u \in U [x_{1}, t^{*} - v_{1}] .

The following inequalities are consequences of conditions (A1) together with the equality

v_{0} = 0 .

∥ x_{1} - x_{0} ∥ = ∥ F^{'} {(x_{0})}^{- 1} F (x_{0}) ∥ \leq η = v_{1} - v_{0},

∥ u - x_{0} ∥ \leq ∥ u - x_{1} ∥ + ∥ x_{1} - x_{0} ∥ \leq t^{*} - v_{1} + v_{1} - v_{0} = t^{*} .

So, the vector

u \in U [x_{0}, t^{*} - v_{0}] .

That is assertions (9) and (10) hold for

i = 0 .

Assume these assertions hold if

i = 0, 1, \dots, n .

It follows for each

ξ \in [0, 1]

\begin{matrix} ∥ x_{i} + ξ (x_{i + 1} - x_{i}) - x_{0} ∥ & \leq & v_{i} + ξ (v_{i + 1} - v_{i}) \leq t^{*}, \end{matrix}

and

∥ x_{i + 1} - x_{i} ∥ \leq \sum_{j = 1}^{i + 1} ∥ x_{j} - x_{j - 1} ∥ \leq \sum_{j = 1}^{i + 1} (v_{j} - v_{j - 1}) = v_{i + 1} .

By the induction hypotheses, by Lemmas 1–3 and the conditions (A1), (A2), and (A4), it follows that

\begin{matrix} ∥ F^{'} {(x_{0})}^{- 1} (F^{'} (x_{i + 1}) - F^{'} (x_{0})) ∥ & \leq & \bar{K} {∥ x_{i + 1} - x_{0} ∥}^{p}, \\ \leq & \bar{K} {(v_{i + 1} - v_{0})}^{p} \leq \bar{K} v_{i + 1}^{p} < 1 . \end{matrix}

Hence, the inverse of the linear operator

F^{'} (x_{i + 1})

exists. Therefore,

F^{'} {(v)}^{- 1} \in L (X_{2}, X_{1})

and

∥ F^{'} {(x_{i + 1})}^{- 1} F^{'} (x_{0}) ∥ \leq \frac{1}{1 - \bar{K} v_{i + 1}^{p}}

(11)

follows as a consequence of Lemma 4, where

\bar{K} = \{\begin{matrix} K_{0}, & i = 0 \\ L_{0}, & i = 1, 2, \dots . \end{matrix}

The following general integral equality is implied by (NP)

\begin{matrix} F (x_{i + 1}) & = & F (x_{i + 1}) - F (x_{i}) - F^{'} (x_{i}) (x_{i + 1} - x_{i}), \\ = & \int_{0}^{1} (F^{'} (x_{i} + ξ (x_{i + 1} - x_{i})) - F^{'} (x_{i})) d ξ (x_{i + 1} - x_{i}) . \end{matrix}

(12)

Then, using the induction hypotheses, estimate (9) and condition (A3)

\begin{matrix} ∥ F^{'} {(x_{0})}^{- 1} F (x_{i + 1}) ∥ & \leq & \bar{L} \int_{0}^{1} (ξ ∥ x_{i + 1} - x_{i} {∥)}^{p} d ξ \\ \leq & \frac{\bar{L}}{1 + p} {(v_{i + 1} - v_{i})}^{1 + p}, \end{matrix}

(13)

where

\bar{L} = \{\begin{matrix} K, & i = 0 \\ L, & i = 1, 2, \dots . \end{matrix}

It follows by (NP), estimates (11), (13) and the definition (3) of the sequence

{v_{n}}

\begin{matrix} ∥ x_{i + 2} - x_{i + 1} ∥ & \leq & ∥ F^{'} {(x_{i + 1})}^{- 1} F^{'} (x_{0}) ∥ ∥ F^{'} {(x_{0})}^{- 1} F (x_{i + 1}) ∥, \\ \leq & \frac{\tilde{K} {(v_{i + 1} - v_{i})}^{1 + p}}{2 (1 - \tilde{L} v_{i + 1}^{p})} = v_{i + 2} - v_{i + 1}, \end{matrix}

where

\tilde{K} = \{\begin{matrix} K, & i = 0 \\ L, & i = 1, 2, \dots . \end{matrix}

and

\tilde{L} = \{\begin{matrix} K_{0}, & i = 0 \\ L_{0}, & i = 1, 2, \dots . \end{matrix}

Moreover, if

v \in U [x_{i + 2}, t^{*} - v_{i + 2}]

it follows

\begin{matrix} ∥ v - x_{i + 1} ∥ & \leq & ∥ v - x_{i + 2} ∥ + ∥ x_{i + 2} - x_{i + 1} ∥ \\ \leq & t^{*} - v_{i + 2} + v_{i + 2} - v_{i + 1} = t^{*} - v_{i + 1} . \end{matrix}

So, the vector

w \in U [x_{i + 1}, t^{*} - v_{i + 1}]

completing the induction for assertions (9) and (10). Notice that the scalar majorizing sequence

{v_{i}}

is fundamentally convergent. Hence, the sequence

{x_{i}}

is also convergent to some

x^{*} \in U [x_{0}, t^{*}] .

Furthermore, let

i ⟶ \infty

in estimate (13), to conclude

F (x^{*}) = 0 .

□

Next, the uniqueness ball for a solution is presented. Notice that not all conditions mentioned in (A) are used.

Proposition 1.

Let, for some

x_{0} \in Ω

the center-Lipschitz condition (A2) be satisfied. Further assume that there exists

0 < R < {(\frac{1 + p}{2 L_{0}})}^{\frac{1}{p}}

such that there exists a solution

s \in U (x_{0}, R) \subset Ω

of equation (1) and such that linear operator

F^{'} (s)

is invertible. Let the parameter

R_{1} \geq R

be given by

R_{1} = {(\frac{1 + p}{L_{0}} - R^{p})}^{\frac{1}{p}} .

(14)

Then, the point s solves uniquely the equation

F (x) = 0

in the set

B_{2} = U (x_{0}, R_{1}) \cap Ω .

Proof.

Define the linear operator

Q = \int_{0}^{1} F^{'} (\bar{s} + ξ (s - \bar{s})) d ξ

for some point

\bar{s} \in B_{2}

satisfying

F (\bar{s}) = 0 .

By using the definition of

R_{1},

set

B_{2}

and condition (A2),

\begin{matrix} ∥ F^{'} {(x_{0})}^{- 1} (F^{'} (x_{0}) - Q) ∥ & \leq & \int_{0}^{1} L_{0} {((1 - ξ)}^{p} ∥ x_{0} {- s ∥}^{p} + ξ^{p} ∥ x_{0} - \bar{s} ∥^{p}) d ξ, \\ < & \frac{L_{0}}{1 + p} (R_{1}^{p} + R^{p}) = 1, \end{matrix}

concluding that

s = \bar{s},

where the invertibility of the linear operator is also used together with the approximation

0 = F (s) - F (\bar{s}) = Q (s - \bar{s}) .

□

Remark 1.(i) Under the conditions in (A), the existence of

x^{*}

is assured. In this case set

q = x^{*}

and

R = t^{*} .

(ii) Condition (A3) can be replaced by

∥ F^{'} {(x_{0})}^{- 1} (F^{'} (w_{1} + ξ (w_{2} - w_{1})) - F^{'} (w_{1})) ∥ \leq d_{0} {∥ ξ (w_{1} - w_{2}) ∥}^{p}

(15)

\forall w_{1} \in B_{1}

and

w_{2} = w_{1} - F^{'} {(w_{1})}^{- 1} F (w_{1}) \in B_{1} .

This even smaller parameter

d_{0}

can replace L in the aforementioned results. The existence of the iterate

w_{2}

is assured by (A2) and Lemma 4. Notice that the proof of Theorem 1 goes through if condition (15) replaces stronger (A3).

(iii) Concerning the more general iteration

{{\bar{v}}_{n}}

studied in [6] defined by

\begin{matrix} {\bar{v}}_{0} & = & 0, {\bar{v}}_{1} = η, \\ {\bar{v}}_{2} & = & {\bar{v}}_{1} + \int_{0}^{1} \frac{{\bar{ψ}}_{θ} (θ ({\bar{v}}_{1} - {\bar{v}}_{0})) d θ ({\bar{v}}_{1} - {\bar{v}}_{0})}{1 - {\bar{ψ}}_{1} (\bar{K})}, \\ {\bar{v}}_{n + 2} & = & {\bar{v}}_{n + 1} + \frac{\int_{0}^{1} ψ_{θ} (θ ({\bar{v}}_{n + 1} - {\bar{v}}_{n})) d θ ({\bar{v}}_{n + 1} - {\bar{v}}_{n})}{1 - ψ_{1} ({\bar{v}}_{n + 1})} \forall n = 1, 2, 3, \dots . \end{matrix}

(16)

Suppose function

f_{θ} (t, u) = \frac{1}{t^{p}} \int_{0}^{1} \frac{ψ_{θ} (θ t) d θ}{1 - ψ_{1} (u)}

(17)

is nondecreasing and bounded from above by some

\bar{b} > 0 .

Then, the same proof as Lemma 3 recovers the

1 + p

order of convergence for this general iteration provided that

\bar{c} = {\bar{b}}^{- \frac{1}{p}}, η \leq \frac{1}{\bar{c}},

and the conditions of Lemma 1 or Lemma 2 in [6] hold. This is due to the calculation

\begin{matrix} {\bar{v}}_{n + 2} - {\bar{v}}_{n + 1} & = & \frac{\int_{0}^{1} ψ_{θ} (θ ({\bar{v}}_{n + 1} - {\bar{v}}_{n})) d θ {({\bar{v}}_{n + 1} - {\bar{v}}_{n})}^{1 + p}}{(1 - ψ_{1} ({\bar{v}}_{n + 1})) {({\bar{v}}_{n + 1} - {\bar{v}}_{n})}^{p}} \\ \leq & \bar{b} {({\bar{v}}_{n + 1} - {\bar{v}}_{n})}^{1 + p} for \forall {\bar{v}}_{n + 1} \neq {\bar{v}}_{n} . \end{matrix}

Then, under the conditions of Theorem 1 and Proposition 1 in [6] the conclusions, hold for a sequence

{x_{n}}

in this more general setting, where it is also shown that the convergence order is

1 + p .

In the case when

{\bar{ψ}}_{θ}, ψ_{θ}

are constant functions, then, set

\bar{b} = \frac{L (1 + ϵ)}{1 + p} .

Hence condition (17) can be realized. Notice that sequence

{\bar{v}}_{n}}

specializes to

{v_{n}}

if

{\bar{ψ}}_{1} (t) = K_{0} t^{p},

{\bar{ψ}}_{θ} (t) = K {(θ t)}^{p},

ψ_{1} (t) = L_{0} t^{p}

and

ψ_{θ} (t) = L {(θ t)}^{p} .

Under, these choices of functions Lemma 1 and Theorem 1 coincide with the corresponding ones in [6]. Moreover, the rest of the Lemmas in [6] show only linear convergence of majorizing sequences and, consequently of the sequence

{x_{n}} .

However, in Lemma 3, the convergence order

1 + p

is obtained.

Finally, in Lemma 2, in [6], the upper bound on η is not given explicitly in all cases, nor is the convergence order

1 + p .

However, the objective of this article is to do so. That explains the approach in this article.

4. Example

The solution of a BVP is presented as an application of theory.

Example 1.

Let us consider the two-point BVP(TPBVP)

\begin{matrix} u^{″} + u^{\frac{3}{2}} & = & 0 \\ u (0) = u (1) & = & 0 . \end{matrix}

The interval

[0, 1]

is divided into j subintervals. Set

m = \frac{1}{j} .

Denote by

w_{0} = 0 < w_{1} < \dots < w_{j} = 1

the points of subdivision with corresponding values of the function

u_{0} = u (w_{0}), \dots, u_{j} = u (w_{j}) .

Then, the discretization of

u^{″}

is given by

u_{k}^{″} \approx \frac{u_{k - 1} - 2 u_{k} + u_{k + 1}}{m^{2}} f o r \forall k = 2, 3, \dots j - 1 .

Further, notice that

u_{0} = u_{j} = 0 .

It follows that the following system of equations is obtained

\begin{matrix} m^{2} u_{1}^{\frac{3}{2}} - 2 u_{1} + u_{2} & = & 0, \\ u_{k - 1} + m^{2} u_{k}^{\frac{3}{2}} - 2 u_{k} + u_{k + 1} & = & 0 f o r \forall k = 2, 3, \dots, j - 1 \\ u_{j - 2} + m^{2} u_{j - 1}^{\frac{3}{2}} - 2 u_{j - 1} & = & 0 . \end{matrix}

This system can be converted into an operator equation as follows: Define operator

H : R^{j - 1} ⟶ R^{j - 1}

whose derivative is given as

H^{'} (u) = [\begin{matrix} \frac{3}{2} m^{2} u_{1}^{\frac{1}{2}} - 2 & 1 & 0 & \dots & 0 \\ 1 & \frac{3}{2} m^{2} u_{2}^{\frac{1}{2}} - 2 & 1 & 0 & \dots \\ 0 & ⋱ & ⋱ & ⋱ & ⋱ \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & \dots & 1 & 0 & \frac{3}{2} m^{2} u_{j - 1}^{\frac{1}{2}} - 2 \end{matrix}] .

Pick

z \in R^{j - 1}

be arbitrary. The norm used is

∥ z ∥ = {max}_{1 \leq k \leq j - 1} ∥ z_{k} ∥,

where as the norm for

H \in R^{j - 1} \times R^{j - 1}

is given as

∥ H ∥ = max_{1 \leq k \leq j - 1} \sum_{i = 1}^{j - 1} ∥ h_{k, i} ∥ .

Then, pick

u, z \in R^{j - 1}

for

| u_{k} | > 0, | z_{k} | > 0, f o r \forall k = 1, 2, \dots, j - 1

to obtain in turn

\begin{matrix} ∥ H^{'} (u) - H^{'} (z) ∥ & = & ∥ d i a g {\frac{3}{2} (u_{k}^{\frac{1}{2}} - z_{k}^{\frac{1}{2}})} ∥ \\ = & \frac{3}{2} m^{2} {[max_{1 \leq k \leq j - 1} | u_{k} - z_{k} |]}^{\frac{1}{2}} \\ = & \frac{3}{2} m^{2} {∥ u - z ∥}^{\frac{1}{2}} . \end{matrix}

Choose as an initial guess vector

130 sin π x

to obtain after four iterations

u_{0} = [3.35740 \times 10^{1},

6.5202 \times 10^{1}, 9.15664 \times 10^{1}, 1.09168 \times 10^{2}, 1.15363 \times 10^{2}, 1.09168 \times 10^{2}, 9.15664 \times 10^{1},

6.52027 \times 10^{1}, 3.35740 \times 10^{1}]^{t r} .

Then, the parameters are

∥ Q^{'} {(u_{0})}^{- 1} ∥ \leq 2.5582 \times 10^{1}, η = 9.15311 \times 10^{- 5},

p = 0.5, K_{0} = L_{0} = K = L = \frac{3}{200} = 0.015 .

Then,

K_{0} η^{p} = 1.4351 \times 10^{- 4} .

The following Table 1 shows that the conditions of Lemma 1 are satisfied, since

v_{n} = v_{n + m}

for

\forall n = 0, 1, 2, \dots, m = 0, 1, 2, \dots .

Hence, the conditions of Theorem 1 hold.

By using the initial vector on (NP) the generated vector is not good enough to apply Theorem 1. However, after four iterations, the vector

u_{0}

is good enough. Then, the Hölder constants are obtained simply using conditions (A1)–(A3) and taking the max-norm of the resulting vector or matrix. In this paper, the conditions of Lemma 1 are verified first, which are weaker.

Concerning the convergence order, one should verify conditions (6) of Lemma 3. Choose

ϵ = 0.8 .

Then, using the preceding values

η_{5} η < 0.47 < 0.5 .

Therefore, the convergence order is

1 + p = 1 + 0.5 = 1.5 .

Hence, the conclusions of Theorem 1 hold. The corresponding criteria in ([Remark 2, for the Hölder case], [6]) are

h_{1, p} (γ_{p}) \leq 0 a n d, 0 \leq δ_{0} \leq γ_{p},

where

δ_{0} = \frac{K {(v_{2} - v_{1})}^{p}}{(1 + p) (1 - K_{0} v_{2}^{p})}, γ_{p} = {(\frac{L}{L + (1 + p) L_{0}})}^{\frac{p}{1 + p}}

and

h_{1, p} (t) = \frac{L}{1 + p} t^{p} {(v_{2} - v_{1})}^{p} + t L_{0} {(v_{1} + (1 + t))}^{p} {(v_{2} - v_{1})}^{p} - t .

However, these conditions give an implicit estimate on the smallness

η,

they are not satisfied for this example for

p = 0.5 .

However, even if they were the convergence of the sequence

{x_{n}}

is only linear. The same is true if another criterion given in [6] by

0 \leq η \leq min \{\frac{2 γ_{1}}{(1 + 2 γ_{1}) L_{0}}, \frac{1}{K_{0} + L_{0}}\} .

That is even if it is verified the convergence order is only linear.

5. Conclusions

The two new features are explicit upper bounds on the smallness of

η .

Convergence order

1 + p

is also recovered by choosing a larger upper bound on

η .

New Lipschitz or Hölder parameters are smaller and specializations of previous parameters. The new theory can always replace previous ones due to a weaker a priori hypothesis. The strategy can be applied to other processes, such as Secant, Kurchatov, Stirling’s, Newton-like, and multistep [2,3,5,9,10,11]. This will be done in future work.

Author Contributions

Conceptualization, S.R., I.K.A., S.G., and M.I.A.; methodology, S.R., I.K.A., S.G., and M.I.A.; software, S.R., I.K.A., S.G., and M.I.A.; validation, S.R., I.K.A., S.G., and M.I.A.; formal analysis, S.R., I.K.A., S.G., and M.I.A.; investigation, S.R., I.K.A., S.G., and M.I.A.; resources, S.R., I.K.A., S.G., and M.I.A.; data curation, S.R., I.K.A., S.G., and M.I.A.; writing—original draft preparation, S.R., I.K.A., S.G., and M.I.A.; writing—review and editing, S.R., I.K.A., S.G., and M.I.A.; visualization, S.R., I.K.A., S.G., and M.I.A.; supervision, S.R., I.K.A., S.G., and M.I.A.; project administration, S.R., I.K.A., S.G., and M.I.A.; funding acquisition, S.R., I.K.A., S.G., and M.I.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We would like to express our gratitude to the reviewers for the constructive criticism of this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Argyros, I.K. Unified Convergence Criteria for Iterative Banach Space Valued Methods with Applications. Mathematics 2021, 9, 1942. [Google Scholar] [CrossRef]
Kantorovich, L.V.; Akilov, G.P. Functional Analysis; Pergamon Press: Oxford, UK, 1982. [Google Scholar]
Ezquerro, J.A.; Hernandez, M.A. Newton’s Scheme: An Updated Approach of Kantorovich’s Theory; Springer: Cham, Switzerland, 2018. [Google Scholar]
Appell, J.; De Pascale, E.; Lysenko, J.V.; Zabreiko, P.P. New results on Newton-Kantorovich approximations with applications to nonlinear integral equations. Numer. Funct. Anal. Optim. 1997, 18, 1–18. [Google Scholar] [CrossRef]
Argyros, I.K. The Theory and Applications of Iteration Methods, 2nd ed.; Engineering Series; CRC Press, Taylor and Francis Group: Boca Raton, FL, USA, 2022. [Google Scholar]
Regmi, S.; Argyros, I.K.; George, S.; Christopher, I.A. A Comparison Study of the Classical and Modern Results of Semi-Local Convergence of Newton-Kantorovich Iterations. Mathematics 2022, 10, 1225. [Google Scholar] [CrossRef]
Cianciaruso, F.; De Pascale, E. The Newton-Kantorovich approximations when the derivative is Hölder: Old and New Results. Numer. Funct. Anal. 2002, 24, 713–723. [Google Scholar] [CrossRef]
Zabreiko, P.P.; Nguen, D.F. The majorant method in the theory of Newton-Kantorovich approximations and the Pták error estimates. Numer. Funct. Anal. Optim. 1987, 9, 671–684. [Google Scholar] [CrossRef]
Verma, R. New Trends in Fractional Programming; Nova Science Publisher: New York, NY, USA, 2019. [Google Scholar]
Potra, F.A.; Pták, V. Nondiscrete Induction and Iterative Processes. In Research Notes in Mathematics; Pitman (Advanced Publishing Program): Boston, MA, USA, 1984; p. 103. [Google Scholar]
Proinov, P.D. New general convergence theory for iterative processes and its applications to Newton-Kantorovich type theorems. J. Complex. 2010, 26, 3–42. [Google Scholar] [CrossRef] [Green Version]

Table 1. Sequence (3).

n	1	2	3	4	5	6
$v_{n + 1}$	0.1435 $\times 10^{- 3}$	0.1435 $\times 10^{- 3}$	0.1435 $\times 10^{- 3}$	0.1435 $\times 10^{- 3}$	0.1435 $\times 10^{- 3}$	0.1435 $\times 10^{- 3}$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Regmi, S.; Argyros, I.K.; George, S.; Argyros, M.I. A Comparison Study of the Classical and Modern Results of Semi-Local Convergence of Newton–Kantorovich Iterations-II. Mathematics 2022, 10, 1839. https://doi.org/10.3390/math10111839

AMA Style

Regmi S, Argyros IK, George S, Argyros MI. A Comparison Study of the Classical and Modern Results of Semi-Local Convergence of Newton–Kantorovich Iterations-II. Mathematics. 2022; 10(11):1839. https://doi.org/10.3390/math10111839

Chicago/Turabian Style

Regmi, Samundra, Ioannis K. Argyros, Santhosh George, and Michael I. Argyros. 2022. "A Comparison Study of the Classical and Modern Results of Semi-Local Convergence of Newton–Kantorovich Iterations-II" Mathematics 10, no. 11: 1839. https://doi.org/10.3390/math10111839

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Comparison Study of the Classical and Modern Results of Semi-Local Convergence of Newton–Kantorovich Iterations-II

Abstract

1. Introduction

2. Majorization

3. Convergence of (NP)

4. Example

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI