On a modified secant method | Journal of Numerical Analysis and Approximation Theory

L'ANALYSE NUMÉRIQUE ET LA THÉORIE DE L'APPROXIMATION, Tome 8, N° 2, 1979, pp. 203-214

ON A MODIFIED SECANT METHOD

F. A. POTRA (Bucureşti)

Abstract. In this paper we apply the method of V. Pták ([4], [5]) to the study of the convergence of a modified secant method. We prove that the rate of convergence of this method is of the form

ω (r) = \frac{r}{d} (H r + d - 2 \sqrt{H^{2} a^{2} + H d r})

where

a, d, H

and

r

are positive numbers depending on the initial conditions. We also give sharp estimates for the distance

‖ x_{n} - x^{*} ‖, n = 1, 2, \dots,

where

{(x_{n})}_{n = 1}^{\infty}

is the sequence obtained by the modified secant method and

x^{*}

is its limit.

1. The Induction Theorem

The method of Nondiscrete Mathematical Induction, introduced by V. Pták [4], has allowed a new approach in the study of the convergence of iterative procedures. An important role in this approach is played by the notion of the rate of convergence

[5], [6]

. Let

T

be an interval of the form

T = {r \in R; 0 < r ≦ r_{0}}

, for some positive

r_{0} (i.e. T = ] 0, r_{0}])

. Let

ω

be a function defined on

T

. We define by recurrence:

ω^{0} (r) = r, ω^{n + 1} (r) = ω (ω^{n} (r)), n = 0, 1, 2, \dots

DEFINITION 1.1. The function ω, defined on

T

, is called a rate of convergence if it satisfies the following properties:
(1)

ω

maps

T

into itself;
(2) for each

r \in T

the series

\sum_{n = 0}^{\infty} ω^{n} (r)

is convergent.

The sum of the above series,

σ (r) = \sum_{n = 0}^{\infty} ω^{n} (r)

, obviously satisfies the following functional equation:

\begin{matrix} (3) & σ (r) = r + σ (ω (r)) . \end{matrix}

We shall justify the name of "rate of convergence", given to the function

ω

, after stating the Induction Theorem.
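As an illustration of Definition 1.1 (not taken from the paper), the sketch below uses the hypothetical rate ω(r) = r/2 on T = ]0, 1], for which the series sums to σ(r) = 2r. It computes partial sums of the series and evaluates the residual of the functional equation (3). The names `omega`, `iterate` and `sigma_partial` are assumptions made for this example only.

```python
def omega(r):
    """Hypothetical rate of convergence omega(r) = r / 2 (an assumption)."""
    return r / 2.0


def iterate(w, n, r):
    """Return w^n(r), where w^0(r) = r and w^(n+1)(r) = w(w^n(r))."""
    for _ in range(n):
        r = w(r)
    return r


def sigma_partial(w, r, terms=200):
    """Partial sum of the series sum_{n >= 0} w^n(r) that defines sigma(r)."""
    total = 0.0
    for _ in range(terms):
        total += r
        r = w(r)
    return total


r = 0.8
s = sigma_partial(omega, r)  # close to 2 * r for this particular omega
# Residual of the functional equation (3): sigma(r) = r + sigma(omega(r)).
residual = s - (r + sigma_partial(omega, omega(r)))
```

Any other map of T into itself with a convergent series of iterates could be substituted for `omega`; the halving map is chosen only because its sum is known in closed form.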

Let

(X, d)

be a complete metric space. If

A

is a subset of

X

, and

x

an element of

X

, we shall denote by

d (x, A)

the g.l.b. of the set

{d (x, y); y \in A}

. For any positive number

r

we shall denote by

U (A, r)

the set

{x \in X; d (x, A) ≦ r}

. If

x

is an element of

X

, we shall write for simplicity

U (x, r)

instead of

U ({x}, r)

Let us denote by

T

the interval

] 0, r_{0}]

of the real line, and for each

r \in T

, let

Z (r)

represent a certain subset of

X

. We shall use the following notation for the limit of the family

Z (\cdot)

:

\begin{matrix} (4) & Z (0) = ⋂_{s > 0} {(⋃_{r < s} Z (r))}^{-} \end{matrix}

Now, we can state the Induction Theorem [4].
THEOREM 1.1. If
(5)

Z (r) \subset U (Z (ω (r)), r)

for each

r \in T

, then

\begin{matrix} (6) & Z (r) \subset U (Z (0), σ (r)) \end{matrix}

for each

r \in T

.
We shall sketch below how the method of nondiscrete mathematical induction can be applied to the study of the convergence of iterative procedures. Let

F

be a mapping of the complete metric space

X

into itself, and let

x_{0}

be an element of

X

. Suppose that we can attach to the pair (

F, x_{0}

) a rate of convergence

ω

on the interval

T = ] 0, r_{0}]

, and a family of sets

{Z (r)}_{r \in T}

, such that the following relations are fulfilled:

\begin{matrix} (7) & x_{0} \in Z (r_{0}) \end{matrix}

\begin{matrix} (8) & x \in Z (r) \Rightarrow F (x) \in U (x, r) \cap Z (ω (r)) \text{ for each } r \in T . \end{matrix}

Then the Induction Theorem assures the fact that

Z (0) \neq \emptyset

. On the other hand, (8) implies that each element

ξ \in Z (0)

is a fixed element of the mapping

F

i.e.

F (ξ) = ξ

. It also follows that via the iterative procedure:

\begin{matrix} (9) & x_{n + 1} = F (x_{n}), n = 0, 1, 2, \dots \end{matrix}

we obtain a sequence

{(x_{n})}_{n = 0}^{\infty}

which converges to an element

x^{*} \in Z (0)

, such that the following inequalities are satisfied:

\begin{aligned} (10) & d (x_{n + 1}, x_{n}) ≦ ω^{n} (r_{0}), n = 0, 1, 2, \dots \\ (11) & d (x_{n}, x^{*}) ≦ σ (ω^{n} (r_{0})) n = 0, 1, 2, \dots \end{aligned}

From (10) one obtains the following estimate of the distance between the

n

-th iterate

x_{n}

and the "starting point"

x_{0}

\begin{matrix} (12) & d (x_{n}, x_{0}) ≦ σ (r_{0}) - σ (ω^{n} (r_{0})) \end{matrix}

The relation (11) will be called an a priori estimate of the distance between the

n

-th iterate given by the procedure (9) and the fixed point

x^{*}

. The name "a priori estimate" is justified by the fact that one can compute this estimate before performing the iterative procedure.

Suppose, that for a certain

n \in {1, 2, \dots}

, one has already computed

x_{1}, x_{2}, \dots, x_{n}

. If

\begin{matrix} (13) & x_{n - 1} \in Z (d (x_{n}, x_{n - 1})) \end{matrix}

then it can easily be proved that the following inequality is satisfied:

\begin{matrix} (14) & d (x_{n}, x^{*}) ≦ σ (ω (d (x_{n}, x_{n - 1}))) = σ (d (x_{n}, x_{n - 1})) - d (x_{n}, x_{n - 1}) \end{matrix}

The above estimate will be called an "a posteriori estimate", because it can be computed only after performing the iterative procedure (9). The a posteriori estimates are generally better than the a priori ones.

Summing up what we have stated above, we get the following:
Corollary. If the conditions (7) and (8) are satisfied, then by the iterative procedure (9) one obtains a sequence

{(x_{n})}_{n = 0}^{\infty}

which converges to a fixed point

x^{*}

of the mapping

F

, and for each

n \in {0, 1, 2, \dots}

the inequalities (10)-(12) are fulfilled. Moreover, if for a certain

n \in {1, 2, 3, \dots}

the condition (13) is satisfied, then for this

n

, the inequality (14) is also fulfilled.

The above corollary will be the basis of the proof of Theorem 3.1, concerning the convergence of the modified secant method, which will be given in Section 3.

2. Divided differences of an operator

The notion of divided difference of a (nonlinear) operator is an extension of the usual notion of divided difference of a function, in the same sense in which the Fréchet derivative of an operator is an extension of the classical notion of the derivative of a function. This notion was introduced by J. Schröder [8] and was used by A. Sergeev [9] and J. Schmidt [7] to extend the secant method to the iterative solution of nonlinear operator equations in Banach spaces.

Let

E

and

F

be two Banach spaces. We shall denote by

L (E, F)

the Banach space of all linear and bounded operators, from

E

into

F

. Let

f

be a (nonlinear) operator from

E

into

F

, and let

x

and

y

be two different points of the domain of

f

DEFINITION 2.1. A bounded linear operator

A \in L (E, F)

is called a divided difference of the operator

f

on the points

x

and

y

, if :

\begin{matrix} (15) & A (x - y) = f (x) - f (y) \end{matrix}

In the scalar case the divided difference of a function is unique, but in the general case this assertion is not true. Let us examine as an illustration the case where

E = F = R^{2}

. In this case, a nonlinear operator

f

is characterized by two real functions of two real variables

f_{1}

and

f_{2}

i.e.

(\forall) x = (\binom{x_{1}}{x_{2}}) \in R^{2}, f (x) = (\binom{f_{1} (x_{1}, x_{2})}{f_{2} (x_{1}, x_{2})})

Then each of the linear operators

A_{1}

and

A_{2}

given by the following two matrices satisfies (15):

A_{1} = (\begin{array}{cc} \frac{f_{1} (x_{1}, y_{2}) - f_{1} (y_{1}, y_{2})}{x_{1} - y_{1}} & \frac{f_{1} (x_{1}, x_{2}) - f_{1} (x_{1}, y_{2})}{x_{2} - y_{2}} \\ \frac{f_{2} (x_{1}, y_{2}) - f_{2} (y_{1}, y_{2})}{x_{1} - y_{1}} & \frac{f_{2} (x_{1}, x_{2}) - f_{2} (x_{1}, y_{2})}{x_{2} - y_{2}} \end{array}),

A_{2} = (\begin{array}{cc} \frac{f_{1} (x_{1}, x_{2}) - f_{1} (y_{1}, x_{2})}{x_{1} - y_{1}} & \frac{f_{1} (y_{1}, x_{2}) - f_{1} (y_{1}, y_{2})}{x_{2} - y_{2}} \\ \frac{f_{2} (x_{1}, x_{2}) - f_{2} (y_{1}, x_{2})}{x_{1} - y_{1}} & \frac{f_{2} (y_{1}, x_{2}) - f_{2} (y_{1}, y_{2})}{x_{2} - y_{2}} \end{array})

. If

f

is differentiable and its Fréchet derivative

f^{'}

is continuous on the segment

[x, y] = {t x + (1 - t) y; t \in [0, 1]}

, then the linear operator given by

A_{3} = \int_{0}^{1} f^{'} (x + t (y - x)) d t

also satisfies (15). That means that each of the three linear operators

A_{1}, A_{2}, A_{3}

is a divided difference of the operator

f

on the points

x

and

y

. Moreover, any convex combination of

A_{1}, A_{2}

and

A_{3}

is also a divided difference of

f

on the points

x

and

y

. If we have two divided differences of

f

on the points

x

and

y

, represented by the matrices

A

and

B

, then the matrix

C

, having its first row equal to the first row of

A

, and its second row equal to the second row of

B

, also represents a divided difference of

f

on the points

x

and

y
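The non-uniqueness described above can be checked numerically. The sketch below builds the matrices A_1 and A_2 for a hypothetical map f on R^2 (the map, the sample points and the helper names are assumptions chosen for illustration) and applies both to x - y, so that the result can be compared with f(x) - f(y) as required by (15). It assumes x_1 ≠ y_1 and x_2 ≠ y_2, so that the quotients are defined.

```python
def f(x1, x2):
    """Hypothetical smooth map R^2 -> R^2, used only for illustration."""
    return (x1 * x1 + x1 * x2, x2 ** 3 - x1)


def divided_difference_A1(f, x, y):
    """A_1: column 1 varies the first coordinate at y_2, column 2 the second at x_1."""
    (x1, x2), (y1, y2) = x, y
    a, b, c = f(x1, x2), f(x1, y2), f(y1, y2)
    return [[(b[i] - c[i]) / (x1 - y1), (a[i] - b[i]) / (x2 - y2)] for i in range(2)]


def divided_difference_A2(f, x, y):
    """A_2: column 1 varies the first coordinate at x_2, column 2 the second at y_1."""
    (x1, x2), (y1, y2) = x, y
    a, b, c = f(x1, x2), f(y1, x2), f(y1, y2)
    return [[(a[i] - b[i]) / (x1 - y1), (b[i] - c[i]) / (x2 - y2)] for i in range(2)]


def apply(A, v):
    """Matrix-vector product for a 2 x 2 matrix stored as nested lists."""
    return [A[i][0] * v[0] + A[i][1] * v[1] for i in range(2)]


x, y = (2.0, 1.0), (0.5, 3.0)
diff = [x[0] - y[0], x[1] - y[1]]
rhs = [f(*x)[i] - f(*y)[i] for i in range(2)]   # f(x) - f(y)
A1 = divided_difference_A1(f, x, y)
A2 = divided_difference_A2(f, x, y)
lhs1, lhs2 = apply(A1, diff), apply(A2, diff)   # both should reproduce rhs
```

For this particular f the two matrices differ in their first row, while both satisfy (15).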

Let us now return to the general case. Concerning the existence of the divided differences see [1]. Concerning other examples in some concrete spaces see

[10]

. Let us suppose that the closed sphere

U = U (x_{0}, m)

is included into the domain of the operator

f

, and let us denote by

D

the set

D = {(x, y) \in U \times U; x \neq y}

. We consider the mapping:

D ∋ (x, y) \mapsto [x, y; f] \in L (E, F)

where, for any pair

(x, y) \in D

, the linear operator

[x, y; f]

is a divided difference of

f

on the points

x

and

y

i.e. :

\begin{matrix} (16) & [x, y; f] (x - y) = f (x) - f (y) \end{matrix}

In [9] one assumes that the mapping

(x, y) \to [x, y; f]

is symmetric i.e.

[x, y; f] = [y, x; f]

. In [7] this condition is no longer required. Let us remark that in our example

A_{1}

and

A_{2}

are not symmetric, while

A_{3}

and

\frac{1}{2} A_{1} + \frac{1}{2} A_{2}

are.

In both of the above cited papers, one supposes, in order to assure sufficient conditions for the convergence of the secant method, that the mapping

(x, y) \to [x, y; f]

satisfies a Lipschitz condition at least. We shall write this condition under the form:

\begin{matrix} (17) & ‖ [x, y; f] - [u, v; f] ‖ ≦ H (‖ x - u ‖ + ‖ y - v ‖) . \end{matrix}

It is easy to prove that if the above inequality is fulfilled for all

x, y, u, v \in U = U (x_{0}, m)

, with

x \neq y

and

u \neq v

, then for each

x \in U

there exists the limit

\lim_{y \to x} [x, y; f]

, and it equals the Fréchet derivative

f^{'} (x)

. We have then:

\begin{matrix} (18) & ‖ f^{'} (x) - f^{'} (y) ‖ ≦ 2 H ‖ x - y ‖, x, y \in U \end{matrix}

The above remark allows us to take by definition

[x, x; f] = f^{'} (x)

for each

x \in U

. Thus (17) implies (18).

Conversely, if the operator

f

is Fréchet differentiable for each

x \in U

, and if (18) is satisfied, then there exists a mapping

U \times U ∋ (x, y) \mapsto [x, y; f] \in L (E, F)

which satisfies (16) and (17). We can take, for example,

[x, y; f] = \int_{0}^{1} f^{'} (x + t (y - x)) d t

This remark will be used to obtain the theorem concerning the convergence of the modified Newton's process [3] as a consequence of the theorem concerning the convergence of the modified secant method, which will be proved in the next section.
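In the scalar case the integral form of the divided difference can be evaluated directly. The following sketch approximates the integral of f'(x + t(y - x)) over [0, 1] with a composite Simpson rule for a hypothetical cubic f and evaluates the defect in relation (16). The function `integral_divided_difference`, the test function and the sample points are assumptions made for illustration.

```python
def f(x):
    """Hypothetical scalar test function."""
    return x ** 3 - 2.0 * x


def fprime(x):
    """Derivative of the test function above."""
    return 3.0 * x * x - 2.0


def integral_divided_difference(fprime, x, y, panels=50):
    """Composite Simpson approximation of int_0^1 f'(x + t (y - x)) dt."""
    n = 2 * panels                      # Simpson's rule needs an even count
    h = 1.0 / n
    total = fprime(x) + fprime(y)       # endpoint values at t = 0 and t = 1
    for k in range(1, n):
        weight = 4.0 if k % 2 else 2.0
        total += weight * fprime(x + k * h * (y - x))
    return total * h / 3.0


x, y = 1.5, 0.25
dd = integral_divided_difference(fprime, x, y)
# Defect in (16): [x, y; f](x - y) - (f(x) - f(y)); zero up to rounding here,
# because Simpson's rule integrates the quadratic f' exactly.
defect = dd * (x - y) - (f(x) - f(y))
```

Swapping the arguments x and y leaves the value unchanged, in agreement with the symmetry of this particular divided difference.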

3. The modified secant method

As in the preceding section, let

f

be a nonlinear operator from the Banach space

E

into the Banach space

F

, and let the sphere

U = U (x_{0}, m)

be included in its domain of definition. We suppose that there exists a mapping

U \times U ∋ (x, y) \mapsto [x, y; f] \in L (E, F)

which satisfies (16) and (17). Let

{\bar{x}}_{0}

be a point of

U

, for which the linear operator

[x_{0}, {\bar{x}}_{0}; f]

is boundedly invertible. The modified secant method that we are going to study consists of the following iterative procedure:

\begin{matrix} (19) & x_{n + 1} = x_{n} - {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} f (x_{n}), n = 0, 1, 2, \dots \end{matrix}

For the study of the convergence of the sequence

{(x_{n})}_{n = 0}^{\infty}

yielded by (19), we need some results concerning the behaviour of such a sequence in the particular case where

f

is a certain real quadratic polynomial.
LEMMA 3.1. If

d, H, q_{0}

and

r_{0}

are positive numbers satisfying the condition

\begin{matrix} (20) & {(\sqrt{r_{0}} + \sqrt{q_{0} + r_{0}})}^{2} ≦ \frac{d}{H} \end{matrix}

then the function

\begin{matrix} (21) & ω (r) = \frac{r}{d} (H r + d - 2 \sqrt{H^{2} a^{2} + H d r}) \end{matrix}

is a rate of convergence on the interval

T =] 0, r_{0}]

, and the corresponding function

σ

is given by

\begin{matrix} (22) & σ (r) = \sqrt{a^{2} + \frac{d}{H} r} - a, \end{matrix}

where,

\begin{matrix} (23) & a = \frac{1}{2 H} \sqrt{{(d - H q_{0})}^{2} - 4 H d r_{0}} \end{matrix}

Proof. First, we observe that the inequality (20) implies that the quantity under the square root sign in (23) is nonnegative. Let us consider the real polynomial

\begin{matrix} (24) & f (x) = H (x^{2} - a^{2}) \end{matrix}

It is easy to prove, that for any starting point

x_{0}

, chosen in the interval

] a, + \infty [

, and for any positive number

d

, belonging to the interval

[f^{'} (x_{0}), + \infty [

, the iterative procedure

\begin{matrix} (25) & x_{n + 1} = x_{n} - f (x_{n}) / d \end{matrix}

yields a sequence

{(x_{n})}_{n = 0}^{\infty}

, decreasingly converging to the root

x^{*} = a

of the equation

f (x) = 0

Setting for any

r \in] 0, r_{0}]

\begin{matrix} (26) & x_{0} = x_{0} (r) = \sqrt{a^{2} + \frac{d}{H} r} \end{matrix}

we have

x_{0} > x^{*}

, and

f (x_{0}) / d = r

. Taking

ω (r) = f (x_{1}) / d

and

σ (r) = x_{0} - x^{*}

we obtain the formulas (21) and (22).
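The computation behind the formulas for ω and σ is short enough to spell out. A sketch, using only the polynomial (24), the step (25) and the starting point (26):

```latex
\begin{aligned}
x_{0}^{2} - a^{2} &= \frac{d}{H} r, \qquad
x_{1} = x_{0} - \frac{f (x_{0})}{d} = x_{0} - r, \\
\omega (r) = \frac{f (x_{1})}{d}
 &= \frac{H}{d} \left( x_{0}^{2} - a^{2} - 2 r x_{0} + r^{2} \right)
  = \frac{r}{d} \left( H r + d - 2 H x_{0} \right) \\
 &= \frac{r}{d} \left( H r + d - 2 \sqrt{H^{2} a^{2} + H d r} \right), \\
\sigma (r) = x_{0} - x^{*} &= \sqrt{a^{2} + \frac{d}{H} r} - a .
\end{aligned}
```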

Denoting

{\bar{x}}_{0} = x_{0} (r_{0}) + q_{0}

, and computing the divided difference of the function

f

on the points

x_{0} (r_{0})

and

{\bar{x}}_{0}

we obtain

\begin{matrix} (27) & [x_{0}, {\bar{x}}_{0}; f] = d . \end{matrix}

Taking into account the fact that

f

is a convex function, we infer that

d ≧ f^{'} (x_{0} (r_{0})) ≧ f^{'} (x_{0} (r)) for any r \in] 0, r_{0}]

Thus, for each

r \in] 0, r_{0}]

, we shall obtain, via the iterative procedure (25), a sequence

{(x_{n})}_{n = 0}^{\infty}

, decreasingly converging to

x^{*}

. In this case it is clear that the functions

ω

and

σ

, defined as above, represent a rate of convergence and the function related to it. The following equalities are obviously satisfied :

\begin{matrix} (28) & x_{0} - x_{n} = σ (r) - σ (ω^{n} (r)), \end{matrix}

\begin{matrix} (29) & x_{n} - x_{n + 1} = ω^{n} (r), \end{matrix}

\begin{matrix} (30) & x_{n} - x^{*} = σ (ω^{n} (r)) . \end{matrix}
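The three equalities above lend themselves to a quick numerical check. The sketch below picks hypothetical values of d, H, q_0 and r_0 satisfying (20), runs the iteration (25) on the model polynomial (24) from the starting point (26) with r = r_0, and records the defects of the three equalities at each step. All numerical values and variable names are assumptions made for this example.

```python
import math

# Hypothetical sample data satisfying condition (20).
d, H, q0, r0 = 3.0, 1.0, 0.1, 0.2
assert (math.sqrt(r0) + math.sqrt(q0 + r0)) ** 2 <= d / H

a = math.sqrt((d - H * q0) ** 2 - 4.0 * H * d * r0) / (2.0 * H)  # formula (23)


def omega(r):
    """Rate of convergence, formula (21)."""
    return (r / d) * (H * r + d - 2.0 * math.sqrt(H * H * a * a + H * d * r))


def sigma(r):
    """Sum of the series of iterates, formula (22)."""
    return math.sqrt(a * a + (d / H) * r) - a


def f(x):
    """Model polynomial (24)."""
    return H * (x * x - a * a)


r = r0                                   # r holds omega^n(r0) during the loop
x = math.sqrt(a * a + (d / H) * r0)      # starting point (26) with r = r0
x_start = x
checks = []
for n in range(6):
    x_next = x - f(x) / d                # iteration (25)
    checks.append((
        (x_start - x) - (sigma(r0) - sigma(r)),  # defect of x_0 - x_n identity
        (x - x_next) - r,                        # defect of x_n - x_{n+1} identity
        (x - a) - sigma(r),                      # defect of x_n - x^* identity
    ))
    x, r = x_next, omega(r)
```

With these values every recorded defect is at the level of rounding error, which is what the lemma predicts for the model problem.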

Now, we are able to state our result concerning the modified secant method:
THEOREM 3.1. If the conditions (16) and (17) are satisfied for all

x, y, u, v \in U = U (x_{0}, m)

, and if the following inequalities:

\begin{matrix} (31) & {‖ {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} ‖}^{- 1} ≧ d \\ (32) & ‖ x_{0} - {\bar{x}}_{0} ‖ ≦ q_{0} \\ (33) & ‖ {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} f (x_{0}) ‖ ≦ r_{0} \\ (34) & {(\sqrt{r_{0}} + \sqrt{q_{0} + r_{0}})}^{2} ≦ \frac{d}{H} \\ (35) & m ≧ σ (r_{0}) \end{matrix}

are fulfilled, then the sequence

{(x_{n})}_{n = 0}^{\infty}

, obtained by the iterative procedure (19), converges to a root

x^{*}

of the equation

f (x) = 0

, and the following inequalities are satisfied :

\begin{matrix} (36) & ‖ x_{n} - x_{0} ‖ ≦ σ (r_{0}) - σ (ω^{n} (r_{0})), n = 0, 1, 2, \dots \\ (37) & ‖ x_{n} - x^{*} ‖ ≦ σ (ω^{n} (r_{0})), n = 0, 1, 2, \dots \\ (38) & ‖ x_{n} - x^{*} ‖ ≦ σ (‖ x_{n} - x_{n - 1} ‖) - ‖ x_{n} - x_{n - 1} ‖, n = 1, 2, 3, \dots \end{matrix}

where

ω

and

σ

are given respectively by (21) and (22).
Proof. The proof is based on the Corollary stated in Section 1 and on Lemma 3.1 proved in the present section. The iterative procedure (19) is of the form (9) with

F (x) = x - {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} f (x)

, for

x \in U

. Taking into account the invertibility of

[x_{0}, {\bar{x}}_{0}; f]

, it follows that every fixed point of

F

is a root of the equation

f (x) = 0

. We attach to the pair (

F, x_{0}

) the rate of convergence

ω

given by (21) and the family of sets:

\begin{matrix} (39) & Z (r) = {x \in E; ‖ {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} f (x) ‖ ≦ r, ‖ x - x_{0} ‖ ≦ σ (r_{0}) - σ (r)}, r \in ] 0, r_{0}] \end{matrix}

It is clear that

Z (r_{0}) = {x_{0}}

, so that condition (7) of the above mentioned Corollary is satisfied. We shall prove that condition (8) is also satisfied. Let

x

be an element of

Z (r)

, and let

\begin{matrix} (40) & x^{'} = F (x) = x - {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} f (x) \end{matrix}

Using (3) we can write

\begin{matrix} (41) & ‖ x^{'} - x_{0} ‖ ≦ ‖ x^{'} - x ‖ + ‖ x - x_{0} ‖ ≦ r + σ (r_{0}) - σ (r) = σ (r_{0}) - σ (ω (r)) \end{matrix}

From (16) and (40) we infer that

\begin{aligned} f (x^{'}) = f (x^{'}) - f (x) - [x_{0}, {\bar{x}}_{0}; f] (x^{'} - x) = \\ = ([x^{'}, x; f] - [x_{0}, {\bar{x}}_{0}; f]) (x^{'} - x) \end{aligned}

According to the conditions (17), (31) and (32), the above equality yields:

‖ {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} f (x^{'}) ‖ ≦ \frac{H}{d} (2 ‖ x - x_{0} ‖ + ‖ x^{'} - x ‖ + ‖ x_{0} - {\bar{x}}_{0} ‖) ‖ x^{'} - x ‖ .

Using (22), (23), (39) and (40), we obtain

‖ {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} f (x^{'}) ‖ ⩽ ω (r)
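The passage from the previous inequality to this bound can be made explicit. A sketch, assuming x ∈ Z(r), so that ‖x' - x‖ ≦ r and ‖x - x_0‖ ≦ σ(r_0) - σ(r), and using the identity 2σ(r_0) + 2a = d/H - q_0, which follows from the formulas for σ and a:

```latex
\begin{aligned}
\frac{H}{d} \left( 2 \| x - x_{0} \| + \| x' - x \| + \| x_{0} - \bar{x}_{0} \| \right) \| x' - x \|
 &\le \frac{H r}{d} \left( 2 \sigma (r_{0}) - 2 \sigma (r) + r + q_{0} \right) \\
 &= \frac{H r}{d} \left( \frac{d}{H} - q_{0} - 2 a
    - 2 \sqrt{a^{2} + \frac{d}{H} r} + 2 a + r + q_{0} \right) \\
 &= \frac{r}{d} \left( H r + d - 2 \sqrt{H^{2} a^{2} + H d r} \right) = \omega (r) .
\end{aligned}
```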

This relation together with (41) imply that

x^{'} \in Z (ω (r))

so that condition (8) is also fulfilled. It follows that by the iterative procedure (19), one obtains a sequence

{(x_{n})}_{n = 0}^{\infty}

which converges to a root

x^{*}

of the equation

f (x) = 0

. Moreover for each

n \in {0, 1, 2, \dots}

the inequalities (10)-(12) are satisfied. But the inequalities (11) and (12), correspond respectively to the inequalities (37) and (36), while from (10), (12), and from the fact that

σ

is an increasing function on

] 0, r_{0}]

we infer that

‖ x_{n - 1} - x_{0} ‖ ≦ σ (r_{0}) - σ (‖ x_{n} - x_{n - 1} ‖), n = 1, 2, 3, \dots

The above relation shows that

x_{n - 1} \in Z (‖ x_{n} - x_{n - 1} ‖)

for

n = 1, 2, 3, \dots

so that the condition (13) of the Corollary is fulfilled. Consequently the a posteriori estimate (38), which corresponds to the inequality (14), will be satisfied for

n = 1, 2, 3, \dots
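To see the three estimates of the theorem at work, the sketch below runs the modified secant iteration (19) on a hypothetical scalar equation, x² - 2 = 0, and tabulates the true error next to the a priori and a posteriori bounds. For this f the divided difference is [x, y; f] = x + y, so (17) holds with H = 1. The starting pair, the constants and all variable names are assumptions made for this example.

```python
import math


def f(x):
    """Hypothetical test equation f(x) = 0 with root sqrt(2)."""
    return x * x - 2.0


x0, x0_bar = 1.5, 1.49        # starting pair (an assumption)
H = 1.0                        # [x, y; f] = x + y satisfies (17) with H = 1
d = x0 + x0_bar                # scalar divided difference [x0, x0_bar; f]
q0 = abs(x0 - x0_bar)          # (32) with equality
r0 = abs(f(x0) / d)            # (33) with equality
assert (math.sqrt(r0) + math.sqrt(q0 + r0)) ** 2 <= d / H   # condition (34)

a = math.sqrt((d - H * q0) ** 2 - 4.0 * H * d * r0) / (2.0 * H)


def omega(r):
    """Rate of convergence of Lemma 3.1."""
    return (r / d) * (H * r + d - 2.0 * math.sqrt(H * H * a * a + H * d * r))


def sigma(r):
    """Corresponding sum function of Lemma 3.1."""
    return math.sqrt(a * a + (d / H) * r) - a


root = math.sqrt(2.0)
x_prev, x, r = x0, x0, r0      # r holds omega^n(r0)
rows = []
for n in range(8):
    a_priori = sigma(r)                       # right-hand side of (37)
    a_posteriori = None
    if n >= 1:
        step = abs(x - x_prev)
        a_posteriori = sigma(step) - step     # right-hand side of (38)
    rows.append((n, abs(x - root), a_priori, a_posteriori,
                 abs(x - x0), sigma(r0) - sigma(r)))   # last two: both sides of (36)
    x_prev, x, r = x, x - f(x) / d, omega(r)  # step (19) with the frozen operator
```

Each entry of `rows` holds the index n, the true error, the two bounds on it, and the two sides of (36). On this example the a posteriori bound is the tighter of the two from n = 2 onwards.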

Concerning the hypotheses of the above theorem, we have to note that, in practical applications, the number

q_{0}

from the right-hand side of the inequality (32) can be taken as small as wanted, because having an initial approximation

x_{0}

, one can take for

{\bar{x}}_{0}

a small perturbation of it (for example

{\bar{x}}_{0} = (1 + ε) x_{0}

). The key condition of our theorem is represented by the inequality (34). This inequality can be satisfied only if

r_{0}

is small enough, that is, if the initial approximation

x_{0}

is good enough. However, we can prove that the condition (34) is in some sense the weakest possible. Indeed, let

d, H, q_{0}

and

r_{0}

be some positive numbers, and let us consider the real function

f

given by the formula

f (x) = H x^{2} + d r_{0} - \frac{1}{4 H} {(d - H q_{0})}^{2}

The divided difference of the function

f

, will obviously satisfy (16) and (17). The inequalities (31)-(33) are also verified, if we take

x_{0} = \frac{d - H q_{0}}{2 H}, {\bar{x}}_{0} = \frac{d + H q_{0}}{2 H}

However, if the condition (34) is not verified, then

d r_{0} > \frac{1}{4 H} {(d - H q_{0})}^{2}

, and thus the equation

f (x) = 0

has no solution.

In the following we shall show that the estimates (36)-(38), obtained in Theorem 3.1, are, in some sense, the best possible.
PROPOSITION 3.1. The estimates (36)-(38) are sharp in the following sense: for any positive numbers

d, H, q_{0}

and

r_{0}

, satisfying the inequality (34), there exists a function

f

and a pair of points (

x_{0}, {\bar{x}}_{0}

) which satisfy the hypotheses of Theorem 3.1, and for which the inequalities (36)-(38) are verified with equality.

Proof. The proof of the above proposition is a consequence of the proof of Lemma 3.1.

From (36) it follows that

‖ x^{*} - x_{0} ‖ ≦ σ (r_{0})

. We shall prove that

x^{*}

is the unique root of the equation

f (x) = 0

in a neighbourhood of the point

x_{0}

. Let

\overset{\circ}{V}

denote the open sphere with centre

x_{0}

and radius

σ (r_{0}) + 2 a

.
PROPOSITION 3.2. If the inequality (34) from Theorem 3.1 is strict, then the root

x^{*}

, whose existence is guaranteed by this theorem, is the unique solution of the equation

f (x) = 0

in the set

U \cap \overset{\circ}{V}

Proof. First, we note that if the inequality (34) is strict, then

a > 0

, so that

x^{*} \in U \cap \overset{\circ}{V}

. Let

y^{*}

be an element of

U \cap \overset{\circ}{V}

, such that

f (y^{*}) = 0

. Using (16) we obtain the relation:

\begin{matrix} (41) & x^{*} - y^{*} = {[x_{0}, {\bar{x}}_{0}; f]}^{- 1} ([x_{0}, {\bar{x}}_{0}; f] - [x^{*}, y^{*}; f]) (x^{*} - y^{*}) \end{matrix}

Now taking into account (17) we obtain:

\begin{matrix} (42) & ‖ x^{*} - y^{*} ‖ ⩽ \frac{H}{d} (‖ x_{0} - x^{*} ‖ + ‖ {\bar{x}}_{0} - y^{*} ‖) ‖ x^{*} - y^{*} ‖ \end{matrix}

On the other hand, from (22), (31) and (32), we infer that

\begin{matrix} (43) & \frac{H}{d} (‖ x_{0} - x^{*} ‖ + ‖ {\bar{x}}_{0} - y^{*} ‖) < \frac{H}{d} (2 σ (r_{0}) + 2 a + q_{0}) = 1 \end{matrix}

Finally the inequalities (42) and (43) imply that

x^{*} = y^{*}

, so that the proof of the proposition is completed.

4. The modified Newton's method

As we have anticipated in Section 2, the results concerning the modified Newton's method can be obtained, as a limit case, from the results concerning the modified secant method. In the following, we shall transcribe the results obtained in the preceding section for the case where

x_{0} = {\bar{x}}_{0}

and

q_{0} = 0

.
LEMMA 4.1. If

d, H

and

r_{0}

are three positive numbers satisfying the inequality :

\begin{matrix} (44) & 4 H r_{0} ≦ d, \end{matrix}

then:

\begin{matrix} (45) & ω_{1} (r) = \frac{r}{d} (H r + d - \sqrt{d^{2} - 4 H d (r_{0} - r)}) \end{matrix}

is a rate of convergence on the interval

T =] 0, r_{0}]

and the corresponding function

σ_{1}

is given by :

\begin{matrix} (46) & σ_{1} (r) = \frac{1}{2 H} (\sqrt{d^{2} - 4 H d (r_{0} - r)} - \sqrt{d^{2} - 4 H d r_{0}}) . \end{matrix}
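Formulas (45) and (46) should agree with the rate (21) and the sum (22) of Lemma 3.1 when q_0 = 0. The sketch below compares the two pairs on a grid of sample values. The numerical data and the names `omega`, `sigma`, `omega1`, `sigma1` are assumptions made for this example.

```python
import math

d, H, r0 = 3.0, 1.0, 0.5      # hypothetical data with 4 H r0 <= d, see (44)
q0 = 0.0                       # limit case considered in this section
a = math.sqrt((d - H * q0) ** 2 - 4.0 * H * d * r0) / (2.0 * H)   # (23)


def omega(r):
    """Rate of convergence (21) of the modified secant method."""
    return (r / d) * (H * r + d - 2.0 * math.sqrt(H * H * a * a + H * d * r))


def sigma(r):
    """Sum function (22)."""
    return math.sqrt(a * a + (d / H) * r) - a


def omega1(r):
    """Rate of convergence (45) of the modified Newton method."""
    return (r / d) * (H * r + d - math.sqrt(d * d - 4.0 * H * d * (r0 - r)))


def sigma1(r):
    """Sum function (46)."""
    root_r = math.sqrt(d * d - 4.0 * H * d * (r0 - r))
    root_0 = math.sqrt(d * d - 4.0 * H * d * r0)
    return (root_r - root_0) / (2.0 * H)


samples = [r0 * k / 10.0 for k in range(1, 11)]
# Largest discrepancy between the two pairs of formulas over the samples.
max_gap = max(
    max(abs(omega(r) - omega1(r)), abs(sigma(r) - sigma1(r))) for r in samples
)
```

On these samples `max_gap` stays at rounding level, consistent with obtaining the lemma by substituting q_0 = 0 into Lemma 3.1.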

Now, as in the preceding two sections, let

f

be a nonlinear operator which maps the sphere

U = U (x_{0}, m)

of the Banach space

E

into the Banach space

F

. We suppose that

f

is Fréchet differentiable on

U

and that the condition (18) holds. Then, according to the remark made in Section 2, there exists a mapping

U \times U ∋ (x, y) \mapsto [x, y; f] \in L (E, F)

such that (16) and (17) hold. Moreover for each

x \in U

we have

[x, x; f] = f^{'} (x) .

Let us suppose now that the Fréchet derivative

f^{'} (x_{0})

is boundedly invertible. We may then consider the following iterative procedure:

\begin{matrix} (47) & x_{n + 1} = x_{n} - {[f^{'} (x_{0})]}^{- 1} f (x_{n}), n = 0, 1, 2, \dots \end{matrix}

which is called the modified Newton's method. This procedure may be regarded as a limit case of the modified secant method, so that from Theorem 3.1 we can derive the following theorem:
THEOREM 4.1. If condition (18) holds for each

x, y \in U = U (x_{0}, m)

and if the following inequalities:

\begin{matrix} (48) & {‖ {[f^{'} (x_{0})]}^{- 1} ‖}^{- 1} ≧ d \\ (49) & ‖ {[f^{'} (x_{0})]}^{- 1} f (x_{0}) ‖ ≦ r_{0} \\ (50) & 4 H r_{0} ≦ d \\ (51) & m ≧ σ_{1} (r_{0}) = \frac{1}{2 H} (d - \sqrt{d^{2} - 4 H d r_{0}}) \end{matrix}

are fulfilled, then the sequence

{(x_{n})}_{n = 0}^{\infty}

obtained by the iterative procedure (47), converges to a root

x^{*}

of the equation

f (x) = 0

, and the following inequalities are satisfied :

\begin{matrix} (52) & ‖ x_{n} - x_{0} ‖ ≦ σ_{1} (r_{0}) - σ_{1} (ω_{1}^{n} (r_{0})), & n = 0, 1, 2, \dots, \\ (53) & ‖ x_{n} - x^{*} ‖ ≦ σ_{1} (ω_{1}^{n} (r_{0})), & n = 0, 1, 2, \dots, \\ (54) & ‖ x_{n} - x^{*} ‖ ≦ σ_{1} (‖ x_{n} - x_{n - 1} ‖) - ‖ x_{n} - x_{n - 1} ‖, & n = 1, 2, 3, \dots, \end{matrix}

where

ω_{1}

and

σ_{1}

are given respectively by (45) and (46).
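A small experiment illustrates the a priori estimate of this theorem. The sketch below applies the modified Newton iteration (47) to the hypothetical scalar equation x² - 2 = 0 with x_0 = 1.5, for which (18) holds with H = 1, and records the gap between the bound σ_1(ω_1^n(r_0)) and the true error. The test equation, the starting point and the variable names are assumptions made for this example.

```python
import math


def f(x):
    """Hypothetical test equation f(x) = 0 with root sqrt(2)."""
    return x * x - 2.0


def fprime(x):
    """Derivative of the test function."""
    return 2.0 * x


x0 = 1.5
H = 1.0                          # |f'(x) - f'(y)| = 2 |x - y|, so (18) holds with H = 1
d = abs(fprime(x0))              # (48) with equality
r0 = abs(f(x0) / fprime(x0))     # (49) with equality
assert 4.0 * H * r0 <= d         # condition (50)


def omega1(r):
    """Rate of convergence (45)."""
    return (r / d) * (H * r + d - math.sqrt(d * d - 4.0 * H * d * (r0 - r)))


def sigma1(r):
    """Sum function (46)."""
    root_r = math.sqrt(d * d - 4.0 * H * d * (r0 - r))
    root_0 = math.sqrt(d * d - 4.0 * H * d * r0)
    return (root_r - root_0) / (2.0 * H)


root = math.sqrt(2.0)
x, r = x0, r0                    # r holds omega_1^n(r0)
gaps = []
for n in range(6):
    gaps.append(sigma1(r) - abs(x - root))   # slack in the a priori estimate
    x = x - f(x) / fprime(x0)                # modified Newton step (47)
    r = omega1(r)
```

For this quadratic the recorded gaps vanish up to rounding, so the bound is attained; this is the kind of extremal example on which the sharpness statement rests.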
From Propositions 3.1 and 3.2 we obtain the following two propositions, concerning the sharpness of the estimates (52)-(54) and the uniqueness of the root

x^{*}

:
PROPOSITION 4.1. The estimates (52)-(54) are sharp in the following sense: for any three positive numbers

d, H

and

r_{0}

satisfying the inequality (50) there exists a function

f

, which satisfies the hypotheses of Theorem 4.1, and for which the inequalities (52)-(54) are verified with equality.
PROPOSITION 4.2. If the inequality (50) of Theorem 4.1 is strict, then the root

x^{*}

, whose existence is guaranteed by Theorem 4.1, is the unique solution of the equation

f (x) = 0

in the set

U \cap \overset{\circ}{V}

, where

\overset{\circ}{V}

is the open sphere with center

x_{0}

and radius

σ_{1} (r_{0}) + 2 a, with a = \frac{1}{2 H} \sqrt{d^{2} - 4 H d r_{0}}

.

Finally, let us note that the results stated in this section represent a slight improvement of the results obtained by us in [3]. Namely, the condition (18) of the present paper is weaker than the condition

‖ f^{''} (x) ‖ ≦ 2 H, x \in U

, imposed there. Moreover, the a posteriori estimate (54) from Theorem 4.1 is new.

REFERENCES

[1] Balazs, M. and Goldner, G., On existence of divided differences in linear spaces, Revue d'analyse numérique et de la théorie de l'approximation 2 (1973), 5-9.
[2] Goldner, G. and Balazs, M., Asupra metodei coardei şi a unei modificări a ei pentru rezolvarea ecuaţiilor operaţionale neliniare în spaţii Banach, Stud. şi Cerc. Mat. 20, 7 (1968).
[3] Potra, F.-A., The rate of convergence of a modified Newton's process, Preprint series in mathematics no. 36/1978, INCREST.
[4] Pták, V., Nondiscrete mathematical induction and iterative existence proofs, Linear Algebra and its Applications 13 (1976), 223-238.
[5] Pták, V., The rate of convergence of Newton's process, Numer. Math. 25 (1976), 279-285.
[6] Pták, V., What should be a rate of convergence?, R.A.I.R.O. Analyse Numérique 11, 3 (1977), 279-286.
[7] Schmidt, J., Eine Übertragung der Regula Falsi auf Gleichungen in Banachraum. I, II, Z. Angew. Math. Mech. 43 (1963), 1-8, 97-110.
[8] Schröder, J., Nichtlineare Majoranten beim Verfahren der schrittweisen Näherung, Arch. Math. (Basel) 7, 471-484.
[9] Сергеев, А. С., О методе хорд, Сибир. матем. ж. 2 (1961), 282-289.
[10] Ульм, С., Об обобщенных разделенных разностях I, II, ИАН ЭССР, физика, математика 16 (1967), 13-26, 146-156.

Received 12. III. 1979.
INCREST, Bucureşti