Observations concerning some approximation methods for the solutions of operator equations

Abstract

(soon)

Authors

Ion Păvăloiu
(Tiberiu Popoviciu Institute of Numerical Analysis)

Keywords

PDF

Scanned paper.

PDF-LaTeX version of the paper.

Cite this paper as:

I. Păvăloiu, Observations concerning some approximation methods for the solutions of operator equations, Rev. Anal. Numér. Théor. Approx., 23 (1994) no. 2, pp. 185-195.

About this paper

Print ISSN

1222-9024

Online ISSN

2457-8126

Google Scholar Profile

References

[1] Argyros, K.I., Concerning the Convergence of Newton’s Method, The Renjab University Journal of Mathematics, Vol. XXI (1988), pp.1-11.

[2] Argyros, K.I, The Secant Method and Fixed Points of Nonlinear Operrators, Mh. Math., 106 (1988) pp. 85-94.

[3] Denis, J. E., Toward a Unified Convergence Theory for Newtonlike Methods, Nonlinear Functional Analysis and Applications. (Ed. by L. B. Rall). John Wiley, New York (1986).

[4] Lazăr, I., On Newton’s Method for Solving Operator Equations with Hölder Continuous Derivative. Revue d’analyse Numérique et de Théorie de l’Approximation. Tome 23. Nr.2 (1993).

[5] Ortega, J. M. and Rheinboldt, W., Iterative Solution of Nonlinear Equations in Several Variables. Academic Press, New York and London, 1970.

[6] Păvăloiu, I., Remarks on the secant method for the solution of nonlinear operational equations. Research Seminars, Seminar on Mathematical analysis. Preprint nr.7 (1991) pp.127-132.

[7] Păvăloiu, I., On the Convergence of a Steffensen-Type Method., Research Seminars. Seminar of Mathematical Analysis. Preprint nr.7 (1991), pp.121-126.

[8] Păvăloiu, I., Introduction in the theory of approximation of equations solutions. Dacia Ed., Cluj-Napoca, (1976) (in Romanian).

[9] Păvăloiu, I., Sur une généralisation de la méthode de Steffensen, Revue d’analyse numérique et de théorie de l’approximation. Tome 21, Nr.1, (1992), pp.59-65.

[10] Schmidt, J. W. Konvergenzgesch windigkert der Regula falsi und der Steffensen Verfahrens in Banachraum. Z.A.M.M. 46, 2, (1996) pp. 146-148.

[11] U’lm, S., Ob. obobschennyh razdelennih raznostiakh I., Izv. Acad. Nauk Estonskoi SSR, 16 (1967), 13-36.

[12] U’lm, S., Ob. obobschennyh razdelennih raznostiakh II., Izv. Acad. Nauk Estonskoi SSR, 16 (1967), 146-155.

[13] U’lm, S., Ob. obobschenie metoda Steffensena dlea reshenia nelineingh-operatornih urovnenii. Jurnal vicisl. mat. i mat.-fiz. 4, 6, (1969

Paper (preprint) in HTML form

Observations Concerning Some Approximation Methods for the Solutions of Operator Equations

Observations Concerning Some Approximation Methods for the Solutions of Operator Equations

Ion Păvăloiu
(Cluj-Napoca)

1. Introduction

The purpose of this paper is to give some completions to some results, recently appeared in the literature, concerning the convergence and the error bounds of some methods for solving operatorial equations, when the Fréchet derivatives or the divided differences of the operators are Hölder continuous.

Let f:XY be an application, where X and Y are Banach spaces. We shall define the divided difference of a certain order in the following way: let uiXi=1,n+1, where uiuj for ij.

Definition 1.1.

[8]. The divided difference of the first order of the application f at uk,usX is an application [uk,us;f](X,Y) which verifies:

  • a)

    [uk,us;f](usuk)=f(us)f(uk)

  • b)

    if f is Fréchet differentiable, at us, then

    [us,us;f]=f(us).

We suppose that there have been defined the applications

[uk,uk+1,,uk+m1;f](Xm1,Y)and
[uk+1,uk+2,,uk+m;f](Xm1,Y),

called the divided differences of the order m1, where k+mn.

Definition 1.2.

[8]. The divided difference of the order m of the application f in uk,uk+1,,uk+mX, is an application

[uk,uk+1,uk+m;f](Xm,Y)

which verifies:

(a’) [uk,uk+1,,uk+m;f](uk+muk)= [uk+1,uk+2,,uk+m;f]
[uk,uk+1,,uk+m1;f]
  • b’)

    if f is m time Fréchet differentiable at uk, then

    [uk,uk,,uk;f]=1m!f(m)(uk).

2. Considerations on the Secant Method

For the approximation of the solution of the operator equation

(1) f(x)=0

consider the iteration

(2) xn+1 =xn[xn1,xn;f]1f(xn),n=1,2,,x0,x1X,

It is known that if f satisfies certain conditions, then the sequence (xn)n0 given by (2) is well defined (there exists [xn1,xn;f]1 for n=1,2,) and converges to a solution x of equation (1) (see for example [2], [5], [8], [10].

In the following we shall give some specifications concerning the results obtained in [2]. Then we shall try to obtain conditions that ensure the convergence of the sequence (xn)n0 to a solution of (1), and, moreover, we shall determine a subset EX that contains this solution.

In paper [2], where the results obtained in [3] are generalized, the convergence of process (2) is studied under the assumptions that f is Fréchet differentiable on a set DX and the Fréchet derivative f() satisfies a Hölder type condition on D: there exist c, c>0 and p(0,1] such that:

(3) f(x)f(y)cxyp,for every x,yD.

Let HD(c,p) denote the set of all applications f for which (3) holds.

In [2], in addition to the conditions from Definition 1.1, it is assumed that the divided differences of the first order of f satisfy a Hölder type condition, namely there exist l1,l2,l30,p=(0,1) such that for every x,y,zD, the inequality:

(4) [x,y;f][y,z;f]l1xzp+l2xyp+l3yzp

holds.

This condition is useful when divided differences of the second order of f are unbounded on D.

Let l2=max{l2,l3}. If x is a simple zero of equation (1), then the application f(x)(X,Y) has a bounded inverse.

From (4) and from the existence and boundness of [f(x)]1 there exists ε>0 such that [x,y;f] has a bounded inverse for every x,yU¯(x,ε), where U¯(x,ε)={xX|xxε}, namely the application B(x,y)=[x,y;f]1 is uniformly bounded on U¯(x,ε).

In [2] the following theorem was proved:

Theorem 2.1.

Let DX be an open set and f:XY. If:

  • i)

    xD is a simple solution of equation (1).

  • ii)

    there exists ε>0 and b>0 such that:

    [x,y;f]1b,for every x,yU¯(x,ε);
  • iii)

    there exists a convex set D0 and a real number ε1, 0<ε1<ε, such that:

    f()Hr0(c,p),for every xD0and U(x,ε1)D0;
  • iv)

    x0,x1U¯(x,r), where 0<r<min{ε1,[q(p)]1p}

    and

    (5) q(p)=b1p[2p(l1+l2)(1+p)+c],

then the sequence given by (2) is well defined, and its elements belong to U¯(x,r). The sequence converges to the unique solution x of (1). Moreover, the following estimation holds

(6) xn+1xγ1xn1xpxnx+γ2xnx1+p

for n large enough where γ1 and γ2 are given by

(7) γ1=b(l1+l2)2p
(8) γ2=bc1+p

The proof is based on the following two lemmas [2]

Lemma 2.1.

Let f:XY and DX be an open set. If f is Fréchet differentiable on D and there exists a convex set D0D such that f()HD0(c,p), then for any x,yD0 the following inequality holds:

(9) f(x)f(y)f(x)(xy)c1+pxyp+1.
Lemma 2.2.

If there exists divided differences [x,y;f] and inequality (4) is verified for all x,y,zD0, then the equality b) from Definition 1.1 holds for every xD0 and the derivative f of f verifies the relation f()HD0[2(l1+l2),p].

In the proof of Theorem 2.1 the following inequality is obtained first

(10) xn+1x[M(r)]n+1x0x,where 0<M(r)<1.

from which it follows that the sequence (xn)n0 is convergent. In the following, by use of inequality (6) obtained in [2], we shall prove that the order of convergence of the sequence given by (2) is t1=1+(1+4p)1/22, i.e. it is the positive root of the equation:

(11) t2tp=0.

For this, besides the hypothesis of Theorem 2.1 we shall suppose that x0 and x1 verify

  • a’)

    xx0αd0;

  • b’)

    xx1min{αd0t1,xx0},

    where 0<d0<1 and α=[q(p)]1p.

Using Lemmas 2.1 and 2.2 and hypotheses of Theorem 2.1, from (2) we obtain

(12) x2xγ1x0xpx1x+γ2x1xp+1

from which, taking into account a’), b’) it follows

x2xαd0t12(γ1+γ2d0p(t11))αp,

where

(γ1+γ2d0p(t11))αp=γ1+γ2d0p(t11)γ1+γ2<1

that is,

(13) x2xd0t12

Relations (12) and (13) imply x2x<x1x.

Suppose that there exists nN,n2, such that

(a”) xn1x αd0t1n1
(b”) xnx min{αd0t1n,xn1x}.

If we repeat the above reasoning and take into account (a”) and (b”) we obtain

xn+1xα1+pd0t1n+1(γ1+γ2d0pt1n(t11))αd0t1n+1,

because

αp(γ1+γ2d0pt1n(t11))<1.

Moreover, it can be easily seen that

xn+1x<xnx.

So far, we have proved the following theorem.

Theorem 2.2.

Under the hypotheses of Theorem 2.1 and if x0 and x1 verify a’) and b’), where α=(q(p))1p and 0<d0<1, then for every n, xnU(x,α) and

(2.13’) xn+1xd0t1n+1,n=0,1,

One must notice that inequality (2.13’) gives a sharper error bound than (10).

In the following we shall establish a result which ensures not only the convergence of the sequences (xn)n0 but also the existence of the solution of equation (1) in a determined subset of X.

In this respect, we observe that if [xn1,xn;f]1 exists for every n=1,2, then:

(14) xn[xn1,xn;f]1f(xn)=xn1[xn1,xn;f]1f(xn1)

and

(15) f(xn+1)= f(xn)+[xn1,xn;f](xn+1xn)+
+([xn,xn+1;f][xn1,xn;f])(xn+1xn),n=1,2,

hold.

Let α,B,d0, α>0,B>0,d0(0,1) and

S={xX:xx0Bαd01d0t11},

where t1 is the positive root of equation (11).

Theorem 2.3.

If the divided differences of the first order of the applicaiton f verify condition (4) for every x,y,zS and

  • i)

    for every x,yS,[x,y;f]1 exists and [x,y,f]1B

  • ii)

    the initial data x0,x1X and f verify the inequalities

    (16) x1x0 <Bαd0,f(x0)αd0 andf(x1)αd0t1

    where

    (17) α=1B1+pp(l1+l2+l3)1p

    the equation (1) has at least one solution xS which is the limit of the sequence (xn)n0 given by (2) the order of the convergence of this sequence and the error bound are given by

    (18) xxnBαd0t1n1d0t1n(t11),n=1,2,
Proof.

From (2), for n=2 we have

x2x1Bf(x1)Bαd0t1

This inequality, together with the first inequality from (16), implies.

x2x0 x2x1+x1x0Bαd0(1+d0t11)Bαd01d0t11,

and so x2S.

Because x2S, from (14), (15) and using (4) we obtain:

(19) f(x2)Bp+1αp+1(l1+l2+l3d0t12p)d0t12pαd0t12,

since

αpBp+1(l1+l2+l3d0p(t11))αpBp+1(l1+2+l3)1.

Suppose

(20) xi S;
(21) f(xi) αd0t1i,hold for i=1,k.¯

Then

xn+1xk Bαd0(1+d0t11+d0t121++d0t1k1)
Bαd0(1+d0(t11)+d02(t11)++d0k(t11))Bαd01d0t11,

that is, xk+1S.

By the use of the same reasoning as for (19) we obtain

(22) f(xk+1)Bp+1αp+1(l1+l2+l3pt1k1(t11))d0t1k1(t1+p)α0d0t1k+1.

The above relations imply that (20) and (21) hold for every k.

Now notice that (xn)n0 is a Cauchy sequence, because

(23) xn+sxnk=nn+s1xk+1xkBαk=nn+s1d0t1kBαd0t1n1d0t1n(t11),

for any n,s, t1>1 and d0(0,1).

Let x=limnxn. Then, taking s in (23) we obtain

(24) xxnBαd0t1n1d0t1n(t11),

n=0,1,, that is, the inequality (18).

For n=0 we obtain xS.

If k in (22) then f(x)=0, that is x, is the solution of equation (1). ∎

3. Considerations on Steffensen Method

It is well known that the order of convergence of the secant method can be improved if the elements xn1 and xn form (2) are related by an application g:XX, described in the following.

Consider the sequence (xn)n0 generated by

(25) xn+1xn[xn,g(xn);f]1f(xn),x0X,

where g is an operator whose fixed points coincide with solutions of equations (1).

Consider x0X, the nonnegative real numbers B,ε0,ρ0,α, β and q1, where

(26) ρ0=Bα(l1Bp+l2βp+l3Bpαp)f(x0)p(q1)
(27) ε0=ρ01(p+q1)f(x0),

the numbers l1,l2,l3 being given by condition (4).

(28) S={xX:xx0rε0ρ01p+q1(1ε0p+q1)},

where r=max{B,β}

Concerning the convergence of method (25), the following theorem holds:

Theorem 3.1.

If the real numbers B,ε0,ρ0,p0,p,α,β,q,l1,l2,l3 the applications f and g, and the element x0 satisfy the conditions:

  • (i)

    for every x,yS there exist [x,y;f]1 and [x,y;f]1B;

  • (ii)

    for every xS,f(g(x))αf(x)q;

  • (iii)

    for every xS, xg(x)βf(x);

  • (iv)

    the divided differences of the first order of the applications f verify condition (4) for every x,y,zS;

  • (v)

    ε0<1,

then the sequence (xn),n0 given by (25) is convergent and if x=limxn, then f(x)=0. Moreover, we have

(29) xxnrε0(p+q)nρ01p+q1(1ε0p+q1).
Proof.

Let x0X be such that ε0 verifies condition (v). Using similar relations to (14) and (15) and condition (4), we obtain from (25)

x1x0Bf(x0)Bρ01p+q1ρ01p+q1f(x0)rε0ρ01p+q1(1ε0p+q1),

which means that x1S.

In the above inequality we have admitted the relation g(x0)S, which is implied by (iii).

From (14), (15), (25), (i), (ii) and (iii) we obtain:

f(x1) [g(x0),x1;f][x0,g(x0);f]x1g(x0)
Bα(l1Bp+l2βp+l3Bpαpf(x0)p(q1))f(x0)p+q
=ρ0f(x0)p+q.

From the above inequality there follows

ρ01p+q1f(x1)(ρ01p+q1f(x0))p+q

and if ε1=ρ01p+q1f(x1) then,

ε1ε0p+q.

It can be easily seen that f(x1)f(x0) and ρ1ρ0, where ρ1=Bα[l1Bp+l2βp+l3Bpαpf(x1)p(q1)]. Suppose now that, for s=1,k,¯ the following relations hold xsS,f(xs)f(xs1), εs<ε0(p+q)s, where εs=ρ1p+q1f(xs).

Using these assumptions and proceeding as above we get

(30) xk+1xkBf(xk)rε0(p+q)kρ01p+q1
(31) xn+1x0rε0ρ01p+q1(1ε0p+q1),

showing that xk+1S.

It is also easy to see that

(32) g(xk)xkrε0(p+q)kρ01p+q1

whence

(33) g(xk)x0rε0ρ01p+q1(1ε0p+q1),

that is, g(xk)S.

We obtain further

(34) f(xk+1)ρ0f(xk)p+q,

whence

(35) εk+1ε0(p+q)k+1,when εk+1=ρ01p+q1f(xk+1).

From (30) it follows that, for every s,n,

(36) xn+sxnBε0(p+q)nρ01p+q1(1ε0p+q1)

and by (v) the sequence (xn)n0 is fundamental, hence convergent. If x=limnxn, from (36), for s, we get (29) and from (35) it follows that x is a solution of (1).

From (29), for n=0 we have that xS. ∎

4. Considerations Concerning Newton’s Method

Consider the sequence given by Newton’s method,

(37) xn+1=xn[f(xn)]1f(xn),n=0,1,,x0X

let S(x0,r)={xX|xx0r}, where r, r>0.

Concerning the convergence of this sequence we have the following theorem.

Theorem 4.1.

If the application f is Fréchet differentiable on S(x0,r), the Fréchet derivative f satisfies (3) for every x,yS(x0,r) and the following conditions hold:

  • (i)

    [f(x0)]1 exists and [f(x0)]1d;

  • (ii)

    crpd<1;

  • (iii)

    ρ0=α1pf(x0)<1, where α=cβ1+p1+p and β=d1cdrp,

  • (iv)
    βf(x0)1αf(x0)pr,

then,

  • (j)

    xnS(x0,r),for every n,

  • (jj)

    there exists Γn=[f(xn)]1 for every n and Γn<d1dcrp,

  • (jjj)

    xn+1xnβα1pρ0(1+p)n;

  • (jv)

    the sequence (xn)n0 is convergent, and if x=limnxn then f(x)=0 and

    (38) xxnβα1pρ0(1+p)n1ρ0(1+p)n,n.
Proof.

By (37), for n=1 we get x1=x0[f(x0)]1f(x0), and from (i) x1x0df(x0)βf(x0)r, that is, x1S(x0,r).

By (3) and ii) it follows

[f(x0)]1[f(x0)f(x1)]dcx1x0pdcrp<1,

whence [f(x1)]1 exists and

[f(x1)]1d1dcrp=β.

From (3) it follows

f(x1) =f(x1)f(x0)f(x0)(x1x0)
cp+1x1x0p+1cβp+1p+1f(x0)p+1

and if

ρ1=α1pf(x1)

then

ρ1ρ01+p,x2x1βα1pρ01+p.

If ρi=α1pf(xi) and

(a) xi S(x0,r),i=1,k¯,
(b) xi+1xi βα1pρ0(1+p)i,i=1,k1¯,
(c) ρi ρ0(1+p)i,i=1,k,¯

then we get by (a), (3) and (i)

[f(x0)]1[f(xk)f(x0)]dcxkx0pcdrp<1.

It follows that

[f(xk)]1d1dcrp=β.

From (37) and from the above inequality we get that:

xk+1xkβf(xk)βα1pρkβα1pρ0(1+p)k,

which by (b) implies

xk+1x0βf(x0)1αf(x0)pr,

that is, xk+1S(x0,r).

Using the assumptions of the theorem we get that

f(xk+1) =f(xk+1)f(xk)f(xk)(xk+1xk)
cp+1xk+1xkp+1α1pρkp+1,

whence

ρk+1ρ0(1+p)k+1.

For every m,n

xm+nxnβα1pρ0(1+p)n1ρ0p(1+p)n

which, together with ρ0<1, show that (xn)n0 is a Cauchy sequence. If x=limnxn then for m in the above inequality, we get (38), and from

f(xn)α1pρ0(1+p)n,

for n, we get f(x)=0. ∎

References

  • [1] Argyros, K.I., Concerning the convergence of Newton’s method, The Renjab University Journal of Mathematics, Vol. XXI (1988), pp.1–11.
  • [2] Argyros, K.I, The secant method and fixed points of nonlinear operrators, Mh. Math., 106 (1988) pp. 85–94.
  • [3] Dennis, J. E., Toward a unified convergence theory for Newton like methods, Nonlinear Functional Analysis and Applications. (Ed. by L.B. Rall). John Wiley, New York (1986).
  • [4] Lazăr, I., margin: available soon,
    refresh and click here
    On Newton’s method for solving operator equations with Hölder continuous derivative. Rev. Anal. Numér. Théor. Approx., v. 23. no. 2 (1993), pp. 177–187.
  • [5] Ortega, J. M. and Rheinboldt, W., Iterative Solution of Nonlinear Equations in Several Variables. Academic Press, New York and London, 1970.
  • [6] margin: clickable Păvăloiu, I., Remarks on the secant method for the solution of nonlinear operational equations. Research Seminars, Seminar on Mathematical analysis. Preprint nr.7 (1991) pp.127–132.
  • [7] margin: clickable Păvăloiu, I., On the convergence of a Steffensen-type method, Research Seminars. Seminar of Mathematical Analysis. Preprint nr.7 (1991), pp.121–126.
  • [8] Păvăloiu, I., margin: available soon,
    refresh and click here
    Introduction in the theory of approximation of equations solutions. Dacia Ed., Cluj-Napoca, (1976) (in Romanian).
  • [9] margin: clickable Păvăloiu, I., Sur une généralisation de la méthode de Steffensen, Rev. Anal. Numér. Théor. Approx. v. 21, no. 1, (1992), pp.59–65.
  • [10] Schmidt, J. W. Konvergenzgesch windigkert der Regula falsi und der Steffensen Verfahrens in Banachraum. Z.A.M.M. 46, 2, (1996) pp. 146–148.
  • [11] U’lm, S., Ob. obobschennyh razdelennih raznostiakh I., Izv. Acad. Nauk Estonskoi SSR, 16 (1967), 13-36.
  • [12] U’lm, S., Ob. obobschennyh razdelennih raznostiakh II., Izv. Acad. Nauk Estonskoi SSR, 16 (1967), 146-155.
  • [13] U’lm, S., Ob. obobschenie metoda Steffensena dlea reshenia nelineingh-operatornih urovnenii. Jurnal vicisl. mat. i mat.-fiz. 4, 6, (1969).

Received 15 X 1993

Institutul de Calcul

Str. Republicii 37

P.O. Box 68

3400 Cluj-Napoca

România

1994

Related Posts