On a Chebyshev-type method for approximating the solutions of polynomial operator equations of degree 2

Abstract

Authors

Emil Cătinaş, Tiberiu Popoviciu Institute of Numerical Analysis, Romanian Academy

Emil Cătinaş

Ion Păvăloiu, Tiberiu Popoviciu Institute of Numerical Analysis, Romanian Academy

Ion Păvăloiu

Keywords

?

Paper coordinates

E. Cătinaş, I. Păvăloiu, On a Chebyshev-type method for approximating the solutions of polynomial operator equations of degree 2, Approximation and Optimization, Proceedings of the International Congress on Approximation and Optimization (Romania)-ICAOR, Cluj-Napoca, July 29-August 1, 1996, vol. I, pp. 219-226, D.D. Stancu, Gh. Coman, W.W. Breckner and P. Blaga (eds.), Transilvania Press, 1997, ISBN 973-98180-7-2.

PDF

About this paper

Journal
Publisher Name
DOI
Print ISSN
Online ISSN

google scholar link

??

Paper (preprint) in HTML form

1. Introduction

1. Introduction

For solving the nonlinear equations, in a recent paper, A. Diaconu [7] has proposed the Chebyshev-type method (1.2), for which the computation of the inverse of the derivative at each step is avoided. Similar methods have been studied by other authors [5], [6], [14], but these does not preserve the r-convergence order 3.

In this note we shall apply method (1.2) for solving polynomial operator equations of degree 2. We shall show that the hypotheses for the convergence of the method take a simplified form, the convergence order remaining unaltered. We shall consider then the eigenproblem for scalar matrices, applying to it the studied method. Numerical examples are also given.

Let X be a Banach space, F:XX a nonlinear operator and consider the equation

F(x)=θ¯, 1.1

θ¯ being the null element of X.

For solving (1.1) in [7] there are considered three sequences, (xk)k0X and (Bk)k0, (Ck)k0(X) given by

Ck =Bk(2IF(xk)Bk) 1.2
xk+1 =xkCkF(xk)12CkF′′(xk)(CkF(xk))2
Bk+1 =Bk[3I3F(xk+1)Bk+(F(xk+1)Bk)2],k=0,1,,

where x0X and B0(X), (X) being the set of all linear continuous operators on X and I(X) being the identity operator.

Remark. The convergence with r-order 3 of this method is obtained despite a general principle which suggests that every newly computed unknown should be used at once in the determination of the other unknowns (e.g. the Gauss-Seidel method for linear systems). In our case we could consider Ck instead of Bk in the determination of Bk+1.

2. The convergence of the method

We shall give first a lemma.

Lemma

If the sequences of real positive numbers (δk)k0 and (ρk)k0 satisfy

δk+1 (δk+2ρk+2ρk2)3
ρk+1 ρkδk2+ρk2δk2+2ρk3+ρk4,k=0,1,,

where max{δ0,ρ0}17d for some 0<d<1,

then the following relation holds:

max{δk,ρk}17d3k,k=0,1,.

The proof can be easily obtained by induction.

In the following we shall study the convergence of method (1.2) supposing that F is a polynomial operator of degree 2, i.e. F is indefinite differentiable on X and F(i)=θi for i3, θi being the i-linear null operators.

Under this condition F satisfies

F(y)=F(x)+F(x)(yx)+12F′′(x)(yx)2,for all x,yX, 2.1

where, in fact, F′′(x) is a constant bilinear operator which does not depend on x.

Let x0X and r,K>0 be two real numbers. Denote S={xX|xx0r} and suppose that we have the estimation

F′′(x)K,for all xS.

Concerning the convergence of method (1.2), the following result holds:

Theorem

If the operator F and the elements x0X, B0(X) satisfy:

a) there exists F(x0)1 and F(x0)1b0 for some b0>0;

b) q=Kb0r<1;

c) max{δ0,ρ0}17d for some 0<d<1, where

ρ0=Ka22F(x0),δ0=IF(x0)B0,a=6449b andb=b01q;

d) 16d49Ka(1d2)r,

then the following properties hold:

1) the sequences (xk)k0, (Bk)k0, (Ck)k0 converge and (xk)k0S;

2) denoting x=limxk,B=limBk,C=limCk, then F(x)=θ¯ and B=C=F(x)1;

3) the following estimations are true:

xxk 16d3k49Ka(1d23k);
BBk 1656ad3k2401(1d23k),k=0,1,
Demonstration Proof

We shall prove firstly that the first order derivative of F is invertible on S. By a) and b) we have

IF(x0)1F(x)F(x0)1F(x0)F(x)b0Kr=q<1,

for all xS. It follows that the operator T(x)=F(x0)1F(x) is invertible on S and T(x)1=F(x)1F(x0), whence F(x)1=T(x)1F(x0)1 and

F(x)1b01q=b.

For the norms of B0 and C0, taking into account the hypotheses, we get

B0 B0F(x0)1+F(x0)1
F(x0)1(1+IF(x0)B0)
b0(1+δ0)87b0<87b

and

C0B0+IF(x0)B0B0B0(1+δ0)6449b=a,

so B0a and C0a.

From (1.2) we have

x1x0 a(1+Ka22F(x0))F(x0)
a(1+ρ0)F(x0)
< 87aF(x0)=2ρ0Ka287a=16d49Ka,

whence, taking into account d) it follows that x1S.

Further, by (1.2) and (2.1),

F(x1) IF(x0)C0(1+12F′′(x0)C02F(x0))F(x0)
+12F′′(x0)2C04F(x0)3+18F′′(x0)3C06F(x0)4,

whence

Ka22F(x1) Ka22F(x0)IF(x0)C0(1+Ka22F(x0))
+2(Ka22F(x0))3+(Ka22F(x0))4.

Denoting ρ1=Ka22F(x1) and taking into account the inequality

IF(x0)C0IF(x0)B02=δ02

it follows

ρ1ρ0δ02+ρ02δ02+2ρ03+ρ04. 2.2

From the third relation of (1.2) we get

IF(x1)B1= (IF(x1)B0)3
IF(x1)B03
(IF(x0)B0+B0Kx1x0)3
(δ0+2ρ0+2ρ02)3,

i.e.,

δ1(δ0+2ρ0+2ρ02)3. 2.3

By (2.2),(2.3) and hypothesis b), we obtain

ρ117d3δ117d3.

Suppose now that the following properties hold:

𝜶 ) x0,x1,,xkS;

𝜷) ρi:=Ka22F(xi)17d3i and δi:=IF(xi)Bi17d3i,i=0,k¯.

It easily follows that Bka, Cka and

xk+1xk a(1+Ka22F(xk))F(xk) 2.4
a(1+ρk)F(xk)
16ρk7Ka16d349Ka.

From the above formula it follows that xk+1S:

xk+1x016d49Kai=0kd3i116d49Ka(1d2)r.

Denoting ρk+1=Ka22F(xk+1) and δk+1=IF(xk+1)Bk+1, the following relations are obtained in the same manner as for ρ1 and δ1:

ρk+1 ρkδk2+ρk2δk2+2ρk3+ρk4
δk+1 (δk+2ρk+2ρk2)3,

whence, by the Lemma, we get

ρk+117d3k+1δk+117d3k+1. 2.5

Then properties 𝜶), 𝜷), (2.4) and (2.5) hold for all k. We shall prove now that (xk)k0 is a Cauchy sequence.

Indeed,

xk+sxk16d3k49Kai=kk+s1d3s3k16d3k49Ka(1d23k),

for all s,k, which proves that (xk)k0 converges. Denoting x=limkxk and passing to limit for s in the above inequality, we obtain

xxk16d3k49Ka(1d23k),k=0,1,

The convergence of (Bk)k0 is obtained from the third relation of (1.2):

Bk+1Bk Bk2IF(xk+1)BkIF(xk+1)Bk
a(1+δk+2ρk+2ρk2)(ρk+2ρk+2ρk2)a16562401d3k.

Denoting B=limBk it easily follows that

BBk1656ad3k2401(1d23k),k=0,1,

Remark. The inequalities from the hypothesis of the Lemma can be replaced by δ0αd,ρ0βd for 0<d<1 and α,β>0 satisfying (α+2β+2β2d)3α and βα2+β2α2+2β3+β4dβ.

We shall obviously obtain in the conclusion that δkαd3k,ρkβd3k,k=0,1,. The theorem can be reformulated then accordingly. ∎

3. The eigenproblem for linear continuous operators

3.1. The infinite dimensional case

Let V be a Banach space over the field 𝕂 (where 𝕂= or 𝕂=) and A:VV a linear continuous operator. The scalar λ𝕂 is an eigenvalue of A iff the equation

Avλv=θ 3.1

has at least a solution vθ, called an eigenvector of A corresponding to λ, where θ is the null element of V.

For the simultaneous determination of a v and a λ we attach to equation (3.1) another equation

G(v)=1, 3.2

where G:V𝕂 is a polynomial functional of degree two for which G(0)1 (a norming function).

Remark. The functional G may also be taken as a polynomial functional of degree one, i.e. a linear continuous functional, but then dimKerG=dimVdimImG=n1, for the finite dimensional case, and dimKerG= otherwise. So, there exist eigenvectors which do not fulfill equation (3.2). ∎

Denote the Banach space X=V×𝕂 and for x=(vλ), with vV and λ𝕂, define

x=max{v,|λ|}.

Considering the operator F:XX given by

F(x)=(AvλvG(v)1)=((AλI)vG(v)1),

and denoting by θ¯=(θ0) the null element of X, then the operatorial equation for which the solution yields an eigenvalue and an eigenvector of A is

F(x)=θ¯. 3.3

It is known that the Fréchet derivatives of F are (see [4]):

F(x0)h1= (Aλ0Iv0G(v0)0)(u1α1)=(Au1λ0u1α1v0G(v0)u1),
F′′(x0)h1h2= (α1u2α2u1G′′(v0)u1u2),

where x0=(v0λ0), h1=(u1α1),h2=(u2α2) with v0, u1,u2V and λ0,α1,α2𝕂.

It is obvious that F(i)(x0)h1hi=θ¯, for all i3 and x0,h1,,hiX.

Our theorem can be applied for this function F and we can take K=max{2,G′′}.

3.2. The eigenproblem for complex matrices

In the following we shall apply the studied method for the approximation of the eigenvalues and eigenvectors of complex matrices.

Let A=(aij)i,j=1,nn(𝕂) be a square matrix with the elements aij𝕂. Consider V=𝕂n and X=V×𝕂=𝕂n+1. In this case the equation (3.3) is written

Fi(x)=Fi(x(1),,x(n+1))=0,i=1,n+1¯,

where for i=1,n¯ we have

Fi(x)=ai1x(1)++ai,i1x(i1)+(aiix(n+1))x(i)+ai,i+1x(i+1)++ainx(n),

and for the norming function G we can take

Fn+1(x(1),,x(n+1))=12i=1n(x(i))21. 3.4

A solution x of equation F(x)=θ¯ yields an eigenvalue λ=x(n+1) and a corresponding eigenvector v=(x(1),,x(n)) of the matrix A.

The first and second order derivatives of F are given by

F(x)h=(a11x(n+1)a12a1nx(1)a21a22x(n+1)a2nx(2)an1an2annx(n+1)x(n)x(1)x(2)x(n)0)(h(1)h(2)h(n)h(n+1)),

and

F′′(x)hk=(k(n+1)00k(1)0k(n+1)0k(2)00k(n+1)k(n)k(1)k(2)k(n)0)(h(1)h(2)h(n)h(n+1)). 3.5

where x=(x(1),,x(n+1)),h=(h(1),,h(n+1)),k=(k(1),,k(n+1))𝕂n+1.

Suppose that 𝕂n+1 is equipped with the norm

x=max1in+1|x(i)|,x=(x(1),,x(n+1))𝕂n+1.

Then, from (3.5) it follows that we can take K=F′′=n. Our theorem can be stated accordingly.

Another possible choice for the definition of Fn+1, is:

Fn+1(x(1),,x(n+1))=12ni=1n(x(i))21, 3.6

in which case we can take K=F′′=2. In this case K does not depend on n.

Remark. In [15] it is proved the following result. If (v,λ) is an eigenpair then F(v,λ) is nonsingular iff λ is simple. Hence our results apply only for simple eigenvalues.

4. Numerical examples

Consider

A=(1111111111111111),

having the eigenvalues and the corresponding eigenvectors λ1,2,3=2, v1=(1,0,0,1), v2=(1,0,1,0), v3=(1,1,0,0), λ4=2, v4=(1,1,1,1).

Taking x0=(1,3,1,3,1.5), B0=F(x0)1 and using formula (3.4) for Fn+1 we get:

kxk(1)xk(2)xk(3)xk(4)xk(5)00.30000000001.0000000000.30000000001.0000000001.70000000010.63449246270.67570481400.63449246270.67570481401.90111529720.70662643750.70689822340.70662643750.70689822341.99877334730.70710677050.70710677680.70710677050.70710677681.99999997640.70710678120.70710678120.70710678120.70710678122.000000000

Using formula (3.6) for Fn+1 and taking x0=(1,1.8,1,1.8,1.6),B0=F(x0)1, we obtain

kxk(1)xk(2)xk(3)xk(4)xk(5)01.0000000001.8000000001.0000000001.8000000001.60000000011.3829306421.4055155791.3829306421.4055155791.97262224821.4141851971.4142098351.4141851971.4142098351.99997582431.4142135621.4142135621.4142135621.4142135622.000000000

References

  • 1 ANSELONE, M. P., RALL, B. L., The solution of characteristic value-vector problems by Newton method, Numer. Math. 11 (1968), 38–45.
  • 2 CIARLET, P.G., Introduction à l’analyse numérique matricielle et à l’optimisation, Mason, 1990.
  • 3 CHATELIN, F., Valeurs propres de matrices, Mason, 1988.
  • 4 COLLATZ, L., Functionalanalysis und Numerische Mathematik, Springer-Verlag, 1964.
  • 5 DIACONU, A, PĂVĂLOIU, I., Sur quelques méthodes iteratives pour la résolution des equations operationelles, Rev. Anal. Numér. Théor. Approx. 1 (1972), no. 1, 45–61.
  • 6 DIACONU, A., Sur quelques méthods itératives combinées, Mathematica 22 (45) (1980), no. 2, 247–261.
  • 7 DIACONU, A., On the convergence of an iterative proceeding of Chebyshev type, Rev. Anal. Numér. Théor. Approx. 24 (1995), no. 1-2, 91–102.
  • 8 KARTÎŞOV, V. S., IUHNO, F. L., O nekotorîh Modifikaţiah Metoda Niutona dlea Resenia Nelineinoi Spektralnoi Zadaci, J. Vîcisl. matem. i matem. fiz. 33 (1973), no. 9, 1403–1409.
  • 9 ORTEGA, J.M., RHEINBOLDT, W.C., Iterative Solutions of Nonlinear Equations in Several Variables, Academic Press, 1970.
  • 10 PĂVĂLOIU, I., Sur les procédés itératifs à un ordre élevé de convergence, Mathematica (Cluj) 12 (35) (1970), no. 2, 309–324.
  • 11 PETERS, G., WILKINSON, J.H., Inverse iteration, ill-conditioned equations and Newton’s method, SIAM Review 21 (1979), no. 3, 339–360.
  • 12 SANTOS, C.M., A note on the Newton iteration for the algebraic eigenvalue problem, SIAM J. Matrix Anal. Appl. 9 (1988), no. 4, 561–569.
  • 13 TRAUB, F. J., Iterative Methods for the Solution of Equations, Prentice-Hall Inc., 1964.
  • 14 ULM, S., Ob iterationnyh metodah s’posledovatel’noi approximacii obratnovo operatora, Izv. Acad. Nauk Estonskoi SSR 16 (1967), no. 4, 403–411.
  • 15 YAMAMOTO, T., Error bounds for computed eigenvalues and eigenvectors, Numer. Math. 34 (1980), 189–199.
1997

Related Posts