Solving equations by interpolation

Abstract

Let \(X,Y\) be normed spaces, \(G:X\rightarrow Y\) a nonlinear operator, and the nonlinar equation \(G\left( x\right) =0\). We define the divided differences of \(G\) and we give some examples of nonlinear operators on Banach spaces for which we construct the divided differences of different orders. We construct the Lagrange interpolation polynomial in the Newton form for \(G\). The solution of equation is a approximated by using the inverse interpolation Lagrange polynomial.

Authors

Ion Păvăloiu
(Tiberiu Popoviciu Institute of Numerical Analysis)

Title

Original title (in French)

La résolution des equations par intérpolation

English translation of the title

Solving equations by interpolation

Keywords

divided difference in normed spaces, Lagrange inverse interpolation; multistep iterative method of Newton type

PDF

Cite this paper as:

I. Păvăloiu, La résolution des equations par intérpolation, Mathématica, 23(46) (1981) no. 1, pp. 61-72 (in French).

About this paper

Journal

Mathematica

Publisher Name
DOI

Not available yet.

Print ISSN

Not available yet.

Online ISSN

Not available yet.

References

[1] A.M. Ostrowski, Resenie uravnenii i sistem uravnenii, Izd. inostr. lit. Moskva (1963).

[2] I. Pavaloiu, Interpolation dans les espaces lineaires normes et applications, Mathematica, Cluj, nr.12 (35), 2, (1970), pp. 309–324.

[3] I. Pavaloiu, Introducere in teoria aproximarii solutiilor ecuatiilor, Editura Dacia, 1976.

[4] T. Popoviciu, Introduction a la theorie des differences divisees, Bulletin mathematique de la societe Roumaine de Sciences, 42, 1, (1940) pp. 65–78.

[5] A.S. Sergeev, O metode hord, Sibirski mat. Jurnal, XI, (2), (1961), pp. 282–289.

[6] J.F. Traub, Iterative Methods for the Solution of Equations, Prentice Hall, Series in Automatic Computation, 1964.

[7] S. Ul’m, Ob obobscennyh razdelennîh raznostiah, II, Izv. Nauk Estonskoi SSR, 16, 2, (1967), 146–155.

Paper (preprint) in HTML form

 
 
Solving equations by interpolation

by
Ion Păvăloiu

1. Divided difference notation.

LetXAndYtwo normed linear spaces andG:XYan application. We considern+1distinct elements of space X

(1) x1,x2,,xn+1.

We designate byL(X,Y)the set of linear and continuous applications defined onXand values ​​inYand we define by recurrence the following sets:

Li(X,Y)=L(X,Li1(X,Y)),i=2,3,

Or

L1(X,Y)=L(X,Y).

In other words, the setsLi(X,Y)are the spaces of applicationsi-linear and continuous defined onXand values ​​inY.

Definition 1 .

The application[.,.;G]:X2L(X,Y)is called the first-order divided difference of the approximation Gon the pointsxi,xIof the system ( 1 ) if

  • 1)
    [xi,xI;G](xIxi)=G(xI)G(xi)
  • 2)

    if there exists the derivative in the Fréchet sense of the applicationGon the pointxi,SO[xi,xi;G]=G(xi).

We have defined above the divided difference of order one of the applicationGon two arbitrary elements of the system ( 1 ).

Now considering consecutive points of the system ( 1 ) we can consider the following divided differences:

[x1,x2;G],[x2,x3;G],,[xn,xn+1;G].

Using the divided differences defined above we define the notion of divided difference of ordern (n>1)of the applicationG.

We assume that we have defined the application[,,,;G]:Xi+1Li(X,Y) with the help of which we can define the divided differences of orderiof the applicationGoni+1any elements of the system ( 1 ).

Let[xk+1,xk+2,,xk+i+1;G]And[xk,xk+1,,xk+i;G]two divided differences of orderion the pointsxk+1,xk+2,,xk+i+1respectively xk,xk+1,,xk+1of the system ( 1 ).

Definition 2 .

The application[,,,;G]:Xi+2Li+1(X,Y)is called divided difference of orderi+1of the applicationGon the pointsxk,xk+1,,xk+i+1,if

  • 1)
    [xk,,xk+i+1;G](xk+i+1xk)=
    =[xk+1,,xk+i+1;G][xk,,xk+i;G]
  • 2)

    if there exists the derivative in the sense of Fréchet of orderi+1 of the applicationGon the pointxi,SO:

    [xs,xs,,xs;G]=1(i+1)!G(i+1)(xs).

In order to be able to use these divided differences in the construction of interpolation polynomials, it is necessary to introduce the notion of divided difference symmetrical with respect to the points considered.

Definition 3 .

The Divided Difference[x1,x2,,xk;G]Orx1,x2,,xkare points of the system ( 1 ) is said to be symmetrical with respect to the points, if we have:

(2) [x1,x2,,xk;G]=[xi1,xi2,,xik;G]

for each permutation(i1,i2,,ik)numbers (1,2,3,,k).

IfX=Y=then, equality ( 2 ) is always verified. Then the divided differences of the real functions of a real variable are symmetric with respect to the points considered.

If we now assume for exampleX=2And Y=, f:2and if we designate by (u,v), (u",v")two points of2,We have

(3) [x,x";f]=[f(u,v)f(u",v)uu",f(u",v)f(u",v")vv"]

Or

x=(u,v)And x"=(u",v").

We show that the divided difference ( 3 ) satisfies conditions ( 1 ) and ( 2 ) of definition 1 , but that it is not symmetric with respect to the pointsxAnd x".

But we can construct a divided difference on these points which is symmetric. For example if we consider the divided difference defined by:

[x,x";f]= 12[f(u,v)f](u",v)uu"+f(u",v)f(u",v")v"v,
f(u",v)f(u",v")vv"+f(u,v")f(u,v)v"v]

so we obviously have

[x,x";f]=[x",x;f]

Be it nowX=Y=C[0,1]where byC[0,1] we designate the set of functions defined and contained on the interval[0,1].Now let us consider the application F:C[0,1]×C[0,1]given by the following equality:

F(x)(s)=01K(s,t,x(t))𝑑t

OrK:[0,1]×[0,1]×is a continuous function over its entire domain of definition.

Be it nown+1-functions

(4) x1,x2,,xn+1

from spaceC[0,1]which have the property that xI(t)xi(t)ForiIand for eacht[0,1].

The application's split differenceFon functions xi,xIof the system ( 4 ) can be defined using the following equality:

[x1,x2;F]h(s)=01K(s,t,x2(t))K(s,t,x1(t))x2(t)x1(t)h(t)𝑑t

Or

hC[0,1].

It is immediately shown that the linear application defined above has the properties ( 1 ) and ( 2 ) of definition 1 .

If we designate by[xi,xi+1,,xi+s;K]the divided difference of ordersof the functionK, then the divided difference of ordersof the applicationFis defined by the following equality:

[xi,xi+1,,xi+s;F]h1h2h3=
=01[xi,xi+1,,xi+s;K]h1(t)h2(t)hs(t)𝑑t

Or

h1,h2,,hsC[0,1].

2. Interpolation

In all that follows we will assume that the divided differences are symmetrical with respect to the points considered.

We now designate, byLn:XYan application defined by the equality:

(5) Ln(x)= G(x1)+[x1,x2;G](xx1)+
+[x1,x2,,xn+1;G](xxn)(xx1)

So we have the following theorem:

Theorem 1 .

The applicationLndefined by ( 5 ) has the following properties

  • 1)

    Ln(xi)=G(xi),i=1,2,,n+1

  • 2)

    G(x)Ln(x)=[x,x1,,xn+1;G](xxn+1)(xx1) ForxX, xxi, i=1,2,,n+1.

Demonstration..

We prove equality 2) by induction. Forn=0equality 2) is obvious becauseG(x)G(x1)=[x,x1;G](xx1) which is none other than equality 1) of definition 1. We assume that equality 2) is true byn=kand we will show that it is also true forn=k+1.

Indeed, if the following equality takes place

(6) G(x)= G(x1)+[x1,x2;G](xx1)+
+[x1,x2,,xk+1;G](xxk)(xx1)
+[x,x1,,xk+1;G](xxk+1)(xx1),

then, taking into account equality

[x,x1,,xk+1;G]=[x,x1,,xk+2;G](xxk+2)+[x1,x2,..,xk+2;G]

which results from the definition of the divided difference of order k+2.The following equality results:

G(x)= G(x1)+[x1,x2;G](xx1)+
+[x1,x2,,xk+2;G](xxk+1)(xx1)
+[x,x1,,xk+2;G](xxk+2)(xx1)

which shows that equality 2) holds forn=k+1.

It remains to demonstrate the equalities 1). From the definition of the applicationLnit results:

(7) Ln(xi)= G(x1)+[x1,x2;G](xix1)+
+[x1,x2,,xi;G](xixi1)(xix1).

If we now consider the last term on the right of equality ( 7 ) and take into account the equality:

[x1,x2,,xi;G](xixi1)=
=[xi1,x1,,xi2,xi;G](xixi1)
=[x1,x2,,xi2,xi;G][xi1,x1,,xi2;G]
=[x1,x2,,xi1,xi;G][x1,x2,,xi2,xi1;G)]

we get

Ln(xi)= G(x1)+[x1,x2;G](xix1)+
+[x1,x2,,xi2,xi;G](xixi2)(xix1).

Proceeding in the same way after a finite number of steps we arrive at the following equality:

Ln(xi)=G(x1)+[x1,x2;G](xix1)=G(x1)+G(xi)G(x1)=G(xi).

The applicationLnis called the Lagrange interpolation polynomial of the applicationGon the knotsxi,i=1,2,,n+1of the system ( 1 ) ∎

3. Solving equations by interpolation

We consider the equation

(8) G(x)=θ

θis the zero element of the spaceYAndG:XY is a nonlinear operator.

To solve equation ( 8 ) we can proceed as follows.

Assuming that the points of the system ( 1 ) are in a neighborhood of the solutionx¯from equation ( 8 ), from equality 2) (theorem 1 ) the following equality results:

G(x¯)=Ln(x¯)+[x¯,x1,,xn+1;G](x¯xn+1)(x¯x1)=θ

from which it results

Ln(x¯)[x¯,x1,,xn+1;G]xx1xxn+1.

If we now assume that we have the inequalitiesx¯xiε, i=1,n+1¯Or εis a real and positive number, then we have:

Ln(x¯)[x¯,x1,,xn+1;G]εn+1

from which it follows that to find an approximation of the solution x¯you just have to solve the following equation:

(9) Ln(x)=θ.

The method outlined above has some drawbacks, including:

First, solving an equation in the form ( 9 ) even in the casen=1is a difficult mathematical problem and the difficulty of the problem increases considerably ifn>1. Even in the caseX=Y=,that is to say, whenLnis an algebraic polynomial with real coefficients, the problem can be very difficult because of the fact that an algebraic equation of degreenhasnroots, and then it is necessary to choose the root of equation ( 9 ) which approaches the rootx¯ of equation ( 8 ).

Second, it is well known that the position of the roots of an algebraic equation in the plane and the nature of these roots can change considerably when the coefficients of the equation are affected by non-essential errors.

In the following we will present another method by which some of the difficulties mentioned above can be eliminated.

We assume that linear operators

[xi,xI;G]L(X,Y),iI;I=1,2,,n+1

are reversible.

It follows thatG(xi)G(xi), iI.Be it nowDany set of elements of spaceX.We designate byG¯=G|Dthe restriction of the applicationGto the wholeD.

We assume that the applicationG¯is a homeomorphism of setsDAndG(D).Where byG(D)we have designated the image of the wholeDbyG.

We have the following theorem:

Theorem 2 .

If the linear operator[xi,xI;G], Orxi,xIDis invertible, then the equality holds:

[yi,yI,G¯1]=[xi,xI;G]1

Or

yi=G(xi)And yI=G(xI).
Demonstration..

From the definition of divided difference of order 1 it follows that

G(xI)=G(xi)=[xi,xI;G](xIxI)

Or

[xi,xI;G]1(yIyi)=xIxi

That's to say

G¯1(yI)G¯1(yi)=[xi,xI;G]1(yIyi)

from which results the equality of the statement of the theorem.

Let us nowx1, x2,,xn+1, n+1- elements of the setD.

We designate byy1,y2,,yn+1application values Gon these points, that is to sayyi=G(xi), i=1,2,,n+1.We consider the applicationLn:YX given by

(10) Ln(y)= x1+[y1,y2;G¯1](yy1)+
+[y1,,yn+1;G¯1](yyn)(yyn+1)(yy1)

which verifies equalities

(11) G¯1(y)=Ln(y)+[y,y1,,yn+1;G¯1](yyn+1)(yy1)

And

(12) G¯1(yi)=Ln(yi)=xi,i=1,2,,n+1.

If we now assume that the solutionx¯of equation ( 8 ) belongs to the setD,so obviously x¯=G¯1(θ)and from ( 11 ) we deduce:

(13) x¯=G¯1(θ)=Ln(θ)+(1)n+1[θ,y1,,yn+1;G¯1]yn+1y1.

If we designate by

(14) x¯=Ln(θ)

we deduce:

(15) x¯x¯[θ,y1,,yn+1;G¯1]yn+1y1.

From equality ( 15 ) it follows that if the elements x1,x2,,xn+1are chosen in such a way thaty1=G(xi)<ε,i=1,2,,n+1Orεis a sufficiently small real number, thenx¯is a good approximation forx¯.

For clarity we refer toLn(y1,y2,,yn+1;G¯1|y)the operatorLn(y)given by the relation ( 10 ). Now let x1, x2,,xn+1,n+1-given initial elements and either (xn)n=1the sequence generated by the recurrence relation:

(16) xi+n+1=Ln(yi,yi+1,,yi+n;G¯1|θ),i=1,2,\eqqed

We will now present some special cases of the iterative process ( 16 ).

A. Letx1Andx2two arbitrary elements of space X.If we consider the first two terms of the inverse interpolation polynomial ( 10 ) taking into account Theorem 2 , we have

(17) xk+1=xk1[xk1,xk;G]1G(xk1),k=2,3,

which is nothing other than the extension of the rope method.

B. In case we consider three initial elements x1,x2,x3Xand we limit ourselves to the first three terms of the relation ( 10 ) we have:

(18) xk= xk3[xk3,xk2;G]1G(xk3)
[xk3,xk2;G]1[xk3,xk1,xk1;G][xk3,xk1;G]1
G(xk3)[xk1,xk2;G]1G(xk2),k=4,5,

This method is an extension to linear spaceXof a method analogous to the well-known Tchebycheff method. From ( 15 ) and ( 16 ) we deduce

(19) x¯xi+n+1 [θ,yi,,yn+1;G¯1]yiyn+1,

i=1,2,

Regarding the method ( 16 ) the following two questions arise:

1. What are the conditions for the convergence of the method ( 16 ).

2. In the case of convergence of method ( 16 ), what is the speed of convergence?

Concerning the convergence speed of an iterative method of type ( 16 ), AM Ostrowski [ 1 ] showed that the order of convergence of this method does not exceed 2, for each given number of interpolation points.

In the following we will show that if at each iteration step we choose the interpolation points appropriately, then the order of convergence increases considerably.

As is well known for the resolution of an equation of the form ( 8 ) by an iterative method it is necessary to highlight an applicationQ:XXwhich has the property that each solution of equation ( 8 ) is a fixed point for the applicationQ.

An operatorQwhich has the above property is said to be an iterative operator attached to equation ( 8 ).

For the operators attached to equation ( 8 ) we can define the notion of order.

Definition 4 .

EitherDXa set of space elements XAndρ>0a real number.

We say that the iterative operatorQto orderk (k>0real number)on the wholeD,with respect to equation ( 8 ) if for eachxDwe have

(20) G(Q(x))ρG(x)k.

Let us nowQ1,Q2,..,Qn;n-iterative operators attached to equation ( 8 ) , respectively of order k1,k2,,kn.

We consider an arbitrary elementx0Xand designate byxs0;s=1,n¯the following expressions:

(21) x10=Q1(x0),xs0=Qs(xs10),s=2,3,,n

If the initial elements in the iterative method ( 16 ) are:

(22) x0,x10,,xn0

so if we writey00=G(x0),yi0=G(xi0),i=1,2,,n,we have:

(23) x1=Ln(y00,y10,,yn0;G¯1|θ).

We now assume that we have built the firsti+1 elements of the suite(xn)n=0then the elementxn+1is obtained in the following way:

We write:

(24) x1i=Q1(xi),xIi=QI(xI1i),I=2,3,,n

And

y0i=G(xi),yIi=G(xIi),I=1,2,,n

So the elementxi+1has the following form:

(25) xi+1=Ln(y0i,y1i,,yni;G¯1|θ)

From ( 15 ) and ( 25 ) we have:

(26) x¯xi+1[θ,y0i,,yni;G¯1]y0iyni.

But since we assumed that iterative operators QI,I=1,2,,nhave the orders respectivelykI,with the constantsρI;I=1,2,,nthe following relationships result:

(27) y0i=G(xi)

And

(28) ysi G(xsi)=G(Qs(xs1i))ρsG(Qs1(xs2i))ks
ρsρs1ksG(Qs2(xs3))ksks1
ρsρs1ksρs1ksks1ρ1ksks1k2G(xi)k1k2ks

Fors=1,2,,n.

If we write now

(29) Cs=ρsρs1ksρs2ksks1ρ1ksks1k2;s=2,3,,n,
(30) K=s=2nCs

And

(31) m=1+k1+k1k2++k1k2kn

and if we assume that the divided differences[θ,y0i,y1i,,yni;G¯1]are strongly bounded by the same constantM>0,then we deduce from ( 26 );

(32) x¯xi+1MKG(xi)m,i=0,1,

If we now assume that the first-order divided differences of the applicationGare strongly bounded by the same constantB>0, we have:

(33) G(xi)Bx¯xii=0,1,

Taking into account inequalities ( 33 ) and ( 32 ) we obtain:

(34) x¯xi+1MKBmx¯xi,i=0,1,

By multiplying the inequalities ( 34 ) by(MKBm)1m1and writing

δk=(MKBm)1m1x¯xk,k=0,1,,

we obtain from ( 34 )

(35) δi+1δim,i=0,1,

We will now admit that we can choose the constantsM, K, Band the initial pointx0in such a way thatδ0<1.

From inequalities ( 35 ) we deduce:

δi+1δ0mi,i=0,1,

hence taking into account the conditionδ0<1we deduce

limiδi+1=0

That's to say

limiixi+1=x¯.

We now consider some special cases of the iterative method ( 25 ).

1. If we consider a single iterative operatorQof order k=1then the iterative method ( 25 ) returns to the well-known Aitken-Steffensen method. In this case we have:m=2.

2. IfQ1=Q2==Qn,k1=k2==kn=k,Ork1 SOCs=ρkn+11k1Andm=kn+11k1.

3. IfQ1=Q2==Qn   Andk1=k2==kn=1,SO Cs=ρsAndm=n.

Noticed .

The order of convergence given by equality ( 31 ) of method ( 25 ) is the largest possible with respect to the iterative operatorsQ1, Q2,,Qn given, if the order in which they are applied to obtain the elements ( 24 ), is given by the decreasing order of the numbersk1, k2,,kn.That is, the order of convergencemis maximal ifk1k2knand minimal ifk1k2km.

Bibliography


Received, 8.IV.1980

1981

Related Posts