On best approximation of the continous functions by polynomials

Abstract

Authors

Tiberiu Popoviciu
Institutul de Calcul

Keywords

?

Paper coordinates

T. Popoviciu, On the best approximation of continuous functions by polynomials, five lectures delivered at the Faculty of Sciences, Cluj, academic year 1933–1934, Cluj, 1934.

PDF

About this paper

Journal
Publisher Name
DOI
Print ISSN
Online ISSN

google scholar link

??

Paper (preprint) in HTML form

On the Best Approximation of Continuous Functions by Polynomials.
Five lessons held at the Faculty of Science from Cluj during the academic year 1933-1934

Dr. Tiberiu Popoviciu
(Institutul de Arte Grafice Ardealul, Cluj, 1937
(English translation 2016))

Chapter 1 First Lesson. The existence and uniqueness of the best approximation polynomials.

1.1 Bounded functions. The oscillation of a function.

We will consider real valued functions f(x)f\left(x\right) of real variable x,x, defined on the bounded and closed interval [a,b],a<b\left[a,b\right],a<b.

Such a function f(x)f\left(x\right) is upper bounded if there exists a real number AA such that all values taken by the function are less than AA. On the contrary, the function is not upper bounded. Let us denote by M(f)M\left(f\right) the upper bound or the maximumof f(x)f\left(x\right). Let us remaind the definition of this number M(f)M\left(f\right): if f(x)f\left(x\right) is not upper bounded M(f)M\left(f\right) equals ++\infty and if f(x)f\left(x\right) is upper bounded M(f)M\left(f\right) is defined by the property that for any positive number ε\varepsilon, there exists at least a point xx such that

f(x)>M(f)εf\left(x\right)>M\left(f\right)-\varepsilon

and also for any xx we have

f(x)M(f).f\left(x\right)\leq M\left(f\right).

It is now fairly clear what is the meaning of the lower bounded function as well as of a function which is not lower bounded. The definition of the lower bound or of the minimum m(f)m\left(f\right) of the function f(x)f\left(x\right) is perfectly analogous with that of M(f)M\left(f\right). A function which is simultaneously upper an lower bounded is simply called a bounded function. The difference M(f)m(f)M\left(f\right)-m\left(f\right) is called the oscillation of f(x)f\left(x\right) in the interval [a,b]\left[a,b\right].

1.2 Continuous functions.

The meaning of the continuity of a function in an interval [a,b]\left[a,b\right] is well known. A continuous function in such an interval is uniformly continuous in that interval. This means that for any positive number ε\varepsilon one can determine another positive number δ\delta such that

|f(x)f(x′′)|<ε\left|f\left(x^{\prime}\right)-f\left(x^{\prime\prime}\right)\right|<\varepsilon

for any x,x′′x^{\prime},\ x^{\prime\prime} verifying the condition

|xx′′|<δ.\left|x^{\prime}-x^{\prime\prime}\right|<\delta.

A continuous function attains its maximum M(f)M\left(f\right) and its minimum m(f)m\left(f\right). Consequently, there exists at least a point xx^{\prime} such that f(x)=M(f)f\left(x^{\prime}\right)=M\left(f\right) and a point x′′x^{\prime\prime} such that f(x′′)=m(f)f\left(x^{\prime\prime}\right)=m\left(f\right). Moreover, we can state that M(f)M\left(f\right) is at the same time the upper bound of f(x)f\left(x\right). In other words, M(f)M\left(f\right) enjoys the property that for any positive number ε\varepsilon, there exists a set containing an infinity number of points xx such that

f(x)>M(f)εf\left(x\right)>M\left(f\right)-\varepsilon

and at most a finite number of points xx such that

f(x)>M(f)+ε.f\left(x\right)>M\left(f\right)+\varepsilon.

In the same way the minimum m(f)m\left(f\right) coincides with the lower bound of f(x)f\left(x\right), this lower bound being analogously defined with the upper bound. All these definitions extend to the functions of more variables defined in closed and bounded domains. Throughout these lectures we will need some other properties which will be recalled at the right moments.

1.3 The distance between two functions.

f1(x)f_{1}\left(x\right) and f2(x)f_{2}\left(x\right) being two functions M(|f1f2|)M\left(\left|f_{1}-f_{2}\right|\right) will be called their distance. If one of these functions is bounded and the other one is unbounded their distance equals infinity. If both functions are unbounded their distance can be finite. If one of the functions is bounded and their distance is finite then the other function must be bounded. The distance enjoy the following properties which can be easily proved:

10.{}^{0}.

M(|f1f2|)M\left(\left|f_{1}-f_{2}\right|\right) is a positive or null number;

20.{}^{0}.

M(|f1f2|)=0M\left(\left|f_{1}-f_{2}\right|\right)=0 implies f1(x)=f2(x)f_{1}\left(x\right)=f_{2}\left(x\right);

30.{}^{0}.

M(|Cf|)=CM(|f|)M\left(\left|Cf\right|\right)=CM\left(\left|f\right|\right), CC being a positive constant;

40.{}^{0}.

M(|f1f2|)M(|f1f3|)+M(|f2f3|)M\left(\left|f_{1}-f_{2}\right|\right)\leq M\left(\left|f_{1}-f_{3}\right|\right)+M\left(\left|f_{2}-f_{3}\right|\right).

The problem of the best approximation which follows below depends on this definition of distance.

1.4 The problem of the best approximation using polynomials.

Let us see how this problem is formulated. Let us consider the family or the set of polynomials

P(x)=a0xn+a1xn1++anP\left(x\right)=a_{0}x^{n}+a_{1}x^{n-1}+\cdots+a_{n}

of degree nn. A polynomial from this set is completely determined by the coefficients a0,a1,,ana_{0},a_{1},\ldots,a_{n} which are real positive, negative or null numbers. It means that any polynomial of degree nn is at the same time a polynomial of degree mm with m>nm>n. In other words, the set of polynomials of degree nn contains the set of all polynomials of any degree less than nn.

For an arbitrary function f(x)f\left(x\right) we say, by definition, that the distance M(|fP|)M\left(\left|f-P\right|\right) between this function and a polynomial P(x)P\left(x\right) is the error or the approximation of f(x)f\left(x\right) provided by the polynomial P(x)P\left(x\right).

For all polynomials P(x)P\left(x\right) of degree nn, M(|fP|)M\left(\left|f-P\right|\right) has a lower bound denoted mun(f)mu_{n}\left(f\right) or simpler μn\mu_{n}. μn\mu_{n} is by definition the best approximation of f(x)f\left(x\right) by polynomials of degree nn.

The problem of the best approximation using polynomials will be formulated in the following way:

Given a function f(x)f\left(x\right), one has to determine the set of polynomials P(x)P\left(x\right) of degree nn such that M(|fP|)M\left(\left|f-P\right|\right) attains its lower bound μn\mu_{n} and then to study the number μn\mu_{n}.

A polynomial P(x)P\left(x\right) of degree nn for which μn\mu_{n} is attained will be called a best approximation polynomial of degree nn of the function f(x)f\left(x\right). Shortly we say that such a polynomial is a TnT_{n} polynomial and will be denoted with Tn(x;f)T_{n}\left(x;f\right) , Tn(x)T_{n}\left(x\right) or simply TnT_{n}.

The problem of the polynomials of the best approximation has been for the first time formulated by Russian mathematician P. L. Tchebychef.

1.5 The determination of TnT_{n} in simple cases.

The problem of the best approximation can not be formulated for unbounded functions because in this situation M(|fP|)M\left(\left|f-P\right|\right) equals ++\infty, a polynomial being a bounded function (in the interval [a,b]\left[a,b\right]).

If f(x)f\left(x\right) is a polynomial of degree nn, the best approximation μn\mu_{n} equals zero because in this case the function itself is a polynomial TnT_{n}. The reciprocal statement is also true, as it follows from Section LABEL:sec:18 below.

If we know the polynomials TnT_{n} for the function f(x)f\left(x\right) we also know the polynomials TnT_{n} for f(x)+Q(x)f\left(x\right)+Q\left(x\right) and Cf(x)Cf\left(x\right) where Q(x)Q\left(x\right) is a polynomial of degree nn and is a constant CC. Indeed we have

M(|fP|)=M(|f+Q(P+Q)|)=μn(f)M\left(\left|f-P\right|\right)=M\left(\left|f+Q-\left(P+Q\right)\right|\right)=\mu_{n}\left(f\right)

and if R(x)R\left(x\right) is a polynomial of degree nn, we additionally have

M|(f+Q)R|=M(|f(RQ)|)μn(f).M\left|\left(f+Q\right)-R\right|=M\left(\left|f-\left(R-Q\right)\right|\right)\geq\mu_{n}\left(f\right).

It follows that P(x)+Q(x)P\left(x\right)+Q\left(x\right) is a polynomial TnT_{n} for the function f(x)+Q(x)f\left(x\right)+Q\left(x\right) and any polynomial TnT_{n} corresponding to this function has the form P(x)+Q(x)P\left(x\right)+Q\left(x\right). Actually we have

μn(f+Q)=μn(f).\mu_{n}\left(f+Q\right)=\mu_{n}\left(f\right).

We also have the relations

|C|M(|fP|)\displaystyle\left|C\right|M\left(\left|f-P\right|\right) =M(|CfCP|)=|C|μn(f),\displaystyle=M\left(\left|Cf-CP\right|\right)=\left|C\right|\mu_{n}\left(f\right),
M(|CfR|)\displaystyle M\left(\left|Cf-R\right|\right) =M(|CfCRC|)=|C|M(|fRC|)|C|μn(f).\displaystyle=M\left(\left|Cf-C\frac{R}{C}\right|\right)=\left|C\right|M\left(\left|f-\frac{R}{C}\right|\right)\geq\left|C\right|\mu_{n}\left(f\right).

It follows that CP(x)CP\left(x\right) is a TnT_{n} polynomial for the function Cf(x)Cf\left(x\right) and any polynomial TnT_{n} corresponding to this function has the form CP(x)CP\left(x\right). Consequently, we get

μn(Cf)=|C|μn(f).\mu_{n}\left(Cf\right)=\left|C\right|\mu_{n}\left(f\right).

1.6 A preliminary Lemma.

Let us suppose the for some polynomials P(x)P\left(x\right) of degree nn we have

|P(x)|<A,in(a,b).\left|P\left(x\right)\right|<A,in\left(a,b\right). (1.1)

We intend to show that the coefficients ara_{r} are bounded. To this goal we take n+1n+1 distinct points x1,x2,,xn+1x_{1},x_{2},\ldots,x_{n+1}, in the interval [a,b]\left[a,b\right], and consider the system

a0xrn+a1xrn1++an=P(xr),r=1,2,,n+1.a_{0}x_{r}^{n}+a_{1}x_{r}^{n-1}+\cdots+a_{n}=P\left(x_{r}\right),\ \ \ r=1,2,\ldots,n+1.

The determinant of this system does not vanish because is the Van Der Monde determinant of the numbers x1,x2,,xn+1x_{1},x_{2},\ldots,x_{n+1}. Using the Cramer’s rule we can solve for a0,a1,,ana_{0},a_{1},\ldots,a_{n} and taking into account the inequality (1.1) we find the preliminary Lemma:

If a polynomial P(x)P\left(x\right) of degree nn is bounded by AA in the interval [a,b]\left[a,b\right], then the coefficients a0,a1,,ana_{0},a_{1},\ldots,a_{n} remain bounded by λA\lambda A, where λ\lambda depends only on nn and the interval [a,b]\left[a,b\right].

The value of λ\lambda can be determined. The most important is the fact that this number does not depend on the polynomial P(x)P\left(x\right). Of course, the property remains valid whenever the polynomials are considered only on a linear and bounded set containing at least n+1n+1 distinct points.

1.7 The continuity of M(|fP|)M\left(\left|f-P\right|\right).

The maximum M(|fP|)M\left(\left|f-P\right|\right) is not surely attained unless the function f(x)f\left(x\right) is continuous.

Let ε\varepsilon be an arbitrary and let us define

A=M(|x|n+|x|n1++1).A=M\left(\left|x\right|^{n}+\left|x\right|^{n-1}+\cdots+1\right).

Let’s suppose that

|ar=ar|<εA,r=0,1,,n.\left|a_{r}=a_{r}^{\prime}\right|<\frac{\varepsilon}{A},\ r=0,1,\ldots,n.

Defining

P(x)\displaystyle P\left(x\right) =a0xn+a1xn1++an\displaystyle=a_{0}x^{n}+a_{1}x^{n-1}+\cdots+a_{n}
P1(x)\displaystyle P_{1}\left(x\right) =a0xn+a1xn1++an,\displaystyle=a_{0}^{\prime}x^{n}+a_{1}^{\prime}x^{n-1}+\cdots+a_{n}^{\prime},

we have

M(|PP1|)[max(|arar|)]M(|x|n+|x|n1++1)M\left(\left|P-P_{1}\right|\right)\leq\left[\max\left(\left|a_{r}-a_{r}^{\prime}\right|\right)\right]M\left(\left|x\right|^{n}+\left|x\right|^{n-1}+\cdots+1\right)

where as usual we denote by max(c1,c2,,ck)max\left(c_{1},c_{2},\ldots,c_{k}\right) or maxr=1,2,,k(cr)\max_{r=1,2,\ldots,k}\left(c_{r}\right) or in the simplest way max(cr)max\left(c_{r}\right) the largest number from the set c1,c2,,ckc_{1},c_{2},\ldots,c_{k}. An analogous notation will be used for the smallest number from the same set crc_{r}.Consequently, we can write

M(|PP1|)<ε.M\left(\left|P-P_{1}\right|\right)<\varepsilon.

It follows that

M(|fP|)\displaystyle M\left(\left|f-P\right|\right) M(|fP1|)+M(|PP1|)<M(|fP1|)+ε.\displaystyle\leq M\left(\left|f-P_{1}\right|\right)+M\left(\left|P-P_{1}\right|\right)<M\left(\left|f-P_{1}\right|\right)+\varepsilon.
M(|fP1|)\displaystyle M\left(\left|f-P_{1}\right|\right) M|(fP)|+M|(PP1)|<M(|fP|)+ε\displaystyle\leq M\left|\left(f-P\right)\right|+M\left|\left(P-P_{1}\right)\right|<M\left(\left|f-P\right|\right)+\varepsilon

and consequently

|M(fP)|M|(fP1)|<ε\left|M\left(f-P\right)\right|-M\left|\left(f-P_{1}\right)\right|<\varepsilon

which means:

f(x)f\left(x\right) being a continuous function, M|(fP)|M\left|\left(f-P\right)\right| is also continuous with respect to the coefficients a0,a1,,ana_{0},a_{1},\ldots,a_{n}.

Thus the lower bound μn\mu_{n} coincides with the inferior limit of the numbersM(|fP|)M\left(\left|f-P\right|\right).

1.8 The existence of the polynomials of the best approximation.

We intend to examine the existence of the polynomials TnT_{n}. From the previous section we observe that there exists an infinite sequence of polynomials of degree nn.

P1(x),P2(x),,Pm(x),P_{1}\left(x\right),P_{2}\left(x\right),\ldots,P_{m}\left(x\right),\ldots (1.2)

such that

M(|fPm|)\displaystyle M\left(\left|f-P_{m}\right|\right) μn,\displaystyle\rightarrow\mu_{n},
m\displaystyle m \displaystyle\rightarrow\infty

but this does not imply the existence of a polynomial such that the quantity μn\mu_{n} is attained or in other words the existence of a polynomial P(x)P\left(x\right) such that M(|fP|)=μnM\left(\left|f-P\right|\right)=\mu_{n}.

This is not a surprising fact. It is true that M|(fP)|M\left|\left(f-P\right)\right| is continuous with respect to the coefficients of P,P, but the range of variations of these coefficients is open and unbounded. Let’s suppose by contradiction that M(|f|)>μnM\left(\left|f\right|\right)>\mu_{n}.It is then enough to consider only polynomials PP such that

M|(fP)|<M(|f|)..M\left|\left(f-P\right)\right|<M\left(\left|f\right|\right)..

From the last result of the previous Section it is known that there exists an infinity of such polynomials of degree nn. But

M(|P|)M(|fP|)+M(|f|).M\left(\left|P\right|\right)\leq M\left(\left|f-P\right|\right)+M\left(\left|f\right|\right).

and thus

M(|P|)<2M(|f|).M\left(\left|P\right|\right)<2M\left(\left|f\right|\right). (1.3)

In other words we can assume that the polynomials (1.2) are chosen such that they satisfy (1.3. If we put

Pm=a0(m)xn+a1(m)xn1++an(m),m=1,2,P_{m}=a_{0}^{\left(m\right)}x^{n}+a_{1}^{\left(m\right)}x^{n-1}+\cdots+a_{n}^{\left(m\right)},\ m=1,2,\ldots

from Sect. 1.7 we know that there exists a number BB which depends only on M(|f|)M\left(\left|f\right|\right). [B=2λM(|f|)]\left[B=2\lambda M\left(\left|f\right|\right)\right], such that

|ar(m)|<B,r=0,1,,n;m=1,2,.\left|a_{r}^{\left(m\right)}\right|<B,\ r=0,1,\ldots,n;m=1,2,\ldots.

From the bounded sequence

a0(1),a0(2),,a0(m),.a_{0}^{\left(1\right)},a_{0}^{\left(2\right)},\ldots,a_{0}^{\left(m\right)},\ldots.

we can extract a sub sequence convergent to a limit, say aa_{\cup}^{\ast}

a0(k),a0(k12),a0(k13),,a0(k1m),a0.a_{0}^{\left(k\right)},a_{0}^{\left(k_{12}\right)},a_{0}^{\left(k_{13}\right)},\ldots,a_{0}^{\left(k_{1m}\right)},\ldots\rightarrow a_{0}^{\ast}. (1.4)

Let’s consider now the sequence

a1(k1),a1(k12),a1(k13),,a1(k1m),a_{1}^{\left(k_{1}\right)},a_{1}^{\left(k_{12}\right)},a_{1}^{\left(k_{13}\right)},\ldots,a_{1}^{\left(k_{1m}\right)},\ldots

From this sequence we can extract a sub sequence convergent to a limit, say a1a_{1}^{\ast}

a1(k1),a1(k2),a1(k23),,a1(k2m),a1a_{1}^{\left(k_{1}\right)},a_{1}^{\left(k_{2}\right)},a_{1}^{\left(k_{23}\right)},\ldots,a_{1}^{\left(k_{2m}\right)},\ldots\rightarrow a_{1}^{\ast}

We additionally have

a0(k1),a0(k2),al0(k23),,a0(k2m),a0a_{0}^{\left(k_{1}\right)},a_{0}^{\left(k_{2}\right)},al0^{\left(k_{23}\right)},\ldots,a_{0}^{\left(k_{2m}\right)},\ldots\rightarrow a_{0}^{\ast}

because this sequence is extracted from (1.4). If we repeat this procedure n+1n+1 times, eventually we see that from the sequence of polynomials (1.2) we can extract the sub sequence

Pk1,Pk2,,Pkm,P_{k_{1}},P_{k_{2}},\ldots,P_{k_{m}},\ldots

such that

ar(k1),ar(k2),,ar(km),ar(km),ar,r=0,1,,na_{r}^{\left(k_{1}\right)},a_{r}^{\left(k_{2}\right)},\ldots,a_{r}^{\left(k_{m}\right)},\ldots\rightarrow a_{r}^{\left(k_{m}\right)},\ldots\rightarrow a_{r}^{\ast},\ r=0,1,\ldots,n

where ara_{r}^{\ast} are some finite numbers.

If we define now

P(x)=a0xn+a1xn1++an,P^{\ast}\left(x\right)=a_{0}^{\ast}x^{n}+a_{1}^{\ast}x^{n-1}+\cdots+a_{n}^{\ast},

we see that

M(|fP|)=μn.M\left(\left|f-P^{\ast}\right|\right)=\mu_{n}. (1.5)

Thus the polynomial P(x)P^{\ast}\left(x\right) which satisfies the equality (1.5) is one of the best approximation of degree nn for the function f(x)f\left(x\right). We can state now the following property: For any bounded function f(x)f\left(x\right) there exists at least one polynomial of the best approximation of degree nn.

Along with the results of Sect. 1.5 we can now state:

The lower bound μn\mu_{n} vanishes if and only if f(x)f\left(x\right) reduces to a polynomial of degree nn.

We have seen that this condition is sufficient. Its necessity comes from the existence of a polynomial P(x)P\left(x\right) such that M(|fP|)=0M\left(\left|f-P\right|\right)=0, where f(x)P(x)f\left(x\right)\equiv P\left(x\right). Whenever f(x)f\left(x\right) is not a polynomial of degree nn, μn\mu_{n} is a positive number.

1.9 The Chebyshev’s polynomials for a continuous function.

We will suppose now that the function f(x)f\left(x\right) is continuous and let Tn(x)T_{n}\left(x\right) be a polynomial of the best approximation of degree nn. The difference f(x)Tn(x)f\left(x\right)-T_{n}\left(x\right) will attains at least one of the values ±μn\pm\mu_{n}.We intend to make precise the number of points at which these values are attained. Let’s suppose that

f(xr)Tn(xr)=±μn,r=1,2,,mf\left(x_{r}\right)-T_{n}\left(x_{r}\right)=\pm\mu_{n},\ r=1,2,\ldots,m

where x1,x2,,xmx_{1},x_{2},\ldots,x_{m} are mm distinct points such that mn+1m\leq n+1. In all the other points of the interval [a,b]\left[a,b\right] we have |fP|<μn\left|f-P\right|<\mu_{n}. Let Q(x)Q\left(x\right) be the LAGRANGE’s polynomial determined by the conditions

Q(xr)=f(xr)Tn(xr),r=1,2,,m.Q\left(x_{r}\right)=f\left(x_{r}\right)-T_{n}\left(x_{r}\right),\ r=1,2,\ldots,m.

The LAGRANGE’s polynomial provided by the LAGRANGE’s interpolation formula is the polynomial of the lowest degree which takes on the values A1,A2,,AkA_{1},A_{2},\ldots,A_{k} in the points x1,x2,,xkx_{1},x_{2},\ldots,x_{k}. This polynomial is unique and at most of degree k1k-1.

The polynomial Q(x)Q\left(x\right) is at most of degree nn. Let’s introduce an interval IrI_{r} centered at xrx_{r} and of length δr\delta_{r}. Given a positive number ε\varepsilon such that ε<μn\varepsilon<\mu_{n}, we can choose a positive number δ\delta and the lengths δr\delta_{r} such that:

10.{}^{0}.

taking δrδ\delta_{r}\leq\delta the intervals I1,I2,,ImI_{1},I_{2},\ldots,I_{m} have no common points;

20.{}^{0}.

the oscillation of functions f(x)Tn(x)f\left(x\right)-T_{n}\left(x\right) and Q(x)Q\left(x\right) is less than ε\varepsilon in any interval of length δ\leq\delta.

It follows immediately that in an interval IrI_{r} the functions fTnf-T_{n} and QQ do not vanish and keep a constant sign (more exactly the same sign). Let’s suppose that xrx_{r} is a point at which f(xr)Tn(xr)=Q(xr)=μnf\left(x_{r}\right)-T_{n}\left(x_{r}\right)=Q\left(x_{r}\right)=\mu_{n}, then on the interval IrI_{r} we have

μnε<fTnμn,μnε<Qμn+ε.\mu_{n}-\varepsilon<f-T_{n}\leq\mu_{n},\mu_{n}-\varepsilon<Q\mu_{n}+\varepsilon.

Let’s choose a positive λ\lambda such that

λ<μnεμn+ε.\lambda<\frac{\mu_{n}-\varepsilon}{\mu_{n}+\varepsilon}. (1.6)

Then in the interval IrI_{r} we have

0<μnελ(μn+ε)<fTn=λQ<μnλ(μnε).0<\mu_{n}-\varepsilon-\lambda\left(\mu_{n}+\varepsilon\right)<f-T_{n}=\lambda Q<\mu_{n}-\lambda\left(\mu_{n}-\varepsilon\right).

In a point xrx_{r} where f(xr)Tn(xr)=Q(xr),=μnf\left(x_{r}\right)-T_{n}\left(x_{r}\right)=Q\left(x_{r}\right),=-\mu_{n}, we have μnfTn<μn+ε,μnε<Q<μn+ε-\mu_{n}\leq f-T_{n}<-\mu_{n}+\varepsilon,\ -\mu_{n}-\varepsilon<Q<-\mu_{n}+\varepsilon and along with (1.6) we get μn+λ(μnε)<fTnλQ<μn+ε+λ(μn+ε)<0-\mu_{n}+\lambda\left(\mu_{n}-\varepsilon\right)<f-T_{n}-\lambda Q<-\mu_{n}+\varepsilon+\lambda\left(\mu_{n}+\varepsilon\right)<0. It means that in the interval IrI_{r}

|fTnλQ|<μnλ(μnε)<μn.\left|f-T_{n}-\lambda Q\right|<\mu_{n}-\lambda\left(\mu_{n}-\varepsilon\right)<\mu_{n}.

From our initial hypothesis, it follows that in all points of the closed domain obtained taking out the intervals IrI_{r} from [a,b]\left[a,b\right], we have

|fTn|μ<μn,\left|f-T_{n}\right|\leq\mu^{\prime}<\mu_{n},

where μ\mu^{\prime} is a fixed number. If we take λ\lambda small enough such that

λ<μnμ2M(|Q|)\lambda<\frac{\mu_{n}-\mu^{\prime}}{2M\left(\left|Q\right|\right)} (1.7)

we will additionally have

|λQ|\displaystyle\left|\lambda Q\right| <μnμ2,\displaystyle<\frac{\mu_{n}-\mu^{\prime}}{2},
|fTnλQ|\displaystyle\left|f-T_{n}-\lambda Q\right| |fTn|+|λQ|<μ+μnμ2=μn+μ2<μn\displaystyle\leq\left|f-T_{n}\right|+\left|\lambda Q\right|<\mu^{\prime}+\frac{\mu_{n}-\mu^{\prime}}{2}=\frac{\mu_{n}+\mu^{\prime}}{2}<\mu_{n}

except the intervals IrI_{r} and their extremities. It means that everywhere in the interval [a,b]\left[a,b\right], we have

|fTnλQ|<μn.\left|f-T_{n}-\lambda Q\right|<\mu_{n}.

Thus, if λ\lambda verifies the inequalities (1.6) and (1.7) the polynomial Tn+λQT_{n}+\lambda Q provides a better approximation which is contrary to the hypothesis. The following property follows:

The difference f(x)Tn(x)f\left(x\right)-T_{n}\left(x\right) attains the values ±μn\pm\mu_{n} in n+2n+2 points.

1.10 The previous result revisited.

We can supplement the previous result. The difference f(x)Tn(x)f\left(x\right)-T_{n}\left(x\right) must attain both values +μn+\mu_{n} and μn-\mu_{n}.If we suppose for instance that +μn+\mu_{n} can not be attained then we would everywhere have

μnfTnμ<μn,-\mu_{n}\leq f-T_{n}\leq\mu^{\prime}<\mu_{n},

μ\mu^{\prime} being a fixed number. Taking a positive constant λ\lambdawe can write

μn+λfTn+λμ+λ.-\mu_{n}+\lambda\leq f-T_{n}+\lambda\leq\mu^{\prime}+\lambda.

Thus, if we take λ<μnμ\lambda<\mu_{n}-\mu^{\prime}, then everywhere we have

|fTn+λ|<μn.\left|f-T_{n}+\lambda\right|<\mu_{n}.

It means that the polynomial TnλT_{n}-\lambda provides a better approximation which represents a contradiction. Moreover, we can precisely estimate the number of points where μn\mu_{n} and respectively μn-\mu_{n} are actually attained. Let’s suppose by instance that

f(xr)Tn(xr)=μn,r=1,2,,m,f\left(x_{r}\right)-T_{n}\left(x_{r}\right)=\mu_{n},\ r=1,2,\ldots,m,

and in all the other points the following double inequality is valid

μnfTn<μn.-\mu_{n}\leq f-T_{n}<\mu_{n}.

Let again be the intervals IrI_{r} centered at xrx_{r} and with length δr\delta_{r} such that the intervals IrI_{r} are disjoints. Let xr,xr′′x_{r}^{\prime},x_{r}^{\prime\prime} be the end points of the interval IrI_{r} and let define the polynomial

Q(x)=(xx1)(xx1′′)(xx2′′)(xxm)(xxm′′).Q\left(x\right)=\left(x-x_{1}^{\prime}\right)\left(x-x_{1}^{\prime\prime}\right)\left(x-x_{2}^{\prime\prime}\right)\ldots\left(x-x_{m}^{\prime}\right)\left(x-x_{m}^{\prime\prime}\right).

We have Q(x)<0Q\left(x\right)<0 in the open interval IrI_{r} and Q(x)>0Q\left(x\right)>0 outside the closed intervals IrI_{r}. We can take δ\delta small enough such that for δrδ\delta_{r}\leq\delta, in the intervals IrI_{r}

μfTnμn,\mu^{\prime}\leq f-T_{n}\leq\mu_{n},

μ\mu^{\prime} being a positive number such that μ<μn\mu^{\prime}<\mu_{n}. If the positive number λ\lambda verifies the inequality

λ<μM(|Q|),\lambda<\frac{\mu^{\prime}}{M\left(\left|Q\right|\right)}, (1.8)

we have in the intervals IrI_{r}

0<μ+λQfTn+λQμn+λQ<μn.0<\mu^{\prime}+\lambda Q\leq f-T_{n}+\lambda Q\leq\mu_{n}+\lambda Q<\mu_{n}.

The last inequality is justified because we could not have the equality but in a point where we would have simultaneously fT=μnf-T=\mu_{n} and Q=0Q=0. But by construction such points do not exist. Everywhere in the closed domain [a,b]\left[a,b\right] minus the intervals IrI_{r}, we have

μnfTnμ′′<μn,-\mu_{n}\leq f-T_{n}\leq\mu^{\prime\prime}<\mu_{n},

μ′′\mu^{\prime\prime} being fixed. Taking λ\lambdasuch that

λ<μnμ′′2M(|Q|),\lambda<\frac{\mu_{n}-\mu^{\prime\prime}}{2M\left(\left|Q\right|\right)}, (1.9)

we have in this domain

μn<μn+λQfTnλQ<μ′′+μnμ′′2=μn+μ′′2<μn.\mu_{n}<\mu_{n}+\lambda Q\leq f-T_{n}-\lambda Q<\mu^{\prime\prime}+\frac{\mu_{n}-\mu^{\prime\prime}}{2}=\frac{\mu_{n}+\mu^{\prime\prime}}{2}<\mu_{n}.

We can justify the first inequality as above.

For λ\lambda obeying the inequalities (1.8) and (1.9) we have everywhere in the interval [a,b]\left[a,b\right]

|fTn+λQ|<μn\left|f-T_{n}+\lambda Q\right|<\mu_{n}

and we see that the polynomial TnλQT_{n}-\lambda Q provides a better approximation than μn.\mu_{n}. The polynomial Q(x)Q\left(x\right) has degree 2m2m and we come to a contradiction if 2mn2m\leq n. If xrx_{r} would be points where μn-\mu_{n} are attained we can make absolutely analogous considerations, so after all we can state the following property: The difference f(x)Tn(x)f\left(x\right)-T_{n}\left(x\right) attains in at least |n+22|\left|\frac{n+2}{2}\right| points the values μn\mu_{n} and at least in |n+22|\left|\frac{n+2}{2}\right| points the values μn-\mu_{n}. [α]\left[\alpha\right] signifies the largest integer less or equal to α\alpha. The properties analyzed in Sections 1.9 and 1.10 have been elegantly improved by E. Borel as we will see below.

1.11 The set of polynomials TnT_{n}.

Let’s suppose that the function f(x)f\left(x\right) admits two distinct polynomials TnT_{n}. If P,P1P,P_{1}are these two polynomials we have

M(|fP|)=M(|fP1|)=μn.M\left(\left|f-P\right|\right)=M\left(\left|f-P_{1}\right|\right)=\mu_{n}.

If α,β\alpha,\beta are two positive numbers we can write

μn\displaystyle\mu_{n} M(|fαP+βP1α+β|)=M(|αfPα+β+β(fP1)α+β|)\displaystyle\leq M\left(\left|f-\frac{\alpha P+\beta P_{1}}{\alpha+\beta}\right|\right)=M\left(\left|\frac{\alpha^{\prime}f-P}{\alpha+\beta}+\frac{\beta\left(f-P_{1}\right)}{\alpha+\beta}\right|\right)\leq (1.10)
α\(|fP|)+βM|(fP1)|α+β=μn\displaystyle\leq\frac{\alpha\backslash\left(\left|f-P\right|\right)+\beta M\left|\left(f-P_{1}\right)\right|}{\alpha+\beta}=\mu_{n}
M(|fαP+νP1α+β|)=μn.\displaystyle\therefore M\left(\left|f-\frac{\alpha P+\nu P_{1}}{\alpha+\beta}\right|\right)=\mu_{n}.

It means that the polynomial αP+βP1α+β\frac{\alpha P+\beta P_{1}}{\alpha+\beta} is another TnT_{n} polynomial and consequently we can state:

If a bounded function admits two distinct polynomials TnT_{n} it admits an infinity (uncountable) number of such polynomials.

To each polynomial P(x)=a0xn+a1xn1++anP\left(x\right)=a_{0}x^{n}+a_{1}x^{n-1}+\cdots+a_{n}we can assign a point AA of coordinates a0,a1,,ana_{0},a_{1},\ldots,a_{n} from the n+1n+1 dimensional Euclidean space. As a consequence of our previous results we can state the following:

The points AA corresponding to the polynomials TnT_{n} attached to a bounded function are organized as a convex, bounded and closed domain.

If the polynomial TnT_{n} is unique this domain reduces to a single point. If the interval [a,b]\left[a,b\right] is symmetric with respect to the origin, i.e., a=ba=-b and if the function f(x)f\left(x\right) is even, i.e., f(x)=f(x)f\left(-x\right)=f\left(x\right), then, there exists an even polynomial TnT_{n}. Indeed, it is easy to observe that Tn(x)T_{n}\left(-x\right) it is also a TnT_{n} polynomial. In the same way the polynomial Tn(x)+Tn(x)2,\frac{T_{n}\left(x\right)+T_{n}\left(-x\right)}{2}, is an even one. In this situation μ2n+1(f)=μ2n(f)\mu_{2n+1}\left(f\right)=\mu_{2n}\left(f\right). If the function is odd, i.e., f(x)=f(x)f\left(-x\right)=-f\left(x\right), there exists an odd polynomial TnT_{n}. In this case μ2n(f)=μ2n1(f)\mu_{2n}\left(f\right)=\mu_{2n-1}\left(f\right).

1.12 The uniqueness of Chebyshev’s polynomials.

The previous discussion enables us to state the following important conclusion. If P,P1P,P_{1} are two distinct polynomials TnT_{n} the polynomial P2=P+P12P_{2}=\frac{P+P_{1}}{2} it is also a TnT_{n} polynomial. The inequality (1.10) shows that in a point xx^{\prime} where we have f(x)P2(x)=±μnf\left(x^{\prime}\right)-P_{2}\left(x^{\prime}\right)=\pm\mu_{n}, we also must have

f(x)P(x)\displaystyle f\left(x^{\prime}\right)-P\left(x^{\prime}\right) =f(x)P1(x)=±μn\displaystyle=f\left(x^{\prime}\right)-P_{1}\left(x^{\prime}\right)=\pm\mu_{n}
P(x)=P1(x).\displaystyle\therefore P\left(x^{\prime}\right)=P_{1}\left(x^{\prime}\right).

According to the above properties the polynomials P,P1P,P_{1} coincide in at least n+2n+2 points. It means that they are identical. The following property is now fairly clear:

A continuous function f(x)f\left(x\right) admits a unique polynomial of the best approximation of degree nn.

Actually the uniqueness follows from the property proved at Sect. 1.9. More exactly, this uniqueness follows solely from the fact that |fTn||f-T_{n}| attains its maximum in at least n+1n+1 points. Indeed, two polynomials of degree nn which coincide in n+1n+1 points are identical.

If the interval [a,b]\left[a,b\right] is symmetric with respect to the origin and f(x)f\left(x\right) is an even function then Tn(x)T_{n}\left(x\right) is also even and T2n+1T2nT_{2n+1}\equiv T_{2n}. If the function is odd the polynomial Tn(x)T_{n}\left(x\right) has the same property and T2nT2n1T_{2n}\equiv T_{2n-1}.

If a function is not continuous the polynomial TnT_{n} generally is not unique. We observe that T0T_{0} is always unique and equals M(f)+m(f)2\frac{M\left(f\right)+m\left(f\right)}{2}.Let’s introduce the function

f(x)={1,1x<0 1, 0x1f\left(x\right)=\left\{\begin{array}[c]{c}-1,\ -1\leq x<0\\ \ \ \ 1,\ \ \ \ \ \ 0\leq x\leq 1\end{array}\right.

We must have μn1\mu_{n}\geq 1. But the null polynomial provides the approximation 11 such that μn=1\mu_{n}=1 for every nn. All polynomials TnT_{n} must vanish in the origin. The polynomials CxCx where CC is a constant are TnT_{n} polynomials for 0C20\leq C\leq 2 and for any n>0n>0.

Chapter 2 Second lesson. The results of E. Borel.

2.1 The difference f(x)P(x)f\left(x\right)-P\left(x\right).

We will suppose that the function f(x)f\left(x\right) is continuous and we will take a continuous polynomial P(x)P\left(x\right) of degree nn. Let’s consider the difference ϕ(x)=f(x)P(x)\phi\left(x\right)=f\left(x\right)-P\left(x\right) which is also a continuous function.

We will say that a point of the interval [a,b]\left[a,b\right] is an xx^{\prime} point if ϕ(x)=M(|ϕ|)\phi\left(x^{\prime}\right)=M\left(\left|\phi\right|\right) and an x′′x^{\prime\prime} point if ϕ(x′′)=M(|ϕ|)\phi\left(x^{\prime\prime}\right)=-M\left(\left|\phi\right|\right).

Let now be ε<M(|ϕ|)2\varepsilon<\frac{M\left(\left|\phi\right|\right)}{2} a positive number and δ>0\delta^{\prime}>0 another number such that the oscillation of ϕ(x)\phi\left(x\right) in an interval shorter than δ\delta^{\prime} is less than ε\varepsilon. Let’s divide the interval [a,b]\left[a,b\right] in rr sub intervals

I1,I2,,IrI_{1},I_{2},\ldots,I_{r} (2.1)

of the same length δ\delta which is smaller than δ\delta^{\prime}. An interval IrI_{r} can or can not contains points x,x′′,x^{\prime},x^{\prime\prime}, but can contain only points of the same kind.

Let Is1I_{s_{1}} be the first interval in the sequence (2.1) which contains an xx^{\prime} or an x′′x^{\prime\prime}. To fix ideas let’s suppose that it contains one or more xx^{\prime} points. Let then be Is2I_{s_{2}} the first interval following Is1I_{s_{1}} which contains x′′x^{\prime\prime} points. In between Is1I_{s_{1}} and Is2I_{s_{2}} there exists at least three consecutive intervals which do not contain neither xx^{\prime} points nor x′′x^{\prime\prime} points. If we denote by ξ1\xi_{1} the middle of the interval Is22I_{s_{2}-2} there do not exist xx^{\prime} or x′′x^{\prime\prime} points in an interval of length 3δ3\delta centered at ξ1\xi_{1}. Let Is3I_{s_{3}} be the first interval successive to Is2I_{s_{2}} which contains xx^{\prime} points. Let the point ξ2\xi_{2} be the middle point of Is32I_{s_{3}-2}. The point ξ2\xi_{2} enjoys the same properties as ξ1\xi_{1}. Working analogously along all the intervals of (2.1) we find the sequence a,ξ1,ξ2,,ξm1,b,a,\xi_{1},\xi_{2},\ldots,\xi_{m-1},b, which determinates a sequence of mm successive and closed intervals

L1,L2,,Lm.L_{1},L_{2},\ldots,L_{m}. (2.2)

These intervals enjoy the following properties:

10.{}^{0}.

There exists at least one interval LsL_{s}.

20.{}^{0}.

The division points ξs\xi_{s} are separated from the points xx^{\prime} and x′′x^{\prime\prime} by segments of length 32δ\frac{3}{2}\delta .

30.{}^{0}.

Each interval LsL_{s} contains xx^{\prime} or x′′x^{\prime\prime}points. If LsL_{s} contains xx^{\prime} points, then the intervals Ls1,Ls+1L_{s-1},L_{s+1} contain x′′x^{\prime\prime} points.

In an interval LsL_{s} which contains xx^{\prime} points, m(ϕ)m\left(\phi\right) can not equate M(|ϕ|)-M\left(\left|\phi\right|\right); and in an interval containing x′′x^{\prime\prime} points, M(ϕ)M\left(\phi\right) can not equate M(|ϕ|)M\left(\left|\phi\right|\right). We deduce that there exists a positive η\eta such that in every interval LsL_{s} we have

ϕ>M(|ϕ|)+ηϕ<M|(ϕ)|η\phi>-M\left(\left|\phi\right|\right)+\eta\ \ \ \ \ \ \phi<M\left|\left(\phi\right)\right|-\eta

according to the fact that LsL_{s} contains xx^{\prime} or x′′x^{\prime\prime} points.

2.2 The fundamental property of TnT_{n} polynomials.

If we take ϕ(x)=f(x)Tn(x)\phi\left(x\right)=f\left(x\right)-T_{n}\left(x\right) , then M(|ϕ|)=μnM\left(\left|\phi\right|\right)=\mu_{n}. Let’s suppose that the number of intervals in (2.2) is mm less than n+2n+2, i.e., mn+1m\leq n+1.In these conditions the polynomial

Q(x)=(xξ1)(xξ2)(xξm1)Q\left(x\right)=\left(x-\xi_{1}\right)\left(x-\xi_{2}\right)\ldots\left(x-\xi_{m-1}\right)

effectively has the degree nn. Let’s determine the constant λ\lambda such that λQ>0\lambda Q>0 in the interior of the intervals LsL_{s} which contain xx^{\prime} points and

|λ|<ηM(|Q|)\left|\lambda\right|<\frac{\eta}{M\left(\left|Q\right|\right)}

where η\eta is the number found out at the end of the previous Section. In every point of the interval Ls\ \ L_{s}, which contains xx^{\prime} points, we have

μn+ηη<fTnλQ<μn,-\mu_{n}+\eta-\eta<f-T_{n}-\lambda Q<\mu_{n},

and if LsL_{s} contains x′′x^{\prime\prime} points

μn<fTnλQ<μnη+η.-\mu_{n}<f-T_{n}-\lambda Q<\mu_{n}-\eta+\eta.

Thus, in the whole interval [a,b]\left[a,b\right] we have

|fTnλQ|<μn,\left|f-T_{n}-\lambda Q\right|<\mu_{n},

which contradicts the fact that TnT_{n} is a best approximation polynomial of degree nn. Consequently,

If TnT_{n} is the best approximation polynomial of degree nn for the continuous function f(x)f\left(x\right), the difference f(x)Tn(x)f\left(x\right)-T_{n}\left(x\right) attains the values ±μn\pm\mu_{n} in at least n+2n+2 consecutive points with alternating signs.

2.3 The first Borel’s theorem.

Let PP be a polynomial of degree nn, distinct from TnT_{n} and let’s suppose that the difference f(x)P(x)f\left(x\right)-P\left(x\right) attains the values ±M(|fP|)\pm M\left(\left|f-P\right|\right) in at least n+2n+2 consecutive points with alternating signs. Let x1<x1′′<x2<x2′′<x_{1}^{\prime}<x_{1}^{\prime\prime}<x_{2}^{\prime}<x_{2}^{\prime\prime}<\ldots be n+2n+2 points where ±M(|fP|)\pm M\left(\left|f-P\right|\right) is alternatively attained, xrx_{r}^{\prime} being xx^{\prime} points and xr′′x_{r}^{\prime\prime} being x′′x^{\prime\prime} points (the sequence could start also with (x′′)(x^{\prime\prime})).We have

M(|fTn|)<M(|fP|).M\left(\left|f-T_{n}\right|\right)<M\left(\left|f-P\right|\right).

If we introduce the function

ψ(x)=(f(x)P(x))(f(x)Tn(x))=Tn(x)P(x)\psi\left(x\right)=\left(f\left(x\right)-P\left(x\right)\right)-\left(f\left(x\right)-T_{n}\left(x\right)\right)=T_{n}\left(x\right)-P\left(x\right)

it follows

ψ(x1)>0,ψ(x1′′)<0,ψ(x2)>0,ψ(x2′′)<0,\psi\left(x_{1}^{\prime}\right)>0,\ \psi\left(x_{1}^{\prime\prime}\right)<0,\ \psi\left(x_{2}^{\prime}\right)>0,\ \psi\left(x_{2}^{\prime\prime}\right)<0,\ldots

and thus the function ψ(x)\psi\left(x\right) vanishes n+1n+1 times. But this function is a polynomial of degree nn and thus we get TnPT_{n}\equiv P. With these results we can state the following theorem which will be called the first Borel’s theorem:

A polynomial PP is a TnT_{n} polynomial of the best approximation for a continuous function f(x)f\left(x\right) , if and only if the difference fPf-P attains its maximum absolute value in at least n+2n+2 consecutive points with alternating signs. This property can be formulated alternatively in the following way:

Let xrx_{r} be the points where the difference |fP|\left|f-P\right| attains its maximum value. The polynomial PP is a TnT_{n} polynomial of the best approximation for the continuous function f(x)f\left(x\right), if and only if there does not exist a polynomial Q(x)Q\left(x\right) of degree nn which in xrx_{r}, takes non vanishing values of the same sign with f(xr)P(xr)f\left(x_{r}\right)-P\left(x_{r}\right).

The condition is sufficient. If P(x)P\left(x\right) is a TnT_{n}, polynomial of the best approximation, we can write

|f(xr)P(xr)|>|f(xr)Tn(xr)|\left|f\left(x_{r}\right)-P\left(x_{r}\right)\right|>\left|f\left(x_{r}\right)-T_{n}\left(x_{r}\right)\right|

and so

sg(Tn(xr)P(xr))=sg(f(xr)P(xr))sg\left(T_{n}\left(x_{r}\right)-P\left(x_{r}\right)\right)=sg\left(f\left(x_{r}\right)-P\left(x_{r}\right)\right)

because

TnP=(fP)(fTn).T_{n}-P=\left(f-P\right)-\left(f-T_{n}\right).

It follows that

Q(x)=Tn(x)P(x)Q\left(x\right)=T_{n}\left(x\right)-P\left(x\right)

contradicts our hypothesis.

The condition is necessary Among the points xrx_{r} we can choose n+2n+2 consecutive points x1<x2<<xn+2x_{1}<x_{2}<\ldots<x_{n+2}..., where ±μn\pm\mu_{n} is alternatively attains by the difference fTnf-T_{n}.

Let’s define Q(x)=a0Q\left(x\right)=a_{0} xn+a1xn1++anx^{n}+a_{1}x^{n-1}+\cdots+a_{n} a polynomial such that

sgQ(xr)\displaystyle sgQ\left(x_{r}\right) =sg(f(xr)Tn(xr))\displaystyle=sg\left(f\left(x_{r}\right)-T_{n}\left(x_{r}\right)\right)
r\displaystyle r =1,2,,n+2.\displaystyle=1,2,\ldots,n+2.

We must have

a0xrn+a1xrn1++an\displaystyle a_{0}x_{r}^{n}+a_{1}x_{r}^{n-1}+\cdots+a_{n} =Q(xr)\displaystyle=Q\left(x_{r}\right) (2.3)
r\displaystyle r =1,2,,n+2.\displaystyle=1,2,\ldots,n+2.

Actually we have a system of n+2n+2 equations involving only n+1n+1 unknowns a0,a1,,ana_{0},a_{1},\ldots,a_{n}. Its compatibility implies the fact that its characteristic determinant vanishes. Let’s denote by

V(α1,a2,,αk)=|1αrαr2αrk1|V\left(\alpha_{1},a_{2},\ldots,\alpha_{k}\right)=\left|1\alpha_{r}\alpha_{r}^{2}\ldots\alpha_{r}^{k-1}\right| (2.4)

the Van Der Monde determinant of the numbers α1,α2,,αk.\alpha_{1},\alpha_{2},\ldots,\alpha_{k}.

If α1<α2<<αk\alpha_{1}<\alpha_{2}<\cdots<\alpha_{k} then V(α1,α2,,αk)>0V\left(\alpha_{1},\alpha_{2},\ldots,\alpha_{k}\right)>0. The characteristic determinant of system (2.3) equals, possibly with the exception of a sign, the sum

r=1n+2|Q(xr)|V(x1,x2,,xr1,)\sum_{r=1}^{n+2}\left|Q\left(x_{r}\right)\right|V\left(x_{1},x_{2},,x_{r-1},\right)

and does not vanish. Thus, the system (2.3) is incompatible and the theorem is proved. It is possible to show that the first Borel’s theorem follows from the above property. It means that both statements are equivalent. From the previous theorem it follows that just in case the number of intervals (2.2) is larger than n+2n+2 we have

TnTn+1Tm2.T_{n}\equiv T_{n+1}\equiv\cdots\equiv T_{m-2}.

2.4 On the distribution of zeros of TnTn1T_{n}-T_{n-1} polynomials.

From the previous results another interesting property is available. Let’s suppose that the polynomials Tn1,TnT_{n-1},T_{n} are not identical and then μn1>μn\mu_{n-1}>\mu_{n}. Let x1<x1′′<x2<x_{1}^{\prime}<x_{1}^{\prime\prime}<x_{2}^{\prime}<\cdots be the n+1n+1 points where fTn1f-T_{n-1} arbitrarily attains ±μn1\pm\mu_{n-1}.If we define

ψ(x)=(fTn1)(fTn)=TnTn1,\psi\left(x\right)=\left(f-T_{n-1}\right)-\left(f-T_{n}\right)=T_{n}-T_{n-1},

we get

ψ(x1)>0,ψ(x1′′)<0,ψ(x2)>0,\psi\left(x_{1}^{\prime}\right)>0,\ \psi\left(x_{1}^{\prime\prime}\right)<0,\ \psi\left(x_{2}^{\prime}\right)>0,\ldots

and consequently ψ(x)\psi\left(x\right) vanishes in at least nn distinct points in [a,b]\left[a,b\right]. Thus we have the following property:

If Tn1,TnT_{n-1},T_{n} are two consecutive polynomials of the best approximation of a continuous function, the equation TnTn1=0T_{n}-T_{n-1}=0 has real and distinct conditions in (a,b)\left(a,b\right).

2.5 The TnT_{n} polynomials for functions of order nn.

Let’s go back to the notation (2.4) for the Van Der Monde determinant. Let’s denote by U(α1,α2,,αk;f)U\left(\alpha_{1},\alpha_{2},\ldots,\alpha_{k};f\right) the determinant obtained from V(α1,α2,,αk)V\left(\alpha_{1},\alpha_{2},\ldots,\alpha_{k}\right) by replacing the entries in the last column respectively with f(α1),f(α2),,f(αk),f\left(\alpha_{1}\right),f\left(\alpha_{2}\right),\ldots,f\left(\alpha_{k}\right), and thus

U(α1,α2,,αk;f)=|αrαrαrk2f(xr)|U\left(\alpha_{1},\alpha_{2},\ldots,\alpha_{k};f\right)=\left|\alpha_{r}\alpha_{r}\ldots\alpha_{r}^{k-2}f\left(x_{r}\right)\right| (2.5)

The ratio

[α1,α2,,αk;f]=U(α1,α2,,αk;f)V(α 1),α2,,αk\left[\alpha_{1},\alpha_{2},\ldots,\alpha_{k};f\right]=\frac{U\left(\alpha_{1},\alpha_{2},\ldots,\alpha_{k};f\right)}{V\left(\alpha\,1\right),\alpha_{2},\ldots,\alpha_{k}}

is called the divided difference of order k1k-1 of the function f(x)f\left(x\right) on the points α1,α2,,αk\alpha_{1},\alpha_{2},\ldots,\alpha_{k}. It is clear that this divided difference is symmetric with respect to the points α1,α2,,αk\alpha_{1},\alpha_{2},\ldots,\alpha_{k}.

If the divided difference [x1,x2,,xn+2;f]\left[x_{1},x_{2},\ldots,x_{n+2};f\right] of the function f(x)f\left(x\right) does not change sign for any n+2n+2 distinct points x1,x2,,xn+2x_{1},x_{2},\ldots,x_{n+2} from [a,b]\left[a,b\right] we will say that the function is of nn in this interval. More exactly, the function f(x)f\left(x\right) is convex, nonconcave, polynomial, nonconvex or concave of order nn in (a,b)\left(a,b\right) if we have

[x1,x2,,xn+2;f]>,,=,,or<0\left[x_{1},x_{2},\ldots,x_{n+2};f\right]>,\geq,=,\leq,or<0

in this interval.222For the properties of these functions one can see Tiberiu POPOVICIU ”Sur quelques propriétés des fonctions d’une ou de deux variables réelles”. Thèse, Paris (Iunie 1933) sau Mathematica vol.VIII pp.1-86. The polynomial function of order nn is a polynomial of degree nn. Conversely, any polynomial of degree nn is a (polynomial) function of order nn. The convexity character of order nn of a function does not change by adding a polynomial of degree nn. The functions defined in this mode have the following property: A function of order nn can not take in more than n+2n+2 consecutive points non vanishing values with alternative sign.The proof is based on the formula

U(α1,α2,,αk;f)\displaystyle U\left(\alpha_{1},\alpha_{2},\ldots,\alpha_{k};f\right) =\displaystyle= (2.6)
=j=1k(1)kjf(αj)V(α1,,αj1,αj+1,,αk)\displaystyle=\sum_{j=1}^{k}\left(-1\right)^{k-j}f\left(\alpha_{j}\right)V\left(\alpha_{1},\ldots,\alpha_{j-1},\alpha_{j+1},\ldots,\alpha_{k}\right)

which will be useful later. If the property would not be true there would exist n+3n+3 points x1<x2<<xn+3x_{1}<x_{2}<\ldots<x_{n+3} where f(x)f\left(x\right) could takes on non vanishing values with alternative sign. Thus, we would have

[sgf(xr)][sgf(xr+1)]=1,r=1,2,,n+2.\left[sgf\left(x_{r}\right)\right]\cdot\left[sgf\left(x_{r+1}\right)\right]=-1,\ r=1,2,\ldots,n+2.

But using the formula (2.6) we have

sg[x1,x2,,xn+2;f]\displaystyle sg\left[x_{1},x_{2},\ldots,x_{n+2};f\right] =sgf(xn+2)\displaystyle=sg\ f\left(x_{n+2}\right)
sg[x2,x3,,xn+3;f]\displaystyle sg\left[x_{2},x_{3},\ldots,x_{n+3};f\right] =sgf(xn+3)\displaystyle=sg\ f\left(x_{n+3}\right)

which contradicts the convexity property. Our statement is thus proved. We have made the restrictive hypothesis that the function f(x)f\left(x\right) does not vanish in the considered points. One can easily find how this statement can be modified when this hypothesis is neglected. We need this property only in a formal way. The previous property applies also to the function fPf-P, where PP is a polynomial of degree nn. Particularly, we will apply the above property to the function fTnf-T_{n} just in the points where this difference attains the values μn\mu_{n}.

If TnT_{n} is the best approximation polynomial of degree nn of the continuous function f(x)f\left(x\right) of order nn(which is not a polynomial one) then, there exits n+2n+2 and only n+2n+2 consecutive points where the difference fTnf-T_{n} attains the values ±μn\pm\mu_{n} with alternative sign.

In other words, we can say that: if the continuous function f(x)f\left(x\right) is of order nn (and it is not a polynomial one) the polynomials Tn,Tn+1T_{n},\ T_{n+1} are for sure distinct. In this case Tn+1T_{n+1} effectively has degree n+1n+1 and μn+1<μn\mu_{n+1}<\mu_{n}.

2.6 The second Borel’s Theorem.

E. Borel showed that the correspondence between a continuous function and its best approximation polynomial is continuous. Let ff and ff^{\ast} and Tn,TnT_{n},T_{n}^{\ast} their best approximation polynomials of degree nn. Let x1<x1′′<x2<x2′′<x_{1}^{\prime}<x_{1}^{\prime\prime}<x_{2}^{\prime}<x_{2}^{\prime\prime}<\ldots be n+2n+2 points where fTnf-T_{n} takes alternatively the values ±μn\pm\mu_{n}.We can write

M(|fTn|)M(|fTn|)\displaystyle M\left(\left|f^{\ast}-T_{n}^{\ast}\right|\right)M\left(\left|f^{\ast}-T_{n}\right|\right) M(|fTn|)+M(|ff|)\displaystyle\leq M\left(\left|f-T_{n}\right|\right)+M\left(\left|f-f^{\ast}\right|\right)
M(|fTn|)μn+η,\displaystyle\therefore M\left(\left|f^{\ast}-T_{n}^{\ast}\right|\right)\leq\mu_{n}+\eta,

where we have defined for simplicity η=M(|ff|)\eta=M\left(\left|f-f^{\ast}\right|\right). We have

TnTn=fTn(fTn)+(ff),T_{n}-T_{n}^{\ast}=f^{\ast}-T_{n}^{\ast}-\left(f-T_{n}\right)+\left(f-f^{\ast}\right),

in a point xrx_{r}^{\prime}

TnTnμn+ημn+η=2η,T_{n}-T_{n}^{\ast}\leq\mu_{n}+\eta-\mu_{n}+\eta=2\eta,

and in a point xr′′x_{r}^{\prime\prime}

TnTnμnη+μnη=2η.T_{n}-T_{n}^{\ast}\geq-\mu_{n}-\eta+\mu_{n}-\eta=-2\eta.

We intend to show that at least in one of the intervals (x1,x1′′),(x1′′,x2),(x2,x2′′),\left(x_{1}^{\prime},x_{1}^{\prime\prime}\right),\left(x_{1}^{\prime\prime},x_{2}^{\prime}\right),\left(x_{2}^{\prime},x_{2}^{\prime\prime}\right),\ldots we can write the inequality

|TnTn|2η.\left|T_{n}-T_{n}^{\ast}\right|\leq 2\eta. (2.7)

Let’s suppose the contrary. There exists the points x1,x2,,xn+1x_{1},x_{2},\ldots,x_{n+1} in these n+1n+1 intervals such that

|Tn(xr)Tn(xr)|>2η,r=1,2,,n+1.\left|T_{n}\left(x_{r}\right)-T_{n}^{\ast}\left(x_{r}\right)\right|>2\eta,\ r=1,2,\ldots,n+1.

It follows that

Tn(xr)T(xr)2η,Tn(xr+1)Tn(xr+1)2η.T_{n}\left(x_{r}^{\prime}\right)-T^{\ast}\left(x_{r}^{\prime}\right)\leq 2\eta,\ T_{n}\left(x_{r+1}^{\prime}\right)-T_{n}^{\ast}\left(x_{r+1}^{\prime}\right)\leq 2\eta.

Two possibilities can occur:

10.{}^{0}.

We can have one of the inequalities

Tn(x2r1)T2(x2r1)>2η,Tn(x2r)Tn(x2r)>2η.T_{n}\left(x_{2r-1}\right)-T_{2}^{\ast}\left(x_{2r-1}\right)>2\eta,\ T_{n}\left(x_{2r}\right)-T_{n}^{\ast}\left(x_{2r}\right)>2\eta.
20.{}^{0}.

Or both inequalities

Tn(x2r1)Tn(x2r1)<2η,Tn(x2r)Tn(x2r)<2ηT_{n}\left(x_{2r-1}\right)-T_{n}^{\ast}\left(x_{2r-1}\right)<-2\eta,\ T_{n}\left(x_{2r}\right)-T_{n}^{\ast}\left(x_{2r}\right)<-2\eta

are satisfied.

The points x2r1,x2rx_{2r-1},x_{2r} belong to the intervals (xr,xr+1)\left(x_{r}^{\prime},x_{r+1}^{\prime}\right). One can see that in the case 10 the difference TnTnT_{n}-T_{n}^{\ast} has a (relative) maximum in this interval. In case 20 we additionally take into account the relation

Tn(xr′′)Tn(xr′′)2ηT_{n}\left(x_{r}^{\prime\prime}\right)-T_{n}^{\ast}\left(x_{r}^{\prime\prime}\right)\geq-2\eta

and because the point xr′′x_{r}^{\prime\prime} belongs to the interval (xr,xr+1)\left(x_{r}^{\prime},x_{r+1}^{\prime}\right) we remark again that the polynomial TnTnT_{n}-T_{n}^{\ast} has at least a maximum in this interval.

Actually, the polynomial TnTnT_{n}-T_{n}^{\ast} has at least one minimum in each of the intervals (x1,x2),(x2,x3)\left(x_{1}^{\prime},x_{2}^{\prime}\right),\ \left(x_{2}^{\prime},x_{3}^{\prime}\right)\ldots. In the same way we can prove that this polynomial has at least a (relative) minimum in each of the intervals (x1′′,x2′′)\left(x_{1}^{\prime\prime},x_{2}^{\prime\prime}\right), (x2′′,x3′′)\left(x_{2}^{\prime\prime},x_{3}^{\prime\prime}\right)\ldots. Our polynomial which is by hypothesis of degree nn and non identical null, has at least nn maxima and minima which is impossible.

It is now proved that the inequality (2.7) is true at least in one of the n+1n+1 intervals considered. Taking in such an interval n+1n+1 distinct points and working as in Sect. 1.7 we will se that the coefficients of the polynomial TnTnT_{n}-T_{n}^{\ast} are in absolute value less than a number 2ηλ2\eta\lambda, where λ\lambda is a fix number.It follows that

M(|TnTn|)<2ηλA,M\left(\left|T_{n}-T_{n}^{\ast}\right|\right)<2\eta\lambda A,

where

A=M(|x|n+|x|n1++1).A=M\left(\left|x\right|^{n}+\left|x\right|^{n-1}+\cdots+1\right).

If we take

η<ε2λA,\eta<\frac{\varepsilon}{2\lambda A},

we get

M(|TnTn|)<ε.M\left(\left|T_{n}-T_{n}^{\ast}\right|\right)<\varepsilon.

We can now state a result which will be called the second Borel’s Theorem:

For any positive number ε\varepsilon we can find another positive number δ\delta such that the inequality

M(|ff|)<δM\left(\left|f-f^{\ast}\right|\right)<\delta

implies

M(|TnTn|)<ε.M\left(\left|T_{n}-T_{n}^{\ast}\right|\right)<\varepsilon.

2.7 A consequence of the previous Theorem.

From the previous theorem an important consequence follows. Let’s suppose that a sequence of continuous functions

f1(x),f2(x),,fm(x),f_{1}\left(x\right),f_{2}\left(x\right),\ldots,f_{m}\left(x\right),\ldots (2.8)

converges uniformly to a continuous function f(x)f\left(x\right) in the whole interval [a,b]\left[a,b\right]. The second Borel’s Theorem states that for a given positive δ\delta, there exists a positive ε\varepsilon such that

M(|ff|)<δM\left(\left|f-f^{\ast}\right|\right)<\delta

implies

M|Tn(x;f)Tn(x;f)|<ε.M\left|T_{n}\left(x;f\right)-T_{n}\left(x;f^{\ast}\right)\right|<\varepsilon.

But due to the uniform convergence there exists a number AA such that for m>Am>A we have

M(|ffm|)<δM\left(\left|f-f_{m}\right|\right)<\delta

and thus for m>Am>A we also have

M|Tn(x;f)Tn(x;fm)|<ε.M\left|T_{n}\left(x;f\right)-T_{n}\left(x;f_{m}\right)\right|<\varepsilon.

Consequently we can formulate the following result: In the sequence (2.8) of continuous functions converges uniformly to the continuous function f(x)f\left(x\right), then the sequence of polynomials Tn(x;f1),Tn(x;f2),,Tn(x;fm),T_{n}\left(x;f_{1}\right),\ T_{n}\left(x;f_{2}\right),\ldots,T_{n}\left(x;f_{m}\right),\ldots converges to the best approximation polynomial of degree nn of the function f(x)f\left(x\right).

Of course the above sequence of polynomials is uniformly convergent. As a matter of fact, a sequence of polynomials of the same degree is also uniformly convergent in the whole interval [a,b]\left[a,b\right].

2.8 The computation of TnT_{n} polynomial.

The previous results enable us to compute the polynomial TnT_{n} with a desired approximation. If ff is a polynomial, the computation of TnT_{n} is a purely algebraic problem. Indeed, if in an interior point of the interval [a,b]\left[a,b\right] we have |fTn|=μn\left|f-T_{n}\right|=\mu_{n}, the derivative of the polynomial fTnf-T_{n} vanishes in this point. Let’s remark that the equality |fTn|=μn\left|f-T_{n}\right|=\mu_{n} can takes place in an extremum point aa or bb or even in both ends of the interval [a,b]\left[a,b\right]. The polynomial TnT_{n} and the quantity μn\mu_{n} will be determined from the system

{f(xr)Tn(xr;f)=(1)rρr=1,2,,n+2f(xr)T(xr;f)=0x1<x2<<xn+2\left\{\begin{array}[c]{c}f\left(x_{r}\right)-T_{n}\left(x_{r};f\right)=\left(-1\right)^{r}\rho\ r=1,2,\ldots,n+2\\ f^{\prime}\left(x_{r}\right)-T^{\prime}\left(x_{r};f\right)=0\ \ \ \ x_{1}<x_{2}<\cdots<x_{n+2}\end{array}\right. (2.9)

or from one of the systems obtained supposing one or both ends x1=a,xn+2=bx_{1}=a,x_{n+2}=b satisfied and suppressing from the second sequence of (2.9) the equations corresponding to these indices. The system (2.9) along with the other three obtained from this one determinates the coefficients of TnT_{n}, the value of ρ\rho, and the points xrx_{r}. The system is well defined, i.e., the number of unknowns equal the number of equations. These systems accept a certain number of solutions which can be found algebraically. From this set of solutions we know that a specific one provides the polynomial TnT_{n} and the best approximation μn\mu_{n}. From the following considerations will result that for a specified solution |ρ|\left|\rho\right| will have the maximum value and this solution will provide just the polynomial TnT_{n}.

We will prove below that for any continuous function f(x)f\left(x\right) and any positive δ\delta, we can find a polynomial P(x)P\left(x\right) such that

M(|fP|)<δ.M\left(\left|f-P\right|\right)<\delta.

Particularly, we can find a polynomial PP such that for a positive ε\varepsilon, mentioned a priori, we have

M(|Tn(x;f)Tn(x;P)|)<ε,M\left(\left|T_{n}\left(x;f\right)-T_{n}\left(x;P\right)\right|\right)<\varepsilon,

Consequently, we can compute with a desired approximation the polynomials of the best approximation for a continuous function.

2.9 The best approximation of xn+1x^{n+1}.

As an example, let’s compute the polynomial Tn(x;xn+1)T_{n}\left(x;x^{n+1}\right) in the interval (1,1)\left(-1,1\right). We immediately observe that μn\mu_{n} is attained even in the ends 1-1 or 11 because the derivative of fTnf-T_{n} is a polynomial of degree nn, which can not vanish in more than nn points. We must have

Pμn2=Q2(x21),P-\mu_{n}^{2}=Q^{2}\left(x^{2}-1\right),

where by definition P=xn+1Tn(x;xn+1)P=x^{n+1}-T_{n}\left(x;x^{n+1}\right) and QQ is a polynomial of degree nn.The last equation states the fundamental property of TnT_{n} polynomials. Differentiating this equality we get

P.P=QQ(x21)+xQ2=Q[Q(x21)+xQ].P.P^{\prime}=Q\cdot Q^{\prime}\left(x^{2}-1\right)+xQ^{2}=Q\left[Q^{\prime}\left(x^{2}-1\right)+xQ\right].

But PP is mutually prime with QQ, thus we can write P=λQP^{\prime}=\lambda Q, where λ\lambda is a constant (in fact λ=±(n+1)\lambda=\pm\left(n+1\right)).Thus we have

±λμn2P=P1x2\pm\lambda\sqrt{\mu_{n}^{2}-P}=P^{\prime}\sqrt{1-x^{2}}

or

dPμn2P2=±λdx1x2\frac{dP}{\sqrt{\mu_{n}^{2}-P^{2}}}=\frac{\pm\lambda dx}{\sqrt{1-x^{2}}}

and now one can see that PP has the form

P=±μncos(λarccosx+α)P=\pm\mu_{n}\cos\left(\lambda\arccos\ x+\alpha\right)

with α\alpha a constant. PP must be a polynomial of degree n+1n+1 with the first term xn+1x^{n+1} and thus α=0\alpha=0, and λ=n+1\lambda=n+1, i.e.,

xn+1Tn(x;xn+1)=12ncos(n+1¯.arccosx)x^{n+1}-T_{n}\left(x;x^{n+1}\right)=\frac{1}{2^{n}}\cos\left(\overline{n+1}.\arccos\ x\right)

and

μn(xn+1)=12n.\mu_{n}\left(x^{n+1}\right)=\frac{1}{2^{n}}.

The polynomial Tn(x;xn+1)T_{n}\left(x;x^{n+1}\right) corresponding to an arbitrary interval (a,b)\left(a,b\right) can be obtained using a linear transformation and thus we find

xn+1Tn(x;xn+1)=(b1)n+122n+1cos(n+1¯.arccos2xabba)x^{n+1}-T_{n}\left(x;x^{n+1}\right)=\frac{\left(b-1\right)^{n+1}}{2^{2n+1}}\cos\left(\overline{n+1}.\arccos\frac{2x-a-b}{b-a}\right) (2.10)

and

μnxn+1=(ba)n+122n+1.\mu_{n}x^{n+1}=\frac{\left(b-a\right)^{n+1}}{2^{2n+1}}.

The polynomial (2.10) is the one which differs at the least extent from zero among all the polynomials of degree n+1n+1 which have the first term xn+1x^{n+1}. This polynomial was for the first time determined by Chebyshev.

Chapter 3 The third lesson. The results of Ch. de la Vallée Poussin.

3.1 The best approximation on n+2n+2 points.

Let’s consider now a uniform function f(x)f\left(x\right) defined only on n+2n+2 points, namely

x1<x2<<xn+2.x_{1}<x_{2}<\cdots<x_{n+2}. (3.1)

The maximum M(|fP|)M\left(\left|f-P\right|\right) will be defined by formula

M(|fP|)\displaystyle M\left(\left|f-P\right|\right) =max(|f(xr)P(xr)|)\displaystyle=\max\left(\left|f\left(x_{r}\right)-P\left(x_{r}\right)\right|\right)
r\displaystyle r =1,2,,n+2.\displaystyle=1,2,\ldots,n+2.

For all polynomials P(x)P\left(x\right) of degree nn the expression M(|fP|)M\left(\left|f-P\right|\right) has a minimum denoted by ρn(f)\rho_{n}\left(f\right) or simply ρn\rho_{n}. It is easy to show that this minimum is attained by at least a polynomial of degree nn. We will denote with En(x)E_{n}\left(x\right) , or simply with EnE_{n}, such a polynomial. EnE_{n} is a polynomial of the best approximation of degree nn for the function ff on the n+2n+2 points of (E) and ρn\rho_{n} is the best approximation for ff using polynomials of degree nn on these n+2n+2 points.

Lat ξ0\xi_{0} a point on the left of x1x_{1}, ξn+2\xi_{n+2} a point on the right of xn+2x_{n+2}, and ξr\xi_{r} the middle point of the interval (xr,xr+1),r=1,2,,n+1\left(x_{r},x_{r+1}\right),\ r=1,2,\ldots,n+1.

From the sequence of points ξ0,ξ1,,ξn+1,ξn+2\xi_{0},\xi_{1},\ldots,\xi_{n+1},\xi_{n+2} we can choose another sequence ξ0,ξj1,,ξjm+1,ξn+2\xi_{0},\xi_{j_{1}},\ldots,\xi_{j_{m+1}},\xi_{n+2} which determinates mm consecutive intervals

L1,L2,,LmL_{1},L_{2},\ldots,L_{m}

with the following properties:

10.{}^{0}.

There exists at least one interval LsL_{s}.

20.{}^{0}.

Each interval contains points xrx_{r} where f=En=ρnf=E_{n}=\rho_{n} or points xrx_{r} where fEn=ρnf-E_{n}=-\rho_{n} but exclusively points of the same kind. If LsL_{s} contains a type of points then Ls1L_{s-1} and Ls+1L_{s+1} contain points of the opposite type. To fix the ideas let’s suppose that L1L_{1} contains points where fEn=ρnf-E_{n}=\rho_{n}.

It follows immediately that in the intervals L1,L3,L5,,L_{1},L_{3,}L_{5},\ldots, we have

ρn<fEnρn,-\rho_{n}<f-E_{n}\leq\rho_{n},

and in the intervals L2,L4,L6,,L_{2},L_{4},L_{6},\ldots, we have

ρnfEn<ρn.-\rho_{n}\leq f-E_{n}<\rho_{n}.

Let’s take now the polynomial of degree nn

Q(x)=(xξ1)(xξ2)(xξjm+1)(mn+1)Q\left(x\right)=\left(x-\xi_{1}\right)\left(x-\xi_{2}\right)\ldots\left(x-\xi_{j_{m+1}}\right)\ \ \ \ \left(m\leq n+1\right)

and we will determine the sign of the constant λ\lambda such that λQ>0\lambda Q>0 in the interval L1L_{1}. The points (E) being a finite set we immediately observe that we can take λ\lambda small enough in absolute value such that

ρn<fEnλQ<ρn,\rho_{n}<f-E_{n}-\lambda Q<\rho_{n},

and this implies the theorem:

If EnE_{n} is a best approximation polynomial of degree nn for the function ff on the n+2n+2 points of (E)(E), the difference fEnf-E_{n} takes equal and of contrary sign values in two consecutive points of (E)(E).

We neglect here and in the subsequent part the case ρn=0\rho_{n}=0. In this case there exists a polynomial of degree nn which takes on the values f(xr)f\left(x_{r}\right) in the points xrx_{r}.

3.2 The determination of the polynomial EnE_{n}.

The property proved above shows immediately that the polynomial EnE_{n} is uniquely determined. The computation of EnE_{n}, along with the best approximation ρn\rho_{n}, is carried out solving the system

En(xr)\displaystyle E_{n}\left(x_{r}\right) =f(xr)+(1)rpn(ρn=|ρ|)\displaystyle=f\left(x_{r}\right)+\left(-1\right)^{r}p_{n}^{\prime}\ \ \ \left(\rho_{n}=\left|\rho^{\prime}\right|\right)
r\displaystyle r =1,2,,n+2\displaystyle=1,2,\ldots,n+2

which must be compatible. In order to find explicitly ρn\rho_{n} and EnE_{n} we will use the notations introduced in Sect. 2.4 and Sect. 2.5 as well as formula (2.6). With these notations we have

ρn=(1)n+1U(x1,x2,,xn+2;f)r=1n+2V(x1,,xr1,xr1,,xn+2)\rho_{n}^{\prime}=\frac{\left(-1\right)^{n+1}U\left(x_{1},x_{2},\ldots,x\ _{n+2};f\right)}{\sum\limits_{r=1}^{n+2}V\left(x_{1},\ldots,x_{r-1},x_{r-1},\ldots,x_{n+2}\right)}\cdot (3.2)

The polynomial EnE_{n} will be determined using the LAGRANGE’s interpolation formula

En=r=1n+2[f(xr)+(1)rρn]G(x)(xxr)G(xr)E_{n}=\sum_{r=1}^{n+2}\left[f\left(x_{r}\right)+\left(-1\right)^{r}\rho_{n}^{\prime}\right]\frac{G\left(x\right)}{\left(x-x_{r}\right)G^{\prime}\left(x_{r}\right)}

where

G(x)=(xx1)(xx2)(xxn+2).G\left(x\right)=\left(x-x_{1}\right)\left(x-x_{2}\right)\ldots\left(x-x_{n+2}\right).

We also have

G(xr)=(1)n+2rV(x1,x2,,xn2)V(x1,,xr1,xr+1,,xn+2)G^{\prime}\left(x_{r}\right)=\frac{\left(-1\right)^{n+2-r}V\left(x_{1},x_{2},\ldots,x_{n-2}\right)}{V\left(x_{1},\ldots,x_{r-1},x_{r+1},\ldots,x_{n+2}\right)}

and thus we can write

En\displaystyle E_{n} =(1)n+2V(x1,x2,,xn+2)r=1n+2[ρn+(1)rf(xr)]\displaystyle=\frac{\left(-1\right)^{n+2}}{V\left(x_{1},x_{2},\ldots,x_{n+2}\right)}\sum_{r=1}^{n+2}\left[\rho_{n}^{\prime}+\left(-1\right)^{r}f\left(x_{r}\right)\right]\cdot
V(x1,,xr1,xr+1,,xn+2)G(r)xxr.\displaystyle\cdot V\left(x_{1},\ldots,x_{r-1},x_{r+1},\ldots,x_{n+2}\right)\frac{G\left(r\right)}{x-x_{r}}.

Apparently, this polynomial is of degree n+1n+1 but using (3.2) we see that the coefficient of xn+1x^{n+1} vanishes. The best approximation ρn\rho_{n} equals

ρn=|U(x1,x2,,xn+2)|r=1n+2V(x1,,xr1,xr+1,,xn2)\rho_{n}=\frac{\left|U\left(x_{1},x_{2},\ldots,x_{n+2}\right)\right|}{\sum\limits_{r=1}^{n+2}V\left(x_{1},\ldots,x_{r-1},x_{r+1},\ldots,x_{n-2}\right)} (3.3)

3.3 The first theorem of Ch. de la Vallée Poussin.

Let’s suppose now that a polynomial P(x)P\left(x\right) of degree nn is such that the numbers f(xr)P(xr)f\left(x_{r}\right)-P\left(x_{r}\right) , r=1,2,,n+2r=1,2,\ldots,n+2 are of alternative sign. We observe that the best approximation of ff equals that of fPf-P, and thus formula (3.3) becomes

ρn=r=1n+2|f(xr)P(xr)|V(x1,,xr1,xr+1,,xn+2)r=1n+2V(x1,,xr1,xr+1,,xn+2)\rho_{n}=\frac{\sum\limits_{r=1}^{n+2}\left|f\left(x_{r}\right)-P\left(x_{r}\right)\right|V\left(x_{1},\ldots,x_{r-1},x_{r+1},\ldots,x_{n+2}\right)}{\sum\limits_{r=1}^{n+2}V\left(x_{1},\ldots,x_{r-1},x_{r+1},\ldots,x_{n+2}\right)}

which is a mean value of the numbers |f(xr)P(xr)|\left|f\left(x_{r}\right)-P\left(x_{r}\right)\right| and thus we can state the result which will be called the the first Theorem of Ch. de la Vallée Poussin:

If a polynomial PP of degree nn is such that fPf-P takes on values of contrary sign in two consecutive points of (E) then we have

minr=1,2,,n+2(|f(xr)P(xr)|)<ρn<maxr=1,2,,n+2(|f(xr)P(xr)|)\min_{r=1,2,\ldots,n+2}\left(\left|f\left(x_{r}\right)-P\left(x_{r}\right)\right|\right)<\rho_{n}<\max_{r=1,2,\ldots,n+2}\left(\left|f\left(x_{r}\right)-P\left(x_{r}\right)\right|\right)

supposing that the numbers |f(xr)P(xr)|\left|f\left(x_{r}\right)-P\left(x_{r}\right)\right| are not mutually equal.

This property will be useful in concluding on the best approximation on a whole interval.

3.4 The second theorem of Ch. de la Vallée Poussin.

Let’s consider the function ff defined and continuous on the interval [a,b]\left[a,b\right]. The first Borel’s theorem assures the existence of a set (E)\left(E^{\ast}\right) of n+2n+2 points x1<x2<<xn+2x_{1}<x_{2}<\ldots<x_{n+2} such that the best approximation ρn\rho_{n}^{\ast} on these points equals the best approximation μn\mu_{n} of ff on [a,b]\left[a,b\right]. Let EnE_{n} be the polynomial of the best approximation of degree nn on (E)\left(E^{\ast}\right). If |fEn|ρn\left|f-E_{n}\right|\leq\rho_{n}^{\ast} in [a,b]\left[a,b\right] then, the best approximation ρn\rho_{n} on any set (E) of n+2n+2 points is at most equal with ρn\rho_{n}^{\ast}. Let’s suppose by contradiction that there exists a point xx^{\prime} such that f(x)E(x)|>ρnf\left(x\right)-E\left(x\right)|>\rho_{n}^{\ast}. If xx^{\prime} will be placed between xrx_{r} and xr+1x_{r+1} the difference fEnf-E_{n} has the same sign in xx^{\prime} as it takes on in xrx_{r} or xr+1x_{r+1}. Using the results from the previous Section, the best approximation is larger than ρn\rho_{n}^{\ast} on at least one of the sets of points

{x1,,xr,x,xr+2,,xn+2x1,,xr1,xxr+1,,xn+2.\left\{\begin{array}[c]{c}x_{1},\ldots,x_{r},x^{\prime},x_{r+2},\ldots,x_{n+2}\\ x_{1},\ldots,x_{r-1},x^{\prime}x_{r+1},\ldots,x_{n+2}.\end{array}\right. (3.4)

The same thing happens if xx^{\prime} is placed outside the interval (x1,xn+2)\left(x_{1},x_{n+2}\right).

On the other hand formula (3.3) shows that ρn\rho_{n} is a continuous function of x1,x2,,xn+2x_{1},x_{2},\ldots,x_{n+2} and must attains a maximum for at least a set (E)(E).

Taking again into account the first Borel’s theorem we can enounce the following property:

The best approximation μn\mu_{n} of a continuous function f(x)f\left(x\right) on an interval [a,b]\left[a,b\right] equals the best approximation on n+2n+2 points belonging to this interval, these points being chosen such that ρn\rho_{n} has the largest possible value. In other words

μn(f)=maxρn(f).\mu_{n}\left(f\right)=\max\rho_{n}\left(f\right).

This theorem is true even when the function f(x)f\left(x\right) is defined on a finite number of points or on a finite and closed arbitrary set.

3.5 Applications to functions with bounded differences.

In some cases formula (3.3) provides some refinements of the best approximation. We will say that the function f(x)f\left(x\right) has the nthnth divided difference bounded in the interval [a,b],\left[a,b\right], if quantity

[x1,x2,,xn+1;f],\left[x_{1},x_{2},\ldots,x_{n+1};f\right],

defined in Sect. 2.5, remains bounded whenever x1,x2,,xn+1x_{1},x_{2},\ldots,x_{n+1} are n+1n+1 arbitrary points in [a,b]\left[a,b\right].The number

Δn[f]=max(a,b)|[x1,x2,,xn+1;f]|\Delta_{n}\left[f\right]=\max_{\left(a,b\right)}\left|\left[x_{1},x_{2},\ldots,x_{n+1};f\right]\right|

is called the nthnth boundary or the boundary of order nn of ff in the interval [a,b]\left[a,b\right]. Supposing we have x1<x2<<xn+2x_{1}<x_{2}<\ldots<x_{n+2}, formula (3.3) can be written as

ρn=V(x1,x2,,xn+2)i=1n+2V(x1,,xi1,xi+1,,xn+2)|[x1,x2,,xn+2]|.\rho_{n}=\frac{V\left(x_{1},x_{2},\ldots,x_{n+2}\right)}{\sum\limits_{i=1}^{n+2}V\left(x_{1},\ldots,x_{i-1},x_{i+1},\ldots,x_{n+2}\right)}\left|\left[x_{1},x_{2},\ldots,x_{n+2}\right]\right|.

But

max[a,b]V(x1,x2,,x+2)i=1n+2V(x1,,xi1,xi+1,,xn+2)\max_{\left[a,b\right]}\frac{V\left(x_{1},x_{2},\ldots,x_{+2}\right)}{\sum\limits_{i=1}^{n+2}V\left(x_{1},\ldots,x_{i-1},x_{i+1},\ldots,x_{n+2}\right)}

equals the best approximation of xn+1x^{n+1} using polynomials of degree nn. This maximum equals (Sect. 2.9)

(ba)n+122n+1\frac{\left(b-a\right)^{n+1}}{2^{2n+1}}

and thus:

If the function f(x)f\left(x\right) has the (n+1)th(n+1)th divided difference bounded in the interval [a,b]\left[a,b\right] we have

μn(f)(ba)n+122n+1Δn+1[f].\mu_{n}\left(f\right)\leq\frac{\left(b-a\right)^{n+1}}{2^{2n+1}}\Delta_{n+1}\left[f\right].

Particularly, if ff admits a bounded derivative of order n+1n+1 and if we denote by Δ0[f(n+1)]\Delta_{0}\left[f^{\left(n+1\right)}\right] the maximum or the upper boundary of |f(n+1)|\left|f^{\left(n+1\right)}\right| in the interval (a,b)\left(a,b\right) we have

Δ0[f(n+1)]=(n+1)!Δn+1[f]\Delta_{0}\left[f^{\left(n+1\right)}\right]=\left(n+1\right)!\Delta_{n+1}\left[f\right]

and consequently

μn(f)(ba)n+122n+1(n+1)!Δ0[f(n+1)].\mu_{n}\left(f\right)\leq\frac{\left(b-a\right)^{n+1}}{2^{2n+1}\left(n+1\right)!}\Delta_{0}\left[f^{\left(n+1\right)}\right].

3.6 Oscillation modulus of a function.

In order to refine μn\mu_{n} as well as for the problem which follows in the next lesson we have to introduce the oscillation modulus ω(δ)\omega\left(\delta\right) of a function f(x)f\left(x\right). This modulus is a function of δ,\delta, and is defined by

ω(δ)=max|f(x)f(x′′)|\omega\left(\delta\right)=\max\left|f\left(x^{\prime}\right)-f\left(x^{\prime\prime}\right)\right|

whenever x,x′′x^{\prime},x^{\prime\prime} are two arbitrary points in the interval (a,b)\left(a,b\right), such that |xx′′|δ\left|x^{\prime}-x^{\prime\prime}\right|\leq\delta.

ω(δ)\omega\left(\delta\right) is a function defined for δ\delta in the interval 0<δba,0<\delta\leq b-a, non decreasing and which does not become negative. The following inequality is obvious

|f(x)f(x′′)|ε(|xx′′|).\left|f\left(x^{\prime}\right)-f\left(x^{\prime\prime}\right)\right|\leq\varepsilon\left(\left|x-x^{\prime\prime}\right|\right). (3.5)

The function ω(δ)\omega\left(\delta\right) enjoys some properties which will be recalled below. Given an ε>0\varepsilon>0, there exists a couple of two points x<x′′x^{\prime}<x^{\prime\prime} such that we have |xx′′|δ\left|x^{\prime}-x^{\prime\prime}\right|\leq\deltaand

ω(δ)ε<|f(x)f(x′′)|.\omega\left(\delta\right)-\varepsilon<\left|f\left(x^{\prime}\right)-f\left(x^{\prime\prime}\right)\right|.

Let’s divide the interval (x,x′′)\left(x^{\prime},x^{\prime\prime}\right) in kk subintervals of equal length using the nodes x=x0,x1,,xk1,xk=x′′x^{\prime}=x_{0},x_{1},\ldots,x_{k-1},x_{k}=x^{\prime\prime} and we will have

f(x)f(x′′)=i=1k[f(xi)f(xi+1)].f\left(x^{\prime}\right)-f\left(x^{\prime\prime}\right)=\sum_{i=1}^{k}\left[f\left(x_{i}\right)-f\left(x_{i+1}\right)\right].

From this equality, we get

|f(x)f(x′′)|kω(δk)\left|f\left(x^{\prime}\right)-f\left(x^{\prime\prime}\right)\right|\leq k\omega\left(\frac{\delta}{k}\right)

and thus

ω(δ)<kω(δk)+ε\omega\left(\delta\right)<k\omega\left(\frac{\delta}{k}\right)+\varepsilon

for any ε\varepsilon. kk being is a positive integer and letting kδk\delta instead of δ\deltawe eventually get

ω(kδ)kω(δ).\omega\left(k\delta\right)\leq k\omega\left(\delta\right).

If kk is a positive number and kk^{\prime} is the largest integer less or equal with kkwe can write

ω(kδ)ω(k+1δ¯)(k+1)ω(δ).\omega\left(k\delta\right)\leq\omega\left(\overline{k+1\delta}\right)\leq\left(k^{\prime}+1\right)\omega\left(\delta\right).

It follows that

ω(kδ)<(k+1)ω(δ)\omega\left(k\delta\right)<\left(k+1\right)\omega\left(\delta\right)

for any positive kk (of course δ\delta and kδk\delta must be <ba<b-a). Thus for δba\delta\leq b-a we can write

|f(x)g(x′′)|<[|xx′′1|δ+1]ε(δ).\left|f\left(x^{\prime}\right)-g\left(x^{\prime\prime}\right)\right|<\left[\frac{\left|x^{\prime}-x^{\prime\prime}1\right|}{\delta}+1\right]\varepsilon\left(\delta\right). (3.6)

Eventually, the necessary and sufficient condition for the continuity of ff is ω(δ)0\omega\left(\delta\right)\rightarrow 0, for δ0\delta\rightarrow 0.

  1. 29.
    • The upper limit ofμn\mu_{n}In the next lesson we will indicate the upper bound ofμn\mu_{n}. We want to indicate here a direct path which, if it could be followed to the end, could eventually give us the solution of this problem. The denominator of expression (22) can be written in the form

2D(x1,x2,,xn+2)2\mathrm{D}\left(x_{1},x_{2},\ldots,x_{n+2}\right)

where

D(x1,x2,,xn+2)=|x2x1x22x12x2nx1n(1)nx3x2x32x22x3nx2n1)n1....xn+1xnxn+1xn2xn+1nxnn1xn+2xn+1xn+22xn+12xn+2nxn+1n1|.\mathrm{D}\left(x_{1},x_{2},\ldots,x_{n+2}\right)=\left|\begin{array}[]{ccccc}x_{2}-x_{1}&x_{2}^{2}-x_{1}^{2}&\ldots&x_{2}^{n}-x_{1}^{n}&(-1)^{n}\\ x_{3}-x_{2}&x_{3}^{2}-x_{2}^{2}&\ldots&x_{3}^{n}-x_{2}^{n}&-1)^{n-1}\\ \ldots..&\ldots..&\ldots&\ldots&\ldots\\ x_{n+1}-x_{n}&x_{n+1}-x_{n}^{2}&\ldots x_{n+1}^{n}-x_{n}^{n}&-1\\ x_{n+2}-x_{n+1}&x_{n+2}^{2}-x_{n+1}^{2}&\ldots&x_{n+2}^{n}-x_{n+1}^{n}&1\end{array}\right|.

It is worth noting that the minors of the last column are positive, because we assume here toox1<x2<<xn+2x_{1}<x_{2}<\cdots<x_{n+2}.

We really have

|xandx1x22x1x2nx1nx3x2x32x22x3nx2nxandxand1xand2xand12xandnxand1nxand+2xand+1xand+22xand+12xand+2nxand+1nxn+2xn+1xn+22xn+12xn+2nxn+1n|=\displaystyle\left|\begin{array}[]{cccc}x_{i}-x_{1}&x_{2}^{2}-x_{1}^{\prime}&\cdots&x_{2}^{n}-x_{1}^{n}\\ x_{3}-x_{2}&x_{3}^{2}-x_{2}^{2}&\cdots&x_{3}^{n}-x_{2}^{n}\\ \cdots&\cdots&\cdots&\cdots\\ x_{i}-x_{i-1}&x_{i}^{2}-x_{i-1}^{2}&\cdots&x_{i}^{n}-x_{i-1}^{n}\\ x_{i+2}-x_{i+1}&x_{i+2}^{2}-x_{i+1}^{2}&\cdots&x_{i+2}^{n}-x_{i+1}^{n}\\ \cdots\cdots&\cdots&\cdots&\cdots\\ x_{n+2}-x_{n+1}&x_{n+2}^{2}-x_{n+1}^{2}&\cdots&x_{n+2}^{n}-x_{n+1}^{n}\end{array}\right|=
=n!x1x2xand1xandxand+1xand+2xn+1xn+2V(t1,t2,,tn)𝑑t1𝑑t2𝑑tn>0.\displaystyle=n!\int_{x_{1}}^{x_{2}}\cdots\int_{x_{i-1}}^{x_{i}}\int_{x_{i+1}}^{x_{i+2}}\cdots\int_{x_{n+1}}^{x_{n+2}}V\left(t_{1},t_{2},\ldots,t_{n}\right)dt_{1}dt_{2}\ldots dt_{n}>0.

If we subtract each line from the next and take into account (24),
we deduce

|x2x1x22x12x2nx1n(1)nω(x2x1)x3x2x32x22x3nx2n(1)n1ω(x3x2)xn+2xn+1xn+22xn+12xn+2nxn+1nω1(xn+2xn+1)|.\leq\left|\begin{array}[]{ccccc}x_{2}-x_{1}&x_{2}^{2}-x_{1}^{2}&\cdots&x_{2}^{n}-x_{1}^{n}&(-1)^{n}\omega\left(x_{2}-x_{1}\right)\\ x_{3}-x_{2}&x_{3}^{2}-x_{2}^{2}&\cdots&x_{3}^{n}-x_{2}^{n}&(-1)^{n-1}\omega\left(x_{3}-x_{2}\right)\\ \cdots\cdots&\cdots&\cdots&\cdots&\cdots\\ x_{n+2}-x_{n+1}&x_{n+2}^{2}-x_{n+1}^{2}&\cdots&x_{n+2}^{n}-x_{n+1}^{n}&\omega_{1}\left(x_{n+2}-x_{n+1}\right)\end{array}\right|.

Taking into account (25) and the previous observation, we deduce|U(x1,x2,,xn+2;f)|<|D1(x1,x2,.,xn+2)δ+D(x1,x2,,xn+2)|ω(δ)\left|\mathrm{U}\left(x_{1},x_{2},\ldots,x_{n+2};f\right)\right|<\left|\frac{\mathrm{D}_{1}\left(x_{1},x_{2},\ldots.,x_{n+2}\right)}{\delta}+\mathrm{D}\left(x_{1},x_{2},\ldots,x_{n+2}\right)\right|\omega(\delta).

We have here

D1(x1,x2,,xn+2)==|x2x1x22x12x2nx1nx3x2x32x22x3nx2n(1)n1(x2x1)xn+2xn+1xn+22xn+12xn+2nxn+1nxn+2xn+1|==2(n!)x1x2x2x3xn+1xn+2D(t1,t2,,tn+1)𝑑t1,dt2dtn+1.\begin{gathered}\mathrm{D}_{1}\left(x_{1},x_{2},\ldots,x_{n+2}\right)=\\ =\left|\begin{array}[]{cccc}x_{2}-x_{1}&x_{2}^{2}-x_{1}^{2}&\cdots&x_{2}^{n}-x_{1}^{n}\\ x_{3}-x_{2}&x_{3}^{2}-x_{2}^{2}&\cdots&x_{3}^{n}-x_{2}^{n}\\ \cdots&\cdots&(-1)^{n-1}\left(x_{2}-x_{1}\right)\\ x_{n+2}-x_{n+1}&x_{n+2}^{2}-x_{n+1}^{2}&\cdots x_{n+2}^{n}-x_{n+1}^{n}&x_{n+2}-x_{n+1}\end{array}\right|=\\ =2(n!)\int_{x_{1}}^{x_{2}}\int_{x_{2}}^{x_{3}}\cdots\int_{x_{n+1}}^{x_{n+2}}\mathrm{D}\left(t_{1},t_{2},\ldots,t_{n+1}\right)dt_{1},dt_{2}\ldots dt_{n+1}.\end{gathered}

So if we denote byθn\theta_{n}maximum of the quotient

D1(x1,x2,,xn+2)D(x1,x2,,xn+2),\frac{\mathrm{D}_{1}\left(x_{1},x_{2},\ldots,x_{n+2}\right)}{\mathrm{D}\left(x_{1},x_{2},\ldots,x_{n+2}\right)}, (26)

when the pointsx1<x2<<xn+2x_{1}<x_{2}<\cdots<x_{n+2}describe the range(A,b)(a,b), we have

μn(f)<[θn2δ+12]ω(δ).\mu_{n}(f)<\left[\frac{\theta_{n}}{2\delta}+\frac{1}{2}\right]\omega(\delta).

It can easily be seen thatθn<bA\theta_{n}<b-a, therefore takingδ=θn\delta=\theta_{n}, we find

μn(f)<ω(θn).\mu_{n}(f)<\omega\left(\theta_{n}\right).

Unfortunately, his determinationθn\theta_{n}seems to be a complicated problem. It is likely that fornn\rightarrow\inftythis number is of the order of1n\frac{1}{n}It would be interesting to demonstrate, as a first result, thatθn0\theta_{n}\rightarrow 0fornn\rightarrow\infty.

It can easily be shown that if two or more pointsxandx_{i}tend to be confused, expression (26) tends to 0. The ratio (26) is a
homogeneous function of degree 1 with respect tox1,x2,,xn+2x_{1},x_{2},\ldots,x_{n+2}and it depends only on the differencesxandxjx_{i}-x_{j}It follows that the maximum can only be reached forx1=A,xn+2=bx_{1}=a,x_{n+2}=b.

Either, in particular,n=2n=2We have dots.x1=A,x2=y,x3=x,x4=bx_{1}=a,x_{2}=y,x_{3}=x,x_{4}=band the ratio (26) is written

2(yA)(bx)(xy)(bA+xy)(xA)(by)(bA+yx).\frac{2(y-a)(b-x)(x-y)(b-a+x-y)}{(x-a)(b-y)(b-a+y-x)}.

To calculate the maximum, differential calculus can be applied. By canceling the logarithmic partial derivatives, we find

1bx+1xy+1bA+xy1xA+1bA+yx=01yA1xy1bA+xy+1by1bA+yx=0\begin{array}[]{r}-\frac{1}{b-x}+\frac{1}{x-y}+\frac{1}{b-a+x-y}-\frac{1}{x-a}+\frac{1}{b-a+y-x}=0\\ \frac{1}{y-a}-\frac{1}{x-y}-\frac{1}{b-a+x-y}+\frac{1}{b-y}-\frac{1}{b-a+y-x}=0\end{array}

By adding together, we find

bA(yA)(by)=bA(xA)(bx)\frac{b-a}{(y-a)(b-y)}=\frac{b-a}{(x-a)(b-x)}

or

(xy)(x+yAb)=0.(x-y)(x+y-a-b)=0.

The maximum is therefore obtained forx+y=A+bx+y=a+b, that is, Peterxxandyysymmetrical about the middle of the interval (A,ba,b). It is then found that æ must be the root contained betweenA+b2\frac{a+b}{2}andbbof the equation

((xA+b2)2+(bA)(xA+b2)(bA)24=0\left(\left(x-\frac{a+b}{2}\right)^{2}+(b-a)\left(x-\frac{a+b}{2}\right)-\frac{(b-a)^{2}}{4}=0\right.

so

x=A+b2+bA2(21)x=\frac{a+b}{2}+\frac{b-a}{2}(\sqrt{2}-1)

and

θ2=2(bA)(21)2.\theta_{2}=2(b-a)(\sqrt{2}-1)^{2}.

It is important to note that the pointsx,yx,ydo not rationally divide the interval (A,ba,b). Furthermore, the coefficients of the polynomial

(zx4)(zx2)(zx3)(zx4),\left(z-x_{4}\right)\left(z-x_{2}\right)\left(z-x_{3}\right)\left(z-x_{4}\right),

Whenx1,x2,x3,x4x_{1},x_{2},x_{3},x_{4}are the points for which the maximum is reached, are not rational with respect to a and b. This fact, which probably occurs for any n, is the main cause of the difficulty in determining the maximumθm\theta_{m}.

LESSON IV

Weierstrass's theorem

  1. 30.
    • Weierstrass's theorem. K. Weierstrass proved the following theorem ( 8 ):

Any continuous function on the interval (A,ba,b) is the limit of a sequence of polynomials, uniformly convergent in this interval.

The proof is not based on polynomial theory.𝟏n\mathbf{1}_{n}. However, it follows from this theorem that

μn(f)0, for n\mu_{n}(f)\rightarrow 0\text{, pentru }n\rightarrow\infty (27)

if the function is continuous.
It is obvious, moreover, that for any functionffHAVE

μ0μ1μn\mu_{0}\geq\mu_{1}\geq\cdots\geq\mu_{n}\geq\cdots

so the limit

limμn(f)=μ, for n.\lim\mu_{n}(f)=\mu,\text{ pentru }\quad n\rightarrow\infty.

there is and is0\geq 0If
μ=0\mu=0the polynomial sequenceTn\mathrm{T}_{n}converges absolutely and uniformly in (A,ba,b). It follows that for a discontinuous function there must bemμ0m\mu\neq 0Weierstrass's theorem tells us that for a continuous function we have certaintyμ=0\mu=0.

The important problem would be to prove the relation (27) directly, relying only on the properties of polynomialsTn\mathrm{T}_{n}. If for example it could be shown that the numberθn\theta_{n}defined in No. 29 tends to zero fornn\rightarrow\infty, the problem would be solved.

Before proving Weierstrass's theorem we will show a result of Mr. L. Tonelli in connection with such a direct proof.
31. - Mr. L. Tonelli's theorem. Suppose that the sequence of polynomials

T0(x;f),T1(x;f),,Tn(x;f),\mathrm{T}_{0}(x;f),\mathrm{T}_{1}(x;f),\ldots,\mathrm{T}_{n}(x;f),\ldots (28)

converges uniformly to a continuous functionF(x)F(x)and that we haveμ>0\mu>0, then

M(|fF|)M(|fTn|)+M(|FTn|)μn+M(|FTn|).\mathrm{M}(|f-\mathrm{F}|)\leq\mathrm{M}\left(\left|f-\mathrm{T}_{n}\right|\right)+\mathrm{M}\left(\left|\mathrm{~F}-\mathrm{T}_{n}\right|\right)\leq\mu_{n}+\mathrm{M}\left(\left|\mathrm{~F}-\mathrm{T}_{n}\right|\right).

It is easily deduced that

M(|fF|)μ.\mathrm{M}(|f-\mathrm{F}|)\leq\mu.

fFf-\mathrm{F}being a continuous function, we can determine aδ>0\delta>0thus, in any length intervalδ\leq\delta, the oscillation of this function is smaller thanμ\muOn the other hand we can find a numbern>bAδn>\frac{b-a}{\delta}so that we have

M(|FTn|)<ε<μ2.\mathrm{M}\left(\left|\mathrm{~F}-\mathrm{T}_{n}\right|\right)<\varepsilon<\frac{\mu}{2}.

We know that there is at leastn+2n+2points in carets±μn\pm\mu_{n}is alternatively reached and, from the way it was chosenn[n>bAδ]n\left[n>\frac{b-a}{\delta}\right], it follows that there are among thesen+2n+2at least two pointsx,x"x^{\prime},x^{\prime\prime}so that

|xx"|<δ,f(x)Tn(x)=μn,f(x")Tn(x")=μn,\begin{gathered}\left|x^{\prime}-x^{\prime\prime}\right|<\delta,\\ f\left(x^{\prime}\right)-\mathrm{T}_{n}\left(x^{\prime}\right)=\mu_{n},\quad f\left(x^{\prime\prime}\right)-\mathrm{T}_{n}\left(x^{\prime\prime}\right)=-\mu_{n},\end{gathered}

from where

f(x)F(x)\displaystyle f\left(x^{\prime}\right)-\mathrm{F}\left(x^{\prime}\right) =[f(x)Tn(x)]+[Tn(x)F(x)]>μnεμε>μ2;\displaystyle=\left[f\left(x^{\prime}\right)-\mathrm{T}_{n}\left(x^{\prime}\right)\right]+\left[\mathrm{T}_{n}\left(x^{\prime}\right)-\mathrm{F}\left(x^{\prime}\right)\right]>\mu_{n}-\varepsilon\geq\mu-\varepsilon>\frac{\mu}{2};
f(x")F(x")\displaystyle f\left(x^{\prime\prime}\right)-\mathrm{F}\left(x^{\prime\prime}\right) =[f(x")Tn(x")]+[Tn(x")F(x")]<μn+εμ+ε<μ2.\displaystyle=\left[f\left(x^{\prime\prime}\right)-\mathrm{T}_{n}\left(x^{\prime\prime}\right)\right]+\left[\mathrm{T}_{n}\left(x^{\prime\prime}\right)-\mathrm{F}\left(x^{\prime\prime}\right)\right]<-\mu_{n}+\varepsilon\leq-\mu+\varepsilon<-\frac{\mu}{2}.

It follows that the oscillation of the functionfFf-\mathrm{F}in the interval (x,x"x^{\prime},x^{\prime\prime}) is greater thanμ\mu, which is impossible. The hypothesisyou>0u>0so it is not good. Therefore we must haveμ=0\mu=0We have the following theorem of Mr. Tonelli:

If the series of polynomials (28) converges absolutely and uniformly to a function (necessarily continuous), this function coincides with f(x).
32. - Mr. S. Bernstein's polynomials. We will prove Weierstrass's theorem with the help of Mr. S. Bernstein's polynomials. We must therefore, first of all, give the definition of these polynomials.

Let's divide the interval (A,ba,b) innnequal parts and either

Aand=A+andbAn,and=0,1,,n(A0=A,An=b)a_{i}=a+i\frac{b-a}{n},\quad i=0,1,\ldots,n\quad\left(a_{0}=a,a_{n}=b\right)

points of division.
A polynomial of degree n whose coefficients depend linearly and homogeneously on thosen+1n+1importantf(Aand),and=0,1,,nf\left(a_{i}\right),i=0,1,\ldots,n, is called an interpolation polynomial of degreennof the functionf(x)f(x)We will study, in particular, the interpolation polynomial introduced by DI S. Bernstein (9)

Pn(x;f)=1(bA)nand=0n(nand)f(Aand)(xA)and(bx)nandP_{n}(x;f)=\frac{1}{(b-a)^{n}}\sum_{i=0}^{n}\binom{n}{i}f\left(a_{i}\right)(x-a)^{i}(b-x)^{n-i}

It is interesting to note how this polynomial can be obtained in a somewhat geometric way.

Whether𝐀𝐧𝐞𝐫0,𝐀𝐧𝐞𝐫1,,𝐀𝐧𝐞𝐫n\mathbf{A}_{0},\mathbf{A}_{1},\ldots,\mathbf{A}_{n}representative points of the functionf(x)f(x)forx=A0,A1,,Anx=a_{0},a_{1},\ldots,a_{n}that is, the points of forgivenessAand,f(Aand)a_{i},f\left(a_{i}\right)Let's build the polygonal lineA0A1An\mathrm{A}_{0}\mathrm{~A}_{1}\ldots\mathrm{~A}_{n}.

Let's take the sidesA0A1,A1A2,,An1An\mathrm{A}_{0}\mathrm{~A}_{1},\mathrm{~A}_{1}\mathrm{~A}_{2},\ldots,\mathrm{~A}_{n-1}\mathrm{~A}_{n}of the polyline

gonal pointsA0,A1,,An1A_{0}^{\prime},A_{1}^{\prime},\ldots,A_{n-1}^{\prime}which intersect these sides in the same direction and in the same ratio. We choose this ratio so that

A0A0=A1A1==An1An1=SnbAn,\mathrm{A}_{0}\mathrm{~A}_{0}^{\prime}=\mathrm{A}_{1}\mathrm{~A}_{1}^{\prime}=\cdots=\mathrm{A}_{n-1}\mathrm{~A}_{n-1}^{\prime}=\frac{s}{n}\cdot\frac{b-a}{n},

Ssbeing a whole,0Sn0\leq s\leq n. In the polygonal lineA0A1An1A_{0}^{\prime}A_{1}^{\prime}\ldots A_{n-1}^{\prime}we inscribe the polygonal line in the same wayA0"A1"An2"\mathrm{A}_{0}^{\prime\prime}\mathrm{A}_{1}^{\prime\prime}\ldots\mathrm{A}_{n-2}^{\prime\prime}preserving the meaning and the meaning of dividing the sides; therefore we have everything

A0A0"=A1A1"==An2An2"=SnbAn.\mathrm{A}_{0}^{\prime}\mathrm{A}_{0}^{\prime\prime}=\mathrm{A}_{1}^{\prime}\mathrm{A}_{1}^{\prime\prime}=\cdots=\mathrm{A}_{n-2}^{\prime}\mathrm{A}_{n-2}^{\prime\prime}=\frac{s}{n}\cdot\frac{b-a}{n}.

Continuing this process, we successively insert the polygonal linesA0(k)A1(k)Ank(k),k=3,4,,nA_{0}^{(k)}A_{1}^{(k)}\ldots A_{n-k}^{(k)},k=3,4,\ldots,nThe last one comes down to a point.A0(n)A_{0}^{(n)}We have

A0A0=A0A0"==A0(n1)A0(n)=SnbAn\mathrm{A}_{0}\mathrm{~A}_{0}^{\prime}=\mathrm{A}_{0}^{\prime}\mathrm{A}_{0}^{\prime\prime}=\cdots=\mathrm{A}_{0}^{(n-1)}\mathrm{A}_{0}^{(n)}=\frac{s}{n}\cdot\frac{b-a}{n}

so its abscissaA0(n)\mathrm{A}_{0}^{(n)}it is precisely

A+SbAn=AS.a+s\frac{b-a}{n}=a_{s}.

Let's note this point.A0(n)\mathrm{A}_{0}^{(n)}withAp\mathrm{A}_{p}^{*}, to highlight the numberSs, and let's calculate its ordinateAS\mathrm{A}_{s}^{*}Forand=0i=0andand=ni=npointA\mathrm{A}_{\text{\& }}coincides withA0\mathrm{A}_{0}andAn\mathrm{A}_{n}respectively. In general, let us denote bybSb_{s}his/her orderAS\mathrm{A}_{s}, withbR(k)b_{r}^{(k)}his/her orderAR(k)\mathrm{A}_{r}^{(k)}and withbSb_{s}^{*}his/her orderAS\mathrm{A}_{s}^{*}We have

bR(k)=(nS)bR(k1)+SbR+1(k1)n,R=0,1,,nk,k=1,2,,n1b_{r}^{(k)}=\frac{(n-s)b_{r}^{(k-1)}+sb_{r+1}^{(k-1)}}{n},r=0,1,\ldots,n-k,k=1,2,\ldots,n-1 (29)

and

bS=(nS)b0(n1)+Sb1(n1)n.b_{s}^{*}=\frac{(n-s)b_{0}^{(n-1)}+sb_{1}^{(n-1)}}{n}. (30)

From (29) we successively deduce

bR(1)=(nS)bR+SbR+1n\displaystyle b_{r}^{(1)}=\frac{(n-s)b_{r}+sb_{r+1}}{n}
bR(2)=(nS)2bR+2S(nS)bR+1+S2bR+2n2\displaystyle b_{r}^{(2)}=\frac{(n-s)^{2}b_{r}+2s(n-s)b_{r+1}+s^{2}b_{r+2}}{n^{2}}

and in general

bR(k)=1nkand=0k(kand)Sand(nS)kandbR+and,R=0,1,,nk.b_{r}^{(k)}=\frac{1}{n^{k}}\sum_{i=0}^{k}\binom{k}{i}s^{i}(n-s)^{k-i}b_{r+i},\quad r=0,1,\ldots,n-k.

Formula (30) therefore gives us

bS=1nnand=0n(nand)Sand(nS)nandband.b_{s}^{*}=\frac{1}{n^{n}}\sum_{i=0}^{n}\binom{n}{i}s^{i}(n-s)^{n-i}b_{i}.

Returning now to the polynomialPn(x;f)\mathrm{P}_{n}(x;f)we notice that we have:

Pn[A+SbAn;f]=1nnand=0n(nand)Sand(nS)nandf(Aand).P_{n}\left[a+s\frac{b-a}{n};f\right]=\frac{1}{n^{n}}\sum_{i=0}^{n}\binom{n}{i}s^{i}(n-s)^{n-i}f\left(a_{i}\right).

It follows that Mr. Bernstein's polynomialPn(x;f)\mathrm{P}_{n}(x;f)is the Lagrange polynomial that takes the valuesbSb_{s}^{*}for the pointsASa_{s}.
33.- Determining an upper limit for|f(x)Pn(x;f)|\left|f(x)-\mathrm{P}_{n}(x;f)\right|Let's determine an upper limit for|fPn(x;f)|\left|f-\mathrm{P}_{n}(x;f)\right|We note thatPn(x;1)1\mathrm{P}_{n}(x;1)\equiv 1, from which we deduce, using the oscillation modulusω(δ)\omega(\delta)defined in No. 27,

f\displaystyle\mid f- Pn(x;f)|=|1(bA)nand=0n(nand)[f(x)f(Aand)](xA)and(bx)nand|\displaystyle P_{n}(x;f)\left|=\left|\frac{1}{(b-a)^{n}}\sum_{i=0}^{n}\binom{n}{i}\left[f(x)-f\left(a_{i}\right)\right](x-a)^{i}(b-x)^{n-i}\right|\leq\right.
1(bA)nand=1n(nand)ω(|xAand|)(xA)and(bx)nand<\displaystyle\leq\frac{1}{(b-a)^{n}}\sum_{i=1}^{n}\binom{n}{i}\omega\left(\left|x-a_{i}\right|\right)(x-a)^{i}(b-x)^{n-i}<
<{1δ1(bA)nand=0n(nand)|xAand|(xA)(bx)nand+1}ω(0)\displaystyle<\left\{\frac{1}{\delta}\cdot\frac{1}{(b-a)^{n}}\sum_{i=0}^{n}\binom{n}{i}\left|x-a_{i}\right|(x-a)^{\prime}(b-x)^{n-i}+1\right\}\omega(0)

Let's put

ψ(x)=1(bA)nand=0n(nand)|xAand|(xA)and(bx)nt\psi(x)=\frac{1}{(b-a)^{n}}\sum_{i=0}^{n}\binom{n}{i}\left|x-a_{i}\right|(x-a)^{i}(b-x)^{n-t} (31)

and

Nn==MAXψ(x) in (A,b)\begin{gathered}\mathrm{N}_{n}==\max\psi(x)\\ \text{ in }(a,b)\end{gathered}

Finally,δ=2Nn\delta=2\mathrm{~N}_{n}, we deduce (it will be seen that actuallyδbA\delta\leq b-a)

|fPn(x;f)|<32ω(2Nn)\left|f-\mathrm{P}_{n}(x;f)\right|<\frac{3}{2}\omega\left(2\mathrm{~N}_{n}\right) (32)
  1. 34.
    • The approximation given by the polynomialPn(x;f)\mathrm{P}_{n}(x;f)We can now calculate the approximation given by the polynomialsPn(x;f)\mathrm{P}_{n}(x;f)Let's first calculate the functionψ(x)\psi(x)We have in the interval (Aj,Aj+1a_{j},a_{j+1}),

ψ(x)=1(bA)n\displaystyle\psi(x)=\frac{1}{(b-a)^{n}} and=0j(nj)(xAand)(xA)it(bx)nand+\displaystyle\sum_{i=0}^{j}\binom{n}{j}\left(x-a_{i}\right)(x-a)^{l}(b-x)^{n-i}+
+1(bA)nand=j+1n(nand)(Aandx)(xA)it(bx)nand=\displaystyle+\frac{1}{(b-a)^{n}}\sum_{i=j+1}^{n}\binom{n}{i}\left(a_{i}-x\right)(x-a)^{l}(b-x)^{n-i}=
=2(bA)nand=0j(nand)(xAand)xA)and(bx)nand\displaystyle\left.=\frac{2}{(b-a)^{n}}\cdot\sum_{i=0}^{j}\binom{n}{i}\left(x-a_{i}\right)^{\prime}x-a\right)^{i}(b-x)^{n-i}

because it is easy to pore that

and=0n(nand)(Aandx)(xA)and(bx)nand=0\sum_{i=0}^{n}\binom{n}{i}\left(a_{i}-x\right)(x-a)^{i}(b-x)^{n-i}=0

Doing the calculations, we find

ψ(x)=2(bA)n(n1,)(xA)j+1(bxjnj\psi(x)=\frac{2}{(b-a)^{n}}\binom{n-1}{,}(x-a)^{j+1}\left(b-x_{j}^{n-j}\right.

The maximum of this polynomial in the interval(Aj,Aj+1)\left(a_{j},a_{j+1}\right)is reached for

x=(j+1)b+(n1)An+1x=\frac{(j+1)b+(n-1)a}{n+1}

and has the value

2(bA)(n1)(j+1)j+1(nj)nj(n+1)n+1=2(bA)λj2(b-a)(n-1)\frac{(j+1)^{j+1}(n-j)^{n-j}}{(n+1)^{n+1}}=2(b-a)\lambda_{j} (33)

Function(x+1x)x+1\left(\frac{x+1}{x}\right)^{x+1}it is decreasing whenx1x\geq 1increases It results
that we have

(j+2j+1)j+2>(njnj1)nj for n>j+12\left(\frac{j+2}{j+1}\right)^{j+2}>\left(\frac{n-j}{n-j-1}\right)^{n-j}\quad\text{ pêntru }\quad n>\frac{j+1}{2}

or

λj+1>λj\lambda_{j+1}>\lambda_{j}

We deduce from this that (33) reaches its maximum forj=n2j=\frac{n}{2}orj=n12j=\frac{n-1}{2}asnnis it even or odd.

We have that but

Nn=2(bA)(n1n2)(n2+1)n2+1(n2)n2(n+1)n+1 for n hair Nn=bA2n(n12) for n odd. \begin{array}[]{ll}\mathrm{N}_{n}=2(b-a)\binom{n-1}{\frac{n}{2}}\frac{\left(\frac{n}{2}+1\right)^{\frac{n}{2}+1}\left(\frac{n}{2}\right)^{\frac{n}{2}}}{(n+1)^{n+1}}&\text{ pentru }n\text{ par }\\ \mathrm{N}_{n}=\frac{b-a}{2^{n}}\left(\frac{n-1}{2}\right)&\text{ pentru }n\text{ impar. }\end{array}

It is immediately demonstrated that

2n1N2n1>2n+1N2n+1N1=bA2,N3=bA4\begin{gathered}\sqrt{2n-1}\mathrm{~N}_{2n-1}>\sqrt{2n+1}\mathrm{~N}_{2n+1}\\ \mathrm{~N}_{1}=\frac{b-a}{2},\quad\mathrm{~N}_{3}=\frac{b-a}{4}\end{gathered}

from where

N2n+1<bA22n+1N2n+1=3(bA)42n+1 for n1\begin{array}[]{ll}\mathrm{N}_{2n+1}<\frac{b-a}{2\sqrt{2n+1}}&\\ \mathrm{~N}_{2n+1}=\frac{\sqrt{3(b-a)}}{4\sqrt{2n+1}}&\text{ pentru }n\geq 1\end{array}

Fornnwe seem to have

N2n=N2n+1(n+1)n+1nn(2n+1)2n+122n+1<N2n+122n+1(n+1)(2n+1)2n+1(2n+12)2n=\displaystyle\mathrm{N}_{2n}=\mathrm{N}_{2n+1}\frac{(n+1)^{n+1}n^{n}}{(2n+1)^{2n+1}}2^{2n+1}<\mathrm{N}_{2n+1}\cdot\frac{2^{2n+1}(n+1)}{(2n+1)^{2n+1}}\left(\frac{2n+1}{2}\right)^{2n}=
=\displaystyle= N2n+12(n+1)2n+13(bA)42n+12(n+1)2n+1=123(n+1)(Ab)(2n+1)2n+1<bA22n\displaystyle\mathrm{N}_{2n+1}\frac{2(n+1)}{2n+1}\leq\frac{\sqrt{3}(b-a)}{4\sqrt{2n+1}}\cdot\frac{2(n+1)}{2n+1}=\frac{1}{2}\cdot\frac{\sqrt{3}(n+1)(a-b)}{(2n+1)\sqrt{2n+1}}<\frac{b-a}{2\sqrt{2n}}

so in general

NnbA2n\mathrm{N}_{n}\leq\frac{b--a}{2\sqrt{n}}

Formula (32) therefore becomes

|fPn(x;f)|<32ω(bAn)\left|f-\mathrm{P}_{n}(x;f)\right|<\frac{3}{2}\omega\left(\frac{b-a}{\sqrt{n}}\right)

If the functionffis continuousω(bAn)0\omega\left(\frac{b-a}{\sqrt{n}}\right)\rightarrow 0fornn\rightarrow\inftyand Weierstrass' theorem is proved. Furthermore, it is seen that the best approximation of a continuous function by polynomials of degreenn, that is, the numberμn\mu_{n}, is at least of the order ofω(bAn)\omega\left(\frac{b-a}{\sqrt{n}}\right).

The approximation given by Mr. S. Bernstein's polynomials cannot be improved in general. For example, let the function

f2(x)=|xA+b2|f_{2}(x)=\left|x-\frac{a+b}{2}\right|

We have in thiscaseω(δ)=δ\operatorname{caz}\omega(\delta)=\deltaforδbA2\delta\leq\frac{b-a}{2}A
simple calculation shows that
d2Pn(x;f2)dx2=n(n1)(bA)nand=0n2(n2and)[f2(Aand)2f2(Aand+1)+f2(Aand+2)](xA)and(bx)nand\frac{d^{2}\mathrm{P}_{n}\left(x;f_{2}\right)}{dx^{2}}=\frac{n(n-1)}{(b-a)^{n}}\sum_{i=0}^{n-2}\binom{n-2}{i}\left[f_{2}\left(a_{i}\right)-2f_{2}\left(a_{i+1}\right)+f_{2}\left(a_{i+2}\right)\right](x-a)^{i}(b-x)^{n-i}from where

d2P2n(x;f2)dx2=d2P2n+1(x;f2)dx2=n(bA)2n1(2nn)[(xA)(bx)]n1.\frac{d^{2}\mathrm{P}_{2n}\left(x;f_{2}\right)}{dx^{2}}=\frac{d^{2}\mathrm{P}_{2n+1}\left(x;f_{2}\right)}{dx^{2}}=\frac{n}{(b-a)^{2n-1}}\binom{2n}{n}[(x-a)(b-x)]^{n-1}.

From this it follows thatP2n(x;f2)P2n+1(x;f2)\mathrm{P}_{2n}\left(x;f_{2}\right)\equiv\mathrm{P}_{2n+1}\left(x;f_{2}\right)and thatP2n(x;f2)\mathrm{P}_{2n}\left(x;f_{2}\right)is a convex function (in the usual sense) in the interval (A,ba,b). We therefore have

MAXin (A,b)|f2P2n(x;f2)|=P2n(A+b2;f2)f2(A+b2)==122nand=02n(nand)|AandA+b2|=bA22n+1(2nn)\begin{gathered}\max_{\text{in }(a,b)}\left|f_{2}-\mathrm{P}_{2n}\left(x;f_{2}\right)\right|=\mathrm{P}_{2n}\left(\frac{a+b}{2};f_{2}\right)-f_{2}\left(\frac{a+b}{2}\right)=\\ =\frac{1}{2^{2n}}\sum_{i=0}^{2n}\binom{n}{i}\left|a_{i}-\frac{a+b}{2}\right|=\frac{b-a}{2^{2n+1}}\binom{2n}{n}\end{gathered}

We now have

122n+1(2nn)2n>122n1(2n2n1)2n2\frac{1}{2^{2n+1}}\binom{2n}{n}\sqrt{2n}>\frac{1}{2^{2n-1}}\binom{2n-2}{n-1}\sqrt{2n-2}

so

122n+1(2nn)>12212n\frac{1}{2^{2n+1}}\binom{2n}{n}>\frac{1}{2\sqrt{2}}\frac{1}{\sqrt{2n}}

from where

MAXIn(A,b)|f2P2n(x;f2)|>12γ2¯bA2n=122ω(bA2n),\max_{\ln(a,b)}\left|f_{2}-\mathrm{P}_{2n}\left(x;f_{2}\right)\right|>\frac{1}{2\gamma\overline{2}}\cdot\frac{b-a}{\sqrt{2n}}=\frac{1}{2\sqrt{2}}\omega\left(\frac{b-a}{\sqrt{2n}}\right),

which proves our statement.
35. - Approximation of convex functions of higher order. Mr. Bernstein's polynomials still allow us to establish some
interesting results on the approximation of convex functions of higher order ()\left.{}^{\circ}\right)_{\text{a }}Using the notations from No. 17, let's put

Δkand=[Aand,Aand+1,Aand+k;f],and=0,1,,nk,k=1,2,\Delta_{k}^{i}=\left[a_{i},a_{i+1}\ldots,a_{i+k};f\right],\quad i=0,1,\ldots,n-k,k=1,2,\ldots

A simple calculation shows us that

dPn(x:f)dx=1(bA)n1and=0n1(n1and)Δ1and(xA)and(bx)n1÷and\frac{d\mathrm{P}_{n}(x:f)}{dx}=\frac{1}{(b-a)^{n-1}}\sum_{i=0}^{n-1}\binom{n-1}{i}\Delta_{1}^{i}(x-a)^{i}(b-x)^{n-1\div i}

and in general

dkPn(x:f)dxk\displaystyle\frac{d^{k}\mathrm{P}_{n}(x:f)}{dx^{k}} =k!(11n)(12n)\displaystyle=k!\left(1-\frac{1}{n}\right)\left(1-\frac{2}{n}\right)\cdots
(1k1n)1(bA)nkand=0n1(nkand)Δkand(xA)(bx)nkand\displaystyle\cdots\left(1-\frac{k-1}{n}\right)\frac{1}{(b-a)^{n-k}}\sum_{i=0}^{n-1}\binom{n-k}{i}\Delta_{k}^{i}(x-a)(b-x)^{n-k-i}

It is immediately seen that if the functionf(x)f(x)enjoys a property of determinate convexity, the polynomials of D. S. Bernstein enjoy the same property of convexity. We assume here that convexity proper and polynomiality are particular cases of non-concavity. The property follows from the definition of higher-order functions and from the fact that if a function is differentiable, the necessary and sufficient condition for it to be non-concave of order n is that its derivative of the ordern+1n+1not to become negative, etc.

We can however state the property:
A continuous functionff, which enjoys certain convexity properties, is the limit of a sequence of polynomials, uniformly convergent in the interval (A,ba,b) and which enjoy the same convexity property.
36. - Approximation of functions with bounded divided differences. We can also obtain some results on functions with bounded divided differences. Let us consider the relation

k!Δk[f]=Δ0[f(k)]k!\Delta_{k}[f]=\Delta_{0}\left[f^{(k)}\right]

defined at No. 26. For the polynomialPn(A;f)P_{n}{}^{\prime}(a;f)we will have

k!Δk[Pn]=Δ0[Pn(k)]k!\Delta_{k}\left[\mathrm{P}_{n}\right]=\Delta_{0}\left[\mathrm{P}_{n}^{(k)}\right]

whence, taking into account (34),

Δk[Pn]<(11n)(12n)(1k1n)Δk[f]\Delta_{k}\left[\mathrm{P}_{n}\right]<\left(1-\frac{1}{n}\right)\left(1-\frac{2}{n}\right)\ldots\left(1-\frac{k-1}{n}\right)\Delta_{k}[f]

It can still be written.

Δk[Pn]Δk[f],k=0,1;Δk[Pn]<Δk[f],k>1\Delta_{k}\left[\mathrm{P}_{n}\right]\leq\Delta_{k}[f],\quad k=0,1;\quad\Delta_{k}\left[\mathrm{P}_{n}\right]<\Delta_{k}[f],\quad k>1

We have the property:
A continuous functionffwhich is with a bounded divided difference, is the limit of a sequence of polynomials, uniformly convergent in the interval a, b), which have the limits of order 0 and 1 at most equal to those of the function and the limits of order>1>1smaller than that of the function.
37. Approximation of functions with bounded variation. Let

x1<x2<<xm(mn+1)x_{1}<x_{2}<\cdots<x_{m}\quad(m\geq n+1)

a series of points in the interval(A,b)(a,b)Number

Vm=and=1mn1|[xand+1,xand+2,,xand+n+1;f][xand,xand+1,,xand+n;f]|v_{m}=\sum_{i=1}^{m-n-1}\left|\left[x_{i+1},x_{i+2},\ldots,x_{i+n+1};f\right]-\left[x_{i},x_{i+1},\ldots,x_{i+n};f\right]\right|

it is called anAn^{a}variation off(x)f(x)on the pointsxandx_{i}considered.
If we put

MAXin (A,b)Vm=Vn[f]\max_{\text{in }(a,b)}v_{m}=V_{n}[f]

the maximum being taken when both the points varyxandx_{i}as well as their number, the numberVn[f]V_{n}[f]it is called anAn^{a}total variation off(x)f(x)in the interval (A,ba,b). IfVn[f]\mathrm{V}_{n}[f]is a finite number the function is said to be with a n bounded variation.

We also have the relationship here

k!Vk[f]=V0[f(k)]k!\mathrm{V}_{k}[f]=\mathrm{V}_{0}\left[f^{(k)}\right]

as well as the formula

V0[f]=Ab|f|𝑑xV_{0}[f]=\int_{a}^{b}\left|f^{\prime}\right|dx

well known from the theory of functions with bounded variation (of order 0).
For polynomialsPn(x;f)\mathrm{P}_{n}(x;f)HAVE

Vk[Pn(k)]=1k!Ab|Pn(k+1)|𝑑x\mathrm{V}_{k}\left[\mathrm{P}_{n}^{(k)}\right]=\frac{1}{k!}\int_{a}^{b}\left|\mathrm{P}_{n}^{(k+1)}\right|dx

Taking into account formula (34), we deduce

Vk[Pn]k+1(bA)nk1(11n)(12n)(1kn)=1nk1(nk1and)|Δk+1and|Ab(xA)and(bx)nk1and𝑑x\begin{gathered}\mathrm{V}_{k}\left[\mathrm{P}_{n}\right]\leq\frac{k+1}{(b-a)^{n-k-1}}\left(1-\frac{1}{n}\right)\left(1-\frac{2}{n}\right)\cdots\left(1-\frac{k}{n}\right)\\ \sum_{=1}^{n-k-1}\binom{n-k-1}{i}\left|\Delta_{k+1}^{i}\right|\int_{a}^{b}(x-a)^{i}(b-x)^{n-k-1-i}dx\end{gathered}

But

Ab(xA)and(bx)nk1and𝑑x=(bA)nkand!(nandk1)!(nk)!=(bA)nk(nk)(nk1and)\displaystyle\int_{a}^{b}(x-a)^{i}(b-x)^{n-k-1-i}dx=(b-a)^{n-k}\frac{i!(n-i-k-1)!}{(n-k)!}=\frac{(b-a)^{n-k}}{(n-k)\binom{n-k-1}{i}}
so
Vk[Pn](11n)(12n)(1k1n)(k+1)(bA)nand=1nk1|Δk+1and|.\mathrm{V}_{k}\left[\mathrm{P}_{n}\right]\leq\left(1-\frac{1}{n}\right)\left(1-\frac{2}{n}\right)\cdots\left(1-\frac{k-1}{n}\right)\frac{(k+1)(b-a)}{n}\sum_{i=1}^{n-k-1}\left|\Delta_{k+1}^{i}\right|.

But we also have the relationship

(k+1)(bA)nΔk+1and=ΔkandΔkand+1\frac{(k+1)(b-a)}{n}\Delta_{k+1}^{i}=\Delta_{k}^{i}-\Delta_{k}^{i+1}

therefore

(k+1)(bA)nand=1nk1|Δk+1and|=and=1nk1|ΔkandΔkand+1|Vk[f]\frac{(k+1)(b-a)}{n}\sum_{i=1}^{n-k-1}\left|\Delta_{k+1}^{i}\right|=\sum_{i=1}^{n-k-1}\left|\Delta_{k}^{i}-\Delta_{k}^{i+1}\right|\leq\mathrm{V}_{k}[f]

We therefore deduce that

Vk[Pn]Vk[f];k=0,1;Vk[Pn]<Vk[f],k>1,\mathrm{V}_{k}\left[\mathrm{P}_{n}\right]\leq\mathrm{V}_{k}[f];\quad k=0,1;\quad\mathrm{V}_{k}\left[\mathrm{P}_{n}\right]<\mathrm{V}_{k}[f],\quad k>1,

We have the property:
A functionffcontinue withnAn^{a}Bounded variation is the limit of: a sequence of polynomials, uniformly convergent in the interval (A,ba,b), which have: the total variations of order 0 and 1 at most equal to those of the function and the total variations of order>1>1smaller than those of the function.
38. - Approximation of differentiable functions. Let us finally see what results Mr. Bernstein's polynomials for differentiable functions lead us to. Let us therefore assume that the functionf(x)f(x)has a continuous derivative of orderkkand beωk(δ)\omega_{k}(\delta)the oscillation modulus of this derivative. We know that we have the generalized average formula

k!Δkand=f(k)(A+bAn(and+θk)),0<θ<1,k!\Delta_{k}^{i}=f(k)\left(a+\frac{b-a}{n}(i+\theta k)\right),\quad 0<\theta<1,

using the notations above.
We deduce from this that

|k!Δkandf(k)(x)|ωk(|xAbAn(and+θk)|)\displaystyle\left|k!\Delta_{k}^{i}-f^{(k)}(x)\right|\leq\omega_{k}\left(\left|x-a-\frac{b-a}{n}(i+\theta k)\right|\right)\leq
ωk(MAX(|xAand|,|xAand+1|,,|xAand+k|))\displaystyle\leq\omega_{k}\left(\max\left(\left|x-a_{i}\right|,\left|x-a_{i+1}\right|,\ldots,\left|x-a_{i+k}\right|\right)\right)

Now let the polynomial be,

Qn,k(x;f)=Pn(k)(x;f)(11n)(12n)(1k1n)=\displaystyle Q_{n,k}(x;f)=\frac{\mathrm{P}_{n}^{(k)}(x;f)}{\left(1-\frac{1}{n}\right)\left(1-\frac{2}{n}\right)\cdots\left(1-\frac{k-1}{n}\right)}=
=1(bA)nkand=0nk(nkand)k!Δkand(xA)and(bx)nkand.\displaystyle=\frac{1}{(b-a)^{n-k}}\sum_{i=0}^{n-k}\binom{n-k}{i}k!\Delta_{k}^{i}(x-a)^{i}(b-x)^{n-k-i}.

we

|f(k)Qn,k(x;f)|<{1δ(1(bA)nkand=0S(nkand)|xAand|(xA)and(bx)nkand+\displaystyle\left|f^{(k)}-Q_{n,k}(x;f)\right|<\left\{\frac{1}{\delta}\left(\frac{1}{(b-a)^{n-k}}\sum_{i=0}^{s}\binom{n-k}{i}\left|x-a_{i}\right|(x-a)^{i}(b-x)^{n-k-i}+\right.\right.
+1(bA)nkand=S+1nk(nkand)|xAand+k|(xA)and(bx)nkand)+1}ωk(δ)\displaystyle\left.\left.\quad+\frac{1}{(b-a)^{n-k}}\sum_{i=s+1}^{n-k}\binom{n-k}{i}\left|x-a_{i+k}\right|(x-a)^{i}(b-x)^{n-k-i}\right)+1\right\}\omega_{k}(\delta)

whereSsis determined in the following way:

S=jk2 if k is even and AjxAj+1S=jk+12 if k is odd and AjxAj+Aj+12S=jk12 if k is odd and Aj+Aj+12xAj+1.\begin{array}[]{lll}s=j-\frac{k}{2}&\text{ dacă }k\text{ este par şi }&a_{j}\leq x\leq a_{j+1}\\ s=j-\frac{k+1}{2}&\text{ dacă }k\text{ este impar şi }&a_{j}\leq x\leq\frac{a_{j}+a_{j+1}}{2}\\ s=j-\frac{k-1}{2}&\text{ dacă }k\text{ este impar şi }&\frac{a_{j}+a_{j+1}}{2}\leq x\leq a_{j+1}.\end{array}

Of course, if in these formulas we haveS<0s<0orSnks\geq n-k, the first or second term in the second parenthesis disappears.

Noting that

|xAand+1||xAand|+|Aand+kAand||xAand|+k(bA)n\left|x-a_{i+1}\right|\leq\left|x-a_{i}\right|+\left|a_{i+k}-a_{i}\right|-\left|x-a_{i}\right|+\frac{k(b-a)}{n}

we can also write
|f(k)Qn,k(x;f)|<|1δ(1(bA,nkand=0nk(nkand)|xAand|(xA)and(bx)nkand+ψ1(x))+1|ωk(δ)\left|f^{(k)}-Q_{n,k}(x;f)\right|<\left|\frac{1}{\delta}\left(\frac{1}{\left(b-a,^{n-k}\right.}\sum_{i=0}^{n-k}\binom{n-k}{i}\left|x-a_{i}\right|(x-a)^{i}(b-x)^{n-k-i}+\psi_{1}(x)\right)+1\right|\omega_{k}(\delta)
where

ψ1(x)=kn1(bA)nk1and=S+1nk(nkand)(xA)and(bx)nkand\psi_{1}(x)=\frac{k}{n}\cdot\frac{1}{(b-a)^{n-k-1}}\sum_{i=s+1}^{n-k}\binom{n-k}{i}(x-a)^{i}(b-x)^{n-k-i}

and we have to takeψ1(x)0\psi_{1}(x)\equiv 0ifSnks\geq n-k
We now have

1(bA)nkand=0nk(nkand)|xAand|(xA)and(bx)nkand\displaystyle\frac{1}{(b-a)^{n-k}}\sum_{i=0}^{n-k}\binom{n-k}{i}\left|x-a_{i}\right|(x-a)^{i}(b-x)^{n-k-i}\leq (35)
1(bA)nkand=0nk(nkand)|xAandbAnk|(xA)and(bx)nkand+\displaystyle\leq\frac{1}{(b-a)^{n-k}}\sum_{i=0}^{n-k}\binom{n-k}{i}\left|x-a-i\frac{b-a}{n-k}\right|(x-a)^{i}(b-x)^{n-k-i}+
+kn(nk)(bA)nk1and=0nk(nkand)and(xA)and(bx)nkand\displaystyle+\frac{k}{n(n-k)(b-a)^{n-k-1}}\sum_{i=0}^{n-k}\binom{n-k}{i}i(x-a)^{i}(b-x)^{n-k-i}

which immediately results from the relationship

xAand=xAandbAn+andk(bA)n(nk).x-a_{i}=x-a-i\frac{b-a}{n}+i\frac{k(b-a)}{n(n-k)}.

But we know from No. 34 that
1(bA)nkand=0nk(nkand)|xAandbAnk|(xA)and(bx)nkandbA2nk\frac{1}{(b-a)^{n-k}}\sum_{i=0}^{n-k}\binom{n-k}{i}\left|x-a-i\frac{b-a}{n-k}\right|(x-a)^{i}(b-x)^{n-k-i}\leq\frac{b-a}{2\sqrt{n-k}}.
On the other hand

kn(nk)(bA)nk1and=0nk(nkand)and(xA)and(bx)nkand=k(xA)n\frac{k}{n(n-k)(b-a)^{n-k-1}}\sum_{i=0}^{n-k}\binom{n-k}{i}i(x-a)^{i}(b-x)^{n-k-i}=\frac{k(x-a)}{n}

and we see that the first member of relation (35) is

bA2nk+k(bA)n.\leqslant\frac{b-a}{2\sqrt{n-k}}+\frac{k(b-a)}{n}.

We now have obviously and

ψ1(x)kn1(bA)nk1and=0nk(nkand)(xA)and(bx)nkand=k(bA)n\psi_{1}(x)\leq\frac{k}{n}\cdot\frac{1}{(b-a)^{n-k-1}}\sum_{i=0}^{n-k}\binom{n-k}{i}(x-a)^{i}(b-x)^{n-k-i}=\frac{k(b-a)}{n}

from which it follows that

|f(k)Qn,k(x;f)|<|1δ(bA2nk+2k(bA)n)+1}ωk(δ)\left.\left|f^{(k)}-Q_{n,k}(x;f)\right|<\left\lvert\,\frac{1}{\delta}\left(\frac{b-a}{2\sqrt{n-k}}+\frac{2k(b-a)}{n}\right)+1\right.\right\}\omega_{k}(\delta)

or, puttingδ=bAnk\delta=\frac{b-a}{\sqrt{n-k}},
(36)|f(k)Qn,k(x;f)|<(32+2knkn)ωk(bAnk)\left|f^{(k)}-Q_{n,k}(x;f)\right|<\left(\frac{3}{2}+2\frac{k\sqrt{n-k}}{n}\right)\omega_{k}\left(\frac{b-a}{\sqrt{n-k}}\right)\leq

3+2k2ωk(bAnk)(nk+1).\leq\frac{3+2\sqrt{k}}{2}\omega_{k}\left(\frac{b-a}{\sqrt{n-k}}\right)\quad(n\geq k+1).
  1. 39.
    • Convergence of derivatives of Mr. Bernstein's polynomials.

Derivative of the orderkkof the functionffbeing assumed continuous, the upper edgeΔ0[f(k)]\Delta_{0}\left[f^{(k)}\right]is finite. We have .

f(k)Pn(k)(x;f)=\displaystyle f^{(k)}-\mathrm{P}_{n}^{(k)}(x;f)= f(k)Qn,k(x;f)+\displaystyle f^{(k)}-\mathrm{Q}_{n,k}(x;f)+
+[1(11n)(12n)(1k1n)]Qn,k(x;f)\displaystyle+\left[1-\left(1-\frac{1}{n}\right)\left(1-\frac{2}{n}\right)\cdots\left(1-\frac{k-1}{n}\right)\right]\mathrm{Q}_{n,k}(x;f)

The results from No. 36 show us that

|Qn,k(x;f)|Δ0[f(k)]\left|Q_{n,k}(x;f)\right|\leq\Delta_{0}\left[f^{(k)}\right]

and on the other hand we have the inequality

1(11n)(12n)(1k1n)k(k1)2n1-\left(1-\frac{1}{n}\right)\left(1-\frac{2}{n}\right)\cdots\left(1-\frac{k-1}{n}\right)\leq\frac{k(k-1)}{2n}

Taking into account formula (36), we deduce

|f(k)Pn(k)(x;f)|<3+2k2ωk(bAnk)+k(k1)2nΔ0[f(k)]\left|f^{(k)}-\mathrm{P}_{n}^{(k)}(x;f)\right|<\frac{3+2\sqrt{k}}{2}\omega_{k}\left(\frac{b-a}{\sqrt{n-k}}\right)+\frac{k(k-1)}{2n}\Delta_{0}\left[f^{(k)}\right] (37)

which shows us that:
If the functionf(x)f(x), defined in the interval (A,ita,l), is continuous with the firstkkits derivations, the polynomial sequencesPn(x;f)\mathrm{P}_{n}(x;f),P(x;f)n,Pn(k)(x;f)\mathrm{P}^{\prime}{}_{n}(x;f),\ldots\mathrm{P}_{n}^{(k)}(x;f)tend absolutely and uniformly towardsf(x),f(x),,f(k)(x)f(x),f^{\prime}(x),\ldots,f^{(k)}(x)respectively, throughout the interval (A,ba,b).

DI E. Borel first posed the problem of finding a sequence of polynomials uniformly convergent to a continuous functionf(x)f(x), so that the series formed with derivatives of a given orderkkof these mixed polynomials be uniformly convergent to the derivativef(k)(x)f^{(k)}(x), assumed continuous, of the functionf(x)f(x). As can be seen, Mr. Bernstein's polynomials solve this problem in an elegant way. This qualitative result is due to Mr. S. Wigert 11 ).

In particular, for the first-order derivative, the second term in the second member of inequality (37) vanishes so that

|fPn(x;f)|<52ω1(bAn1)\left|f^{\prime}-\mathrm{P}_{n}^{\prime}(x;f)\right|<\frac{5}{2}\omega_{1}\left(\frac{b-a}{\sqrt{n-1}}\right)

We can still observe that iff(k)(x)f^{(k)}(x)verifies a Lipschitz condition: ordinary, its approximationf(k)f^{(k)}byPn(k)(x;f)\mathrm{P}_{n}^{(k)}(x;f)it is his order1n\frac{1}{\sqrt{n}}therefore of the same order as the approximation byPn(x;f(k))\mathrm{P}_{n}\left(x;f^{(k)}\right).

Mr. S. Bernstein's polynomials also enjoy numerous properties which have been studied mainly by Mr. Bernstein himself as well as by his students.
40. - The upper limit ofμn\mu_{n}We saw in No. 28 that the best approximation by polynomials of degreennof a continuous functionf(x)f(x)is, in general, at least of the order ofω(bAn)\omega\left(\frac{b-a}{n}\right)where

ω(δ)\omega(\delta)is the oscillation modulus off(x)f(x)Mr. D. Jackson demonstrated for the first time thatμn\mu_{n}it is even his orderω(bAn)(12)\omega\left(\frac{b-a}{n}\right){}^{(12)}. Various proofs of this result are known. We will not insist here on these proofs, however. One can usefully consult the cited book by Mr. Ch. de la Vallée Poussin (13) ). It would be interesting to see if the numberθn\theta_{n}defined in No. 29 is not really of the order1n\frac{1}{n}In this case the polynomialsTn\mathrm{T}_{n}would be sufficient for the demonstration, both qualitatively and quantitatively, of Weierstrass's theorem.

LECTURE V

The case of functions of two independent variables

  1. 41.
    • The problem of the best approximation for a function of two real variables. The preceding results can be extended, to a large extent, to functions of more than one and in particular to those of two real variables. We will briefly examine this generalization. It is important to note that uniqueness no longer holds in general if the function is continuous.

So let's take a real functionf(x,y)f(x,y)of two real variablesxxandyy, uniform and defined in a certain bounded and closed domain (D). To keep things simple, we will assume that this domain is bounded by a simple and closed curve. The domain (D) can be, for example, a rectangle

Axb,cyd.a\leq x\leq b,\quad c\leq y\leq d. (38)

functionf(x,y)f(x,y)will be assumed continuous in (D).
The problem is posed as for the case of functions of a single variable.

We consider the set of polynomials

P(x,y)=A00+A10x+A01y++An0xn+An11xn1y++A0ny"\mathrm{P}(x,y)=a_{00}+a_{10}x+a_{01}y+\cdots+a_{n0}x^{n}+a_{n-11}x^{n-1}y+\cdots+a_{0n}y^{\prime\prime}

of two variablesxxandyyof the degreennA polynomial of the set is completely determined by the coefficientsAandja_{ij}.

We still note withM(f)\mathrm{M}(f)the maximum or upper bound of the functionf(x,y)f(x,y)in the domain (D). The error or approximation with which the polynomialP(x,y)\mathrm{P}(x,y)represents the functionf(x,y)f(x,y)is equal, by definition, toM(|fP|)\mathrm{M}(|f-\mathrm{P}|). The best approximation of the functionf(x,y)f(x,y)by polynomials of degreennis equal, by definition, to the lower edgeμn(f)\mu_{n}(f)or simplerμn\mu_{n}his/herM(|fP|)\mathrm{M}(|f-\mathrm{P}|)WhenP(x,y)\mathrm{P}(x,y)traverses the set of polynomials of degreenn.

The problem that must now be examined is posed as for functions of a single variable:

Given the functionf(x,y)f(x,y), to determine the polynomials of degree n for whichM(|fP|)\mathrm{M}(|f-\mathrm{P}|)reaches its lower edgeμn\mu_{n}and to study this numberμn\mu_{n}.

Problema existenţei, a unicității și principalele proprietăţi ale polinoamelor de cea mai bună aproximatie au fost examinate de Dl L. Tonelli ( 14 ).

Un polinom pentru care minimul μn\mu_{n} este atins se poate şi aici numi un polinom de cea mai bună aproximatie de gradul nn al funcției ff si se poate nota cu Tn(x,y;f)\mathrm{T}_{n}(x,y;f) sau mai simplu cu Tn\mathrm{T}_{n}. Vom zice si aici că un astfel de polinom este un polinom Tn\mathrm{T}_{n}.

In ce priveşte numărul μn\mu_{n} el este pozitiv sau nul și de altfel nu se poate anula decât dacă f(x,y)f(x,y) coincide cu un polinom de gradul nn. In cele ce urmează vom presupune că suntem in cazul μn>0\mu_{n}>0.

Dacă P(x,y)\mathrm{P}(x,y) este un polinom Tn\mathrm{T}_{n} al functiei f(x,y)f(x,y), polinomul P(x,y)+Q(x,y)\mathrm{P}(x,y)+\mathrm{Q}(x,y), unde Q(x,y)\mathrm{Q}(x,y) este un polinom de gradul nn, este un polinom Tn\mathrm{T}_{n} al funcției f(x,y)+Q(x,y)f(x,y)+Q(x,y). Reciproc, orice polinom Tn\mathrm{T}_{n} al functiei f(x,y)+Q(x,y)f(x,y)+Q(x,y) este de forma P(x,y)+Q(x,y)\mathrm{P}(x,y)+Q(x,y). Avem

μn(f+Q)=μn(f)\mu_{n}(f+Q)=\mu_{n}(f)

Deasemenea, C find o constantă, CP(x,y)\mathrm{CP}(x,y) este un polinom Tn\mathrm{T}_{n} al functiei Cf(x,y)\mathrm{C}f(x,y) și reciproc. orice polinom Tn\mathrm{T}_{n} al lui Cf(x,y)\mathrm{C}f(x,y) este de forma CP(x,y)\operatorname{CP}(x,y). Avem

μn(Cf)=|C|μn(f).\mu_{n}(\mathrm{C}f)=|\mathrm{C}|\mu_{n}(f).
  1. 42.
    • Existenţa polinoamelor de cea mai bună aproximatie. Lema preliminară dela Nr. 6 se extinde imediat :

Dacă un polinom P(x,y)\mathrm{P}(x,y) de gradul n rămâne mărginit de un număr A, în domeniul (D), coeficientii aij rămân mărginiti de un număr λA\lambda\mathrm{A}, unde λ\lambda nu depinde decât de n si de domeniul (D).

Demonstratia se face la fel. Luăm N=(n+22)\mathrm{N}=\binom{n+2}{2} puncte Mr(xr,yr)\mathrm{M}_{r}\left(x_{r},y_{r}\right),

π=1,2,,N\pi=1,2,\ldots,N in (D) astfel ca determinantul
(39)
1xryrxrnxrn1yryrn|\left.\begin{array}[]{lllll}1&x_{r}&y_{r}&\ldots&x_{r}^{n}\end{array}\quad x_{r}^{n-1}y_{r}\ldots y_{r}^{n}\right\rvert\,
să fie diferit de zero. Rezolvăm apoi sistemul

a00+a10xr+a01yr++an0xrn+an11xrn1yr++a02yrn=P(xr,yr)r=1,2,,N\begin{gathered}a_{00}+a_{10}x_{r}+a_{01}y_{r}+\ldots+a_{n0}x_{r}^{n}+a_{n-11}x_{r}^{n-1}y_{r}+\ldots+a_{02}y_{r}^{n}=\mathrm{P}\left(x_{r},y_{r}\right)\\ r=1,2,\ldots,\mathrm{~N}\end{gathered}

in raport cu coeficienţii aija_{ij} cu ajutorul regulei lui Cramer şi tinem seamă de

|P(xr,yr)|<A,r=1,2,,N.\left|\mathrm{P}\left(x_{r},y_{r}\right)\right|<\mathrm{A},\quad r=1,2,\ldots,\mathrm{~N}.

Se pot uşor alege punctele Mr astfel ca determinantul (39) să fie diferit de zero. E destul să luăm N puncte distincte formând o reţea triunghiulară astfel

(xr,ys),r=1,2,,n+1,s=r,r+1,,n+1.\left(x_{r},y_{s}\right),\quad r=1,2,\ldots,n+1,\quad s=r,r+1,\ldots,n+1.

Determinantul sistemului este atunci egal, afară poate de semn, cu

V(x1,x2)V(x1,x2,x3)V(x1,x2,,xn+1)V(y1,y2,,yn+1)..V(y2,y3,,yn+1)V(yn1,yn,yn+1)V(yn,yn+1),\begin{gathered}\mathrm{V}\left(x_{1},x_{2}\right)\mathrm{V}\left(x_{1},x_{2},x_{3}\right)\ldots\mathrm{V}\left(x_{1},x_{2},\ldots,x_{n+1}\right)\mathrm{V}\left(y_{1},y_{2},\ldots,y_{n+1}\right).\\ .\mathrm{V}\left(y_{2},y_{3},\ldots,y_{n+1}\right)\ldots\mathrm{V}\left(y_{n-1},y_{n},y_{n+1}\right)\mathrm{V}\left(y_{n},y_{n+1}\right),\end{gathered}

intrebuinţând notația deja semnalată a determinatului lui Van der Monde.

Rezultatele dela Nr. 7 sunt aplicabile. M(|fP|)\mathrm{M}(|f-\mathrm{P}|) este o functie continuă de coeficienții aija_{ij}. Rezultă că marginea inferioară μn\mu_{n} a numerilor M(|fP|)\mathrm{M}(|f-\mathrm{P}|) coincide cu limita lor inferioară.

Repetând acum rationamentul dela Nr. 8 putem enunta propriedatea :

Oricare ar fi functia continuă f(x,y)f(x,y), există cel puțin un polinom de cea mai bună aproximatie de gradul n.

E de observat că acest rezultat rămâne adevărat chiar şi pentru - funcţie mărginită oarecare.
43. - Prima proprietate a polinoamelor de cea mai bunã approximatie, Dacă P(x,y)P(x,y) este un polinom de cea mai bună aproximaţie de gradul nn, există cel puțin un punct ( x,yx,y ) unde avem

|f(x,y)P(x,y)|=μn|f(x,y)-\mathrm{P}(x,y)|=\mu_{n} (40)

Numărul acestor puncte capătă o primă precizare prin proprietatea următoare:

Dacă P(x,y)\mathrm{P}(x,y) este un polinom de cea mai bună aproximasie de gradul n, există cel putin n+2n+2 puncte unde avem egalitatea (40).

Pentru a demonstra această proprietate să considerăm întâi n𝔸n\not+\mathbb{A} puncte distincte Mr(xr,yr),r=1,2,,n+1\mathrm{M}_{r}\left(x_{r},y_{r}\right),r=1,2,\ldots,n+1 şi fie tabloul

1xryrxrnxrn1yryrn\displaystyle\left\|1\quad x_{r}\quad y_{r}\ldots x_{r}^{n}x_{r}^{n-1}y_{r}\ldots y_{r}^{n}\right\| (41)
r=1,2,,n+1\displaystyle r=1,2,\ldots,n+1

cu n=(n+22)n=\binom{n+2}{2} coloane si n+1n+1 linii. Să inmulțim acest tablou cus urniătorul

1000000000001i0000000000012i1000000000013i3i000000000001(n1)i(n2)i2\|\begin{array}[]{cccccccccccccccc}1&0&0&0&0&0&0&0&0&0&\ldots&\ldots&\ldots&\ldots&\ldots&0\\ 0&1&i&0&0&0&0&0&0&0&\ldots&\ldots&\ldots&\ldots&\ldots&0\\ 0&0&0&1&2i&-1&0&0&0&0&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots\\ 0&0&0&0&0&0&1&3i&-3&-i&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots\\ \ldots&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots&\ldots\\ 0&0&0&0&0&0&0&0&0&0&0&\ldots&1&\binom{n}{1}i&\binom{n}{2}i^{2}&\ldots\end{array}

unde i=γ1¯i=\gamma\overline{-1}.
Dacă punem zr=xr+iyrz_{r}=x_{r}+iy_{r} vedem că produsul celor două tablouris este egal cu determinantul V(z1,z2,,zn+1)\mathrm{V}\left(z_{1},z_{2},\ldots,z_{n+1}\right) şi deci este diferit dezero. Formula [cunoscută a lui Cauchy ne arată atunci că există îno tabloul (41) cel putin un determinant de crdinul n+1n+1 diferit de zero.

Să presupunem acum că egalitatea (40) nu are loc decât in mn+1m\leq n+1 puncte Mr(xr,yr),r=1,2,,m\mathrm{M}_{r}\left(x_{r},y_{r}\right),r=1,2,\ldots,m. From the demonstrated property of the array (41) it follows that we can find in the firstmmlines a determinant of the ordermmdifferent from zero. Either for fixing ideas,

|1xRxR2xRm2xRyR|,R=1,2,,\left|1\quad x_{r}\quad x_{r}^{2}\ldots x_{r}^{m-2}\quad x_{r}y_{r}\right|,\quad r=1,2,\ldots,

such a determinant.
Let's construct the polynomialQ(x,y)Q(x,y)of the degreennand of shape

Q(x,y)=b0+b1x++bm2xm2+bxyQ(x,y)=b_{0}+b_{1}x+\cdots+b_{m-2}x^{m-2}+bxy

which checks the conditions

Q(xR,yR)=f(xR,yR)P(xR,yn),R=1,2,,m,\mathrm{Q}\left(x_{r},y_{r}\right)=f\left(x_{r},y_{r}\right)-\mathrm{P}\left(x_{r},y_{n}\right),\quad r=1,2,\ldots,m,

what is possible.
Let us consider the closed circles (CR\mathrm{C}_{r}) with the center inMR\mathrm{M}_{r}and the radius equal to a positive numberδ\deltaWe choose this numberδ\deltaso that
101^{0}. The circles (CR\mathrm{C}_{r}) not to be cut.
20:f(x,y)P(x,y),Q(x,y)2^{0}:f(x,y)-\mathrm{P}(x,y),\mathrm{Q}(x,y)not to be canceled in these circles.

It follows that in each circle the functionsf(x,y)P(x,y);Q(x,y)f(x,y)-\mathrm{P}(x,y);\mathrm{Q}(x,y)keeps the same sign.

Let (J') be the closed domain obtained from (D) by removing the interior of the corks (CR\mathrm{C}_{r}In this field (D\mathrm{D}^{\prime}) we have

|f(x,y)P(x,y)|μ<μn|f(x,y)-\mathrm{P}(x,y)|\leq\mu^{\prime}<\mu_{n}

Be it nowλ\lambdaa positive number chosen so that

λ<μnμ2M(|Q|)(<μnM(|Q|))\lambda<\frac{\mu_{n}-\mu^{\prime}}{2M(|Q|)}\left(<\frac{\mu_{n}}{M(|Q|)}\right)

we have then

|f(x,y)(x,y)λQ(x,y)|<μ+μnμ2=μn+μ2<μn|f(x,y)-\mathbb{R}(x,y)-\lambda Q(x,y)|<\mu^{\prime}+\frac{\mu_{n}-\mu^{\prime}}{2}=\frac{\mu_{n}+\mu^{\prime}}{2}<\mu_{n}

from all over the field (D\mathrm{D}^{\prime}).
In a circle (Cφ\mathrm{C}_{\varphi}) we have

|f(x,y)P(x,y)λQ(x,y)|<μn|f(x,y)-\mathrm{P}(x,y)-\lambda Q(x,y)|<\mu_{n}

In fact, for example,

f(xR,yR)P(xR,yR)=μnf\left(x_{r},y_{r}\right)-\mathrm{P}\left(x_{r},y_{r}\right)=\mu_{n}

then in (CR\mathrm{C}_{r})
μn<λQ(x,y)f(x,y)P(x,y)λQ(x,y)f(x,y)P(x,y)μ1it is-\mu_{n}<-\lambda Q(x,y)\leq f(x,y)-\mathrm{P}(x,y)-\lambda Q(x,y)\leq f(x,y)-\mathrm{P}(x,y)\leq\mu_{1_{e}}
equality cannot occur unlessf(x,y)P(x,y)=0f(x,y)-\mathrm{P}(x,y)=0orQ(x,y)=0\mathrm{Q}(x,y)=0, which I saw was impossible.

It follows that, in the entire domain (D), we have

|f(x,y)P(x,y)λQ(x,y)|<μn|f(x,y)-\mathrm{P}(x,y)-\lambda\mathrm{Q}(x,y)|<\mu_{n}

so the polynomialP(x,y)+λQ(x,y)\mathrm{P}(x,y)+\lambda\mathrm{Q}(x,y)gives a better approximation. This is not a contradiction with the hypothesis thatP(x,y)\mathrm{P}(x,y)is a polynomialTn\mathrm{T}_{n}; thus the property is proven.

44.- Completion of the previous result. The previous property-

The tooth can be specified as follows:

IfP(x,y)\mathrm{P}(x,y)is a polynomialTn\mathrm{T}_{n}, there is at leastn+22\left\lceil\frac{n+2}{2}\right\rfloorpunctureMR(xR,yR)\mathrm{M}_{r}\left(x_{r},y_{r}\right)where

f(xR,yR)P(xR,yR)=μnf\left(x_{r},y_{r}\right)-P\left(x_{r},y_{r}\right)=\mu_{n}

and at leastn+22\left\lceil\frac{n+2}{2}\right\rfloorpunctureMR(xR,yR)\mathrm{M}_{r}^{\prime}\left(x_{r}^{\prime},y_{r}^{\prime}\right)where

f(xR,yR)P(x,Ry)R=μRf\left(x_{r}^{\prime},y_{r}^{\prime}\right)-\mathrm{P}\left(x^{\prime}{}_{r},y^{\prime}{}_{r}\right)=-\mu_{r}\ldots

to denote the largest integer contained in æ.
Let us demonstrate the first part of the statement for example.

There cannot be no pointM\mathrm{M}_{\text{r }}, because otherwise we would have

μnf(x,y)P(x,y)μ<μn, in (D) -\mu_{n}\leq\equiv f(x,y)-\mathrm{P}(x,y)\leq\mu^{\prime}<\mu_{n}\quad,\quad\text{ in (D) }

so

μn<μn+μ2f(x,y)P(x,y)+μnμ2μn+μ2<μn-\mu_{n}<-\frac{\mu_{n}+\mu^{\prime}}{2}\leq f(x,y)-\mathrm{P}(x,y)+\frac{\mu_{n}-\mu^{\prime}}{2}\leq\frac{\mu_{n}+\mu^{\prime}}{2}<\mu_{n}

and the polynomialP(x,y)μnμ2\mathrm{P}(x,y)-\frac{\mu_{n}-\mu^{\prime}}{2}would give a better approximation.
Let us therefore assume that there are onlym<[n+22]m<\left[\frac{n+2}{2}\right]punctureMr, \mathrm{M}_{\text{r, }}
R=1,2,,mr=1,2,\ldots,mWe still consider the closed circles (CR\mathrm{C}_{r}) defined in the previous No. We still take their common radiusδ\deltasmall enough so that the circles don't intersect and so on.f(x,y)P(x,y)f(x,y)-\mathrm{P}(x,y)to remain positive in these circles. In each circle (CRRC_{rr}) we take a pointMR(ξR,ηR)\mathrm{M}_{r}^{*}\left(\xi_{r},\eta_{r}\right)at a distance14δ\frac{1}{4}\deltaofMR\mathrm{M}_{r}, and be(CR)\left(\mathrm{C}_{r}^{*}\right)^{*}the closed circle with the center inMR\mathrm{M}_{r}^{*}and the radius equal to34δ=δ\frac{3}{4}\delta=\delta^{*}.
Now let the polynomial of degreenn

Q(x,y)=[(xξ1)2+(yη1)2δ2][(xξ2)2+(yη2)2δ2][(xξm)2+(yηm)2δ2].\begin{gathered}Q(x,y)=\left[\left(x-\xi_{1}\right)^{2}+\left(y-\eta_{1}\right)^{2}-\delta^{*}2\right]\left[\left(x-\xi_{2}\right)^{2}+\left(y-\eta_{2}\right)^{2}-\delta^{*2}\right]\ldots\\ \ldots\left[\left(x-\xi_{m}\right)^{2}+\left(y-\eta_{m}\right)^{2}-\delta^{*2}\right].\end{gathered}

This polynomial only vanishes on the contour of the circles (C2\mathrm{C}_{2}^{*}). We haveQ(x,y)<0Q(x,y)<0inside these circles andQ(x,y)>0Q(x,y)>0in the open domain (D'), which is obtained from (D) by removing the circles (CR\mathrm{C}_{r}^{*}).

In (D\mathrm{D}^{\prime}) we have

μnf(x,y)P(x,y)μ<μn.-\mu_{n}\leq f(x,y)-P(x,y)\leq\mu^{\prime}<\mu_{n}.

Let's takeλ\lambdapositive so that

λ<μnμ2M(|Q|)(<μnM(|Q|)).\lambda<\frac{\mu_{n}-\mu^{\prime}}{2M(|Q|)}\left(<\frac{\mu_{n}}{M(|Q|)}\right).

We have in the domain (D\mathrm{D}^{\prime})

f(x,y)P(x,y)+λQ(x,y)<μ+μnμ2=μn+μ2<μnf(x,y)-\mathrm{P}(x,y)+\lambda\mathrm{Q}(x,y)<\mu^{\prime}+\frac{\mu_{n}-\mu^{\prime}}{2}=\frac{\mu_{n}+\mu^{\prime}}{2}<\mu_{n}

and

μn<f(x,y)P(x,y)+λQ(x,y),-\mu_{n}<f(x,y)-\mathrm{P}(x,y)+\lambda Q(x,y),

For this inequality, it is worth noting that we have the sign\leq. Equality could only occur ifQ(x,y)=0Q(x,y)=0but-
thenf(x,y)P(x,y)>0f(x,y)-\mathrm{P}(x,y)>0We have so but

f(x,y)P(x,y)+λQ(x,y)<μn\mid f(x,y)-\mathrm{P}(x,y)+\lambda Q(x,y)<\mu_{n}

in (D\mathrm{D}^{\prime}).
In the circles (CR\mathrm{C}_{r}^{*}) we have
s.

f(x,y)P(x,y)+λQ(x,y)<μnf(x,y)-\mathrm{P}(x,y)+\lambda Q(x,y)<\mu_{n}
μn<λQ(x,y)<f(x,y)P(x,y)+λQ(x,y).-\mu_{n}<\lambda Q(x,y)<f(x,y)-\mathrm{P}(x,y)+\lambda Q(x,y).

It follows that we have

|f(x,y)P(x,y)+λQ(x,y)|<μn|f(x,y)-\mathrm{P}(x,y)+\lambda Q(x,y)|<\mu_{n}

everywhere in (D). 1 The polynomialP(x,y)λQ(x,y)\mathrm{P}(x,y)-\lambda\mathrm{Q}(x,y)therefore a better approximation, contrary to the hypothesis. This contradiction proves the property.

The proof is done the same for the points𝐌𝒓\mathbf{M}_{\boldsymbol{r}}^{\boldsymbol{\prime}}.
45. - Theorem of Mr. L. Tonelli. WhetherP(x,y)\mathrm{P}(x,y)a polynomial of degree n and E the set of points (x,yx^{\prime},y^{\prime}) in which M—(fPf-\mathrm{P})— is reached. The set E may be finite or an arbitrary closed set. D1 L. Tonelli gave the following theorem, which is somewhat analogous to the first theorem of Mr. E. Borel (No. 15):

The necessary and sufficient condition thatP(x,y)\mathrm{P}(x,y)to be a polynomial T n is such that no polynomial can be foundQ(x,y)Q(x,y)of degree n verifying conditions
10.sgQ(x,y)=sg(f(x,y)P(x,y))\operatorname{sg}\mathrm{Q}\left(x^{\prime},y^{\prime}\right)=\operatorname{sg}\left(f\left(x^{\prime},y^{\prime}\right)-\mathrm{P}\left(x^{\prime},y^{\prime}\right)\right),
20.Γ>|Q(x,y)|>γ>02^{0}.\Gamma>\left|Q\left(x^{\prime},y^{\prime}\right)\right|>\gamma>0,
at all points of the manifold E.
To show that this condition is sufficient, it is enough to show that ifP(x,y)\mathrm{P}(x,y)is not a polynomialTn\mathrm{T}_{n}we can find such a polynomialQ(x,y)Q(x,y)Let us therefore suppose that

M(|fP|)=μ>μn\mathrm{M}(|f-\mathrm{P}|)=\mu^{\prime}>\mu_{n}

From the relationship

Tn(x,y;f)P(x,y)=f(x,y)P(x,y)(f(x,y)Tn(x,y;f))\mathrm{T}_{n}(x,y;f)-\mathrm{P}(x,y)=f(x,y)-\mathrm{P}(x,y)-\left(f(x,y)-\mathrm{T}_{n}(x,y;f)\right)

it results that

sg(Tn(x,y;f)P(x,y))=sg(f(x,y)P(x,y))0<μμn|Tn(x,y;f)P(x,y)|μ+μn.\begin{gathered}\operatorname{sg}\left(\mathrm{T}_{n}\left(x^{\prime},y^{\prime};f\right)-\mathrm{P}\left(x^{\prime},y^{\prime}\right)\right)=\operatorname{sg}\left(f\left(x^{\prime},y^{\prime}\right)-\mathrm{P}\left(x^{\prime},y^{\prime}\right)\right)\\ 0<\mu^{\prime}-\mu_{n}\leq\left|\mathrm{T}_{n}\left(x^{\prime},y^{\prime};f\right)-\mathrm{P}\left(x^{\prime},y^{\prime}\right)\right|\leq\mu^{\prime}+\mu_{n}.\end{gathered}

So we can takeQ(x,y)=Tn(x,y;f)P(x,y)Q(x,y)=T_{n}(x,y;f)-\mathrm{P}(x,y).
Let us now show that this condition is also necessary. Let us assume that

|f(x,y)P2(x,y)|μ<μn\left|f(x,y)-{}^{2}\mathrm{P}(x,y)\right|\leq\mu^{\prime}<\mu_{n}

Be it nowλ\lambdaa positive number chosen so that

λ<min(μnμ2M(|Q|),μnεΓ+ε)\lambda<\min\left(\frac{\mu_{n}-\mu^{\prime}}{2\mathrm{M}(|Q|)},\frac{\mu_{n}-\varepsilon}{\Gamma+\varepsilon}\right)

we have then

|f(x,y)P(x,y)λQ(x,y)|<μ+μnμ2=μn+μ2<μn|f(x,y)-P(x,y)-\lambda Q(x,y)|<\mu^{\prime}+\frac{\mu_{n}-\mu^{\prime}}{2}=\frac{\mu_{n}+\mu^{\prime}}{2}<\mu_{n}

in (D\mathrm{D}^{\prime}).
In a circle (C\mathrm{C}^{\prime}) where

f(x,y)P(x,y)=μnf\left(x^{\prime},y^{\prime}\right)-\mathrm{P}\left(x^{\prime},y^{\prime}\right)=\mu_{n}

HAVE

0<μnε<f(x,y)P(x,y)μnλ(Γ+ε)<λQ(x,y)<λ(γε)\begin{array}[]{r}0<\mu_{n}-\varepsilon<f(x,y)-\mathrm{P}(x,y)\leq\mu_{n}\\ -\lambda(\Gamma+\varepsilon)<\lambda Q(x,y)<-\lambda(\gamma-\varepsilon)\end{array}

so
0<μnελ(Γ+ε)<f(x,y)P(x,y)λQ(x,y)<μnλ(γε)<μn0<\mu_{n}-\varepsilon-\lambda(\Gamma+\varepsilon)<f(x,y)-P(x,y)-\lambda Q(x,y)<\mu_{n}-\lambda(\gamma-\varepsilon)<\mu_{n}.
Similarly, we observe that in a circle (C\mathrm{C}^{\prime}) where

f(x,y)P(x,y)=μxf\left(x^{\prime},y^{\prime}\right)-\mathrm{P}\left(x^{\prime},y^{\prime}\right)=-\mu_{x}

HAVE
μn<μn+λ(γε)</(x,y)P(x,y)λQ(x,y)<μn+ε+λ(Γ+ε)<0-\mu_{n}<-\mu_{n}+\lambda(\gamma-\varepsilon)</(x,y)-\mathrm{P}(x,y)-\lambda Q(x,y)<-\mu_{n}+\varepsilon+\lambda(\Gamma+\varepsilon)<0It follows therefore
that in the circles (C\mathrm{C}^{\prime}),

|f(x,y)P(x,y)λQ(x,y)|<μnλ(γε)<μn.|f(x,y)-\mathrm{P}(x,y)-\lambda Q(x,y)|<\mu_{n}-\lambda(\gamma-\varepsilon)<\mu_{n}.

It is seen, however, that forλ\lambdawe have quite a bit

|f(x,y)P(x,y)λQ(x,y)|<μn|f(x,y)-\mathrm{P}(x,y)-\lambda Q(x,y)|<\mu_{n}

in the whole domain (D). This inequality contains the contradiction that proves the theorem.
47. - Multiplicity of polynomials𝐓n\mathbf{T}_{n}We will now show, by an example, that the polynomialTn(x,y;f)\mathrm{T}_{n}(x,y;f)may not be unique.

Whetherf(x)f(x)a continuous function of a variable defined in the interval(A,b)(a,b)andTn(x)\mathrm{T}_{n}(x)its best-approximation polynomial of degree n. Let us denote byμn\mu_{n}^{*}the approximation given byTn(x)\mathrm{T}_{n}(x){}^{*}

Let us now consider the function

f(x,y)=1dc[(yc)Tn(x)(yd)f(x)]f(x,y)=\frac{1}{d-c}\left[(y-c)\mathrm{T}_{n}(x)-(y-d)f(x)\right]

defined in the rectangle (38). Letμn\mu_{n}the best approximation off(x,y)f(x,y)by polynomials of degree n. We have

f(x,y)Tn(x)=dydc(f(x)Tn(x))f(x,y)-\mathrm{T}_{n}(x)=\frac{d-y}{d-c}\left(f(x)-\mathrm{T}_{n}(x)\right)

which shows that

|f(x,y)Tn(x)|μn\left|f(x,y)-\mathrm{T}_{n}(x)\right|\leq\mu_{n}^{*}

equality is only possible fory=cy=cand for certain values ​​ofxxSo we have for sureμnμn\mu_{n}\leq\mu_{n}^{*}.

We now have

f(x,c)=f(x)f(x,c)=f(x)

and it follows that ifP(x,y)\mathrm{P}(x,y)is a polynomialTn\mathrm{T}_{n}his/herf(x,y)f(x,y)it must be

P(x,c)Tn(x).\mathrm{P}(x,c)\equiv\mathrm{T}_{n}(x).

Otherwise there would be at least one valuexxfor which

|f(x,c)P(x,c)|>μn|f(x,c)-\mathrm{P}(x,c)|>\mu_{n}^{*}

We therefore have
Now let the polynomial

μn=μn.\mu_{n}=\mu_{n}^{*}.
P(x,y)=Tn(x)+λμnxycdc\mathrm{P}(x,y)=\mathrm{T}_{n}(x)+\lambda\mu_{nx}^{*}\frac{y-c}{d-c}

sounds|λ|1|\lambda|\leq 1We have

|f(x,y)P(x,y)|=1dc|(dy)(f(x)Tn(x))λμn(yc)|μndc[(dy)+|λ|(yc)]μn=μn\begin{gathered}|f(x,y)-\mathrm{P}(x,y)|=\frac{1}{d-c}\left|(d-y)\left(f(x)-\mathrm{T}_{n}(x)\right)-\lambda\mu_{n}^{*}(y-c)\right|\leq\\ \leq\frac{\mu_{n}^{*}}{d-c}[(d-y)+|\lambda|(y-c)]\leq\mu_{n}^{*}=\mu_{n}\end{gathered}

so all these polynomials are polynomialsTn\mathrm{T}_{n}Without
further ado, we only point out that DI L. Tonelli has also established various other properties of polynomials.Tn\mathrm{T}_{n}. You can see the cited article by Mr. Tonelli.
47. - Weierstrass's theorem. Weierstrass's theorem, stated in No. 30 for continuous functions of one variable, remains true. This theorem tells us that if the function is continuous we have

μn(f)0 for n\mu_{n}(f)\rightarrow 0\quad\text{ pentru }\quad n\rightarrow\infty

For simplicity, let us assume that (D) is the rectangle (38). In very general cases we can return to this case by conveniently extending the functionf(x,y)f(x,y)We can prove Weierstrass's theorem with the help of Mr. S. Bernstein's polynomials of two variables.

Pm,n(x,y;f)=1(bA)m(dc)nand=0mj=0n(mand)(nand)f(Aand,cj)(xA)and(bx)mand(yc)j(dy)nj\begin{gathered}\mathrm{P}_{m,n}(x,y;f)=\frac{1}{(b-a)^{m}(d-c)^{n}}\sum_{i=0}^{m}\sum_{j=0}^{n}\binom{m}{i}\binom{n}{i}f\left(a_{i},c_{j}\right)(x-a)^{i}(b-x)^{m-i}\ldots\\ \cdot(y-c)^{j}(d-y)^{n-j}\end{gathered}

where

Aand=A+andbAm,\displaystyle a_{i}=a+i\frac{b-a}{m}, and=0,1,,m\displaystyle i=0,1,\ldots,m
cj=c+jdcn,\displaystyle c_{j}=c+j\frac{d-c}{n}, j=0,1,,n\displaystyle j=0,1,\ldots,n

these polynomials.
To limit the approximation given by this polynomial we define the oscillation modulusω(δ)\omega(\delta)his/herf(x,y)f(x,y)in the following way

ω(δ)=MAX|f(x,y)f(x",y")|\omega(\delta)=\max\left|f\left(x^{\prime},y^{\prime}\right)-f\left(x^{\prime\prime},y^{\prime\prime}\right)\right|

when (x,yx^{\prime},y^{\prime}), (x",y"x^{\prime\prime},y^{\prime\prime}) are two points in (D) such that

|xx"|+|yy"|δ.\left|x^{\prime}-x^{\prime\prime}\right|+\left|y^{\prime}-y^{\prime\prime}\right|\leq\delta.

functionω(δ)\omega(\delta)enjoys properties analogous to those of the case of functions of one variable. These properties are proved in the same way. Let us recall them here for the case of two variables.
ω(δ)\omega(\delta)is a function defined forδbA+dc\delta\leq b-a+d-c, non-decreasing and which does not become negative. We have

|f(x,y)f(x",y")|ω(|xx"|+|yy"|)\left|f\left(x^{\prime},y^{\prime}\right)-f\left(x^{\prime\prime},y^{\prime\prime}\right)\right|\leq\omega^{\prime}\left(\left|x^{\prime}-x^{\prime\prime}\right|+\left|y^{\prime}-y^{\prime\prime}\right|\right)

and

ω(kδ)<(k+1)ω(δ)\omega(k\delta)<(k+1)\omega(\delta)

for a positive numberkkso thatka¯k\bar{o}andδ\deltato bebA+dc\leq b-a+d-c.
The necessary and sufficient condition that the functionf(x,y)f(x,y)to be continuous in (D) is such that we haveω(δ)0\omega(\delta)\rightarrow 0forδ0\delta\rightarrow 0.

Returning now to our problem, we can write, taking into account the properties of the oscillation modulus,

|f(x,y)Pm,n(x,y;f)|1(bA)m(dc)nand=0mj=0n(mand)(nj).|f(x,y)f(Aand,cj)|(xA)and(bx)mand(yc)j(dy)nj\begin{gathered}\left|f(x,y)-\mathrm{P}_{m,n}(x,y;f)\right|\leq\frac{1}{(b-a)^{m}(d-c)^{n}}\sum_{i=0}^{m}\sum_{j=0}^{n}\binom{m}{i}\binom{n}{j}.\\ \cdot\left|f(x,y)-f\left(a_{i},c_{j}\right)\right|(x-a)^{i}(b-x)^{m-i}(y-c)^{j}(d-y)^{n-j}\end{gathered}

and

|f(x,y)f(Aand,cj)|<[|xAand|+|ycj|δ+1]ω(δ)\left|f(x,y)-f\left(a_{i},c_{j}\right)\right|<\left[\frac{\left|x-a_{i}\right|+\left|y-c_{j}\right|}{\delta}+1\right]\omega(\delta)

Doing the calculations, it is found that

|f(x,y)Pm,n(x,y;f)|{1δ[1(bA)mand=0m(mand)|xAand|(xA)and(bx)mand++1(dc)nj=0n(nj)|ycj|(yc)and(dy)nj]+1}ω(δ)\begin{gathered}\left|f(x,y)-\mathrm{P}_{m,n}(x,y;f)\right|\leq\left\{\frac{1}{\delta}\left[\frac{1}{(b-a)^{m}}\sum_{i=0}^{m}\binom{m}{i}\left|x-a_{i}\right|(x-a)^{i}(b-x)^{m-i}+\right.\right.\\ \left.\left.\quad+\frac{1}{(d-c)^{n}}\sum_{j=0}^{n}\binom{n}{j}\left|y-c_{j}\right|(y-c)^{i}(d-y)^{n-j}\right]+1\right\}\omega(\delta)\end{gathered}

However, we showed in No. 34 that

1(bA)mnand=0m(mand)|xAand|(xA)and(bx)mandbA2m\displaystyle\frac{1}{(b-a)^{mn}}\sum_{i=0}^{m}\binom{m}{i}\left|x-a_{i}\right|(x-a)^{i}(b-x)^{m-i}\leq\frac{b-a}{2\sqrt{m}}
1(dc)nj=1n(nand)|ycj|(yc)j(dy)nanddc2n\displaystyle\frac{1}{(d-c)^{n}}\sum_{j=1}^{n}\binom{n}{i}\left|y-c_{j}\right|(y-c)^{j}(d-y)^{n-i}\leq\frac{d-c}{2\sqrt{n}}

So if we take

δ=bAm+dcn\delta=\frac{b-a}{\sqrt{m}}+\frac{d-c}{\sqrt{n}}

FIND

|f(x,y)Pm,n(x,y;f)|<32ω(bAm+dρn)\left|f(x,y)-\mathrm{P}_{m,n}(x,y;f)\right|<\frac{3}{2}\omega\left(\frac{b-a}{\sqrt{m}}+\frac{d-\rho}{\sqrt{n}}\right)

If we dom,nm\rightarrow\infty,n\rightarrow\inftywe come across Weierstrass's theorem-
48. - The problem of the best approximation for a function of a complex variable. So far we have studied the case of functions of real variables. Let us briefly examine the case of functions of a complex variable. A functionf(x,y)f(x,y)of two real variables that takes real or complex values ​​can also be called a function of a complex variable...z=x+andy(and=1)z=x+iy(i=\sqrt{-1})Such a function is of the formf1(x,y)+andf2(x,y)f_{1}(x,y)+if_{2}(x,y)\cdotswheref1f_{1}andf2f_{2}are real functions. The necessary and sufficient condition that the functionf(x,y)f(x,y)to be continuous is that the functionsf1f_{1}andf2f_{2}to be continuous.

For abbreviation functionf(x,y)f(x,y)it is also noted withf(z)f(z). Vomiting. assumes as above thatf(z)f(z)is defined and continuous in the domain (D).

Let us now consider the set of analytic polynomials of degree n-

P(z)=A0zn+A1zn1++An\mathrm{P}(z)=a_{0}z^{n}+a_{1}z^{n-1}+\cdots+a_{n}

A polynomial of the set is completely determined by its coefficientsA0,A1,,Ana_{0},a_{1},\ldots,a_{n}real or complex.

The modulus of a function is a real function soM(|fP|U)\mathrm{M}(|f-\mathrm{P}|\mathrm{U}\mid)has a well-defined meaning here as well and represents, by definition, the error or approximation with which the polynomialP(z)\mathrm{P}(z)represents the functionf(z)f(z)in the domain (D). The best approximationμn(f)\mu_{n}(f), or shorterμn\mu_{n}, is here too, —by definition, the lower edge of the numbersM(|fP|)\left.\mathrm{M}_{(}|f-\mathrm{P}|\right)when Po of degree n.

The problem that interests us is posed as before:
Given a functionf(z)f(z), to determine the polynomials of degree n for whichM(|fP|)\mathrm{M}(|f-\mathrm{P}|)reaches its lower edgeμn\mu_{n}and to study this numberμn\mu_{n}.

The definition of a best-fitting polynomial is self-explanatory. We will denote such a polynomial byTn(z;f)\mathrm{T}_{n}(z;f)and we will say it is a polynomialTn\mathrm{T}_{n}.

It is proven, exactly as above, that:
Any continuous functionf(z)f(z)admits at least one polynomial of best approximation of degree n.

This result remains exact for any bounded function.

numberμn\mu_{n}is positive or null and cannot be canceled unlessf(z)f(z)reduces to an analytic polynomial of degreennWe will assume, in the following, thatμn>0\mu_{n}>0.

This best approximation problem was also studied by Mr. L. Tonelli in the cited work.
49. - Fundamental property of polynomials𝐓𝒏\mathbf{T}_{\boldsymbol{n}}The first property of best-approximation polynomials is the following:

IfP(z)\mathrm{P}(z)is a best-approximation polynomial of degree n, there exists at leastn+2n+2points where

|f(z)P(z)|=μn|f(z)-\mathrm{P}(z)|=\mu_{n} (42)

Let us assume the opposite and letz1,z2,,zmz_{1},z_{2},\ldots,z_{m}the points, in number only ofmn+1m\leq n+1where we have the equality (42). Let

f(zR)P(zR)=μnit isandARR=1,2,,m.f\left(z_{r}\right)-\mathrm{P}\left(z_{r}\right)=\mu_{n}e^{ia_{r}}\quad r=1,2,\ldots,m.

Lagrange's interpolation formula allows us to determine an apolynomialQ(z)Q(z)of the degreennso that

Q(zR)=μnit isandARR=1,2,,m.Q\left(z_{r}\right)=\mu_{n}e^{ia_{r}}\quad r=1,2,\ldots,m.

Let's put

f(z)P(z)=μit isandα,Q(z)=Vit isandβf(z)-\mathrm{P}(z)=\mu e^{i\alpha},\quad Q(z)=ve^{i\beta}

where, of course,μ,n,α,β\mu,\nu,\alpha,\betadepend on the pointzzLet us now
consider the closed corks (CRC_{r}) with the center inzRz_{r}and
radius §. We takeδ\deltasmall enough because
10. The circles (CR\mathrm{C}_{r}) not to be cut.
20.f(z)P(z),Q(z)2^{0}.f(z)-\mathrm{P}(z),Q(z)not to cancel in the circles (CR)\left.\mathrm{C}_{\mathrm{r}}\right). There will be. then a positive numberγ\gammaso thatμγ,γγ\mu\geq\gamma,\gamma\geq\gammain circles(CR)\left(\mathrm{C}_{r}\right).
30. Let's have

|ααR|α<π4,|βαR|α<π4\left|\alpha-\alpha_{r}\right|\leq\alpha^{\prime}<\frac{\pi}{4},\left|\beta-\alpha_{r}\right|\leq\alpha^{\prime}<\frac{\pi}{4}

in the circles (CR\mathrm{C}_{r}).
All these circumstances can be achieved by virtue of the continuity of functionsf(z)P(z),Q(z)f(z)-\mathrm{P}(z),\mathrm{Q}(z).

In the whole field (D\mathrm{D}^{\prime}) which is obtained from (D) by removing the interior of the circles (CR\mathrm{C}_{r}), we have

|f(z)P(z)|μ<μn|f(z)-\mathrm{P}(z)|\leq\mu^{\prime}<\mu_{n}

μ\mu^{\prime}being a fixed number.
Let us now take a positive numberλ\lambdaso that

λ<min(μnμ2M(|Q|),2γCart2αM(|Q|)).\lambda<\min\left(\frac{\mu_{n}-\mu^{\prime}}{2\mathrm{M}(|Q|)},\frac{2\gamma\cos 2\alpha^{\prime}}{\mathrm{M}(|Q|)}\right).

we

f(z)P1z)λQ(z)|2=|μit isandαλVit isandβ|=μ2+λ2μ22μλVCart(αβ)V\left.\mid f(z)-P_{1}z\right)-\left.\lambda Q(z)\right|^{2}=\left|\mu e^{i\alpha}-\lambda ve^{i\beta}\right|=\mu^{2}+\lambda^{2}\mu^{2}-2\mu\lambda v\cos(\alpha-\beta)v

But in the circles (CR\mathrm{C}_{r})

Cart(αβ)Cart2α>0\cos(\alpha-\beta)\geq\cos 2\alpha^{\prime}>0

and

λ2γ22λμnCart(αβ)<λn(λM(|Q|)2γCart2α)<0\lambda^{2}\gamma^{2}-2\lambda\mu\nu\cos(\alpha-\beta)<\lambda\nu\left(\lambda M(|Q|)-2\gamma\cos 2\alpha^{\prime}\right)<0

We therefore have, in the circles (CR\mathrm{C}_{r}),

|f(z)P(z)λQ(z)|μ"<μn|f(z)-P(z)-\lambda Q(z)|\leq\mu^{\prime\prime}<\mu_{n}

μ"\mu^{\prime\prime}being a fixed number.
On the other hand, in the domain (D\mathrm{D}^{\prime}) we have

f(z)P(z)λQ(z)|μ+μnμ2=μn+μ2<μn.f(z)-\mathrm{P}(z)-\lambda\mathrm{Q}(z)\left\lvert\,\leq\mu^{\prime}+\frac{\mu_{n}-\mu^{\prime}}{2}=\frac{\mu_{n}+\mu^{\prime}}{2}<\mu_{n}.\right.

It follows that in the entire domain (D)

|f(z)P(z)λQ(z)|MAX(μ",μn+μ2)<μn.|f(z)-P(z)-\lambda Q(z)|\leq\max\left(\mu^{\prime\prime},\frac{\mu_{n}+\mu^{\prime}}{2}\right)<\mu_{n}.

which contradicts the hypothesis thatP(z)\mathrm{P}(z)is a polynomialTn\mathrm{T}_{n}. The stated property is therefore proven.
50. - The uniqueness of the polynomial𝐓n\mathbf{T}_{n}From the previous property it immediately follows that:

A continuous functionf(z)f(z)admits a single polynomial of best approximation of degree n.

Let us assume the opposite and letP(z),P1(z)\mathrm{P}(z),\mathrm{P}_{1}(z)two polynomialsTn\mathrm{T}_{n}distinct. The polynomialP2=P+P12P_{2}=\frac{P+P_{1}}{2}is also a polynomialTnT_{n}, because

M(|fP2|12{M(|fP|)+M(|fP1|)}μnM\left(\left|f-P_{2}\right|\leq\frac{1}{2}\left\{M(|f-P|)+M\left(\left|f-P_{1}\right|\right)\right\}\leq\mu_{n}\right.

Whetherzz^{\prime}a point where

|f(z)P2(z)|=μn\left|f\left(z^{\prime}\right)-P_{2}\left(z^{\prime}\right)\right|=\mu_{n}

we

|f(z)P(z)|μn,|f(z)P1(z)|μn\left|f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right)\right|\leq\mu_{n},\quad\left|f\left(z^{\prime}\right)-\mathrm{P}_{1}\left(z^{\prime}\right)\right|\leq\mu_{n}

and

μn=|f(z)P(z)+(f(z)P(z))2||f(z)P(z)|+|f(z)P(z)|2μn\mu_{n}=\left|\frac{f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right)+\left(f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right)\right)}{2}\right|\leq\frac{\left|f(z)-\mathrm{P}\left(z^{\prime}\right)\right|+\left|f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right)\right|}{2}\leq\mu_{n}

It follows that we have the sign = everywhere. Then we must firstf(z)P(z),f(z)P1(z)f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right),f\left(z^{\prime}\right)-\mathrm{P}_{1}\left(z^{\prime}\right)to have the same modeμn\mu_{n}and then, the modulus of the sum being equal to the sum of the moduluses, it must have the same argument. We have so

f(z)P(z)\displaystyle f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right) =f(z)P(z)\displaystyle=f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right)
P(z)\displaystyle\mathrm{P}\left(z^{\prime}\right) =P(z)\displaystyle=\mathrm{P}\left(z^{\prime}\right)

Or, we saw in the previous No. that there is at leastn+1n+1(and even at leastn+2n+2) pointszz^{\prime}. Polynomials of degreen,P(z)n,\mathrm{P}(z)andP1(z)\mathrm{P}_{1}(z), coincide in at leastn+1n+1points and are therefore identical, contrary to the hypothesis. The theorem is proven.
51. - Mr. L. Tonelli's theorem. Mr. Tonelli found a theorem here too, analogous to Mr. Borel's first theorem.

Let E be the set of points\boldsymbol{z}^{\prime}whereM(|fP|)\mathrm{M}(|f-\mathrm{P}|)is reached. We have the property:

The necessary and sufficient condition forP(z)\mathrm{P}(z)to be a polynomialTnT{}^{\mathrm{T}}\mathrm{T}_{n}is that no polynomial can be foundQ(z)\mathrm{Q}(z), of degree n, - such that
10.c>|Q(z)|>c>0\quad c^{\prime}>\left|Q\left(z^{\prime}\right)\right|>c>0

20|silverQ(z)silver(f(z)P(z))|<α<90\quad\left|\arg\mathrm{Q}\left(z^{\prime}\right)-\arg\left(f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right)\right)\right|<\alpha^{\prime}<90^{\circ}
in all pointszz^{\prime}of E.
To show that this condition is sufficient, it is enough to show that, ifP(z)\mathrm{P}(z)is not a polynomialTn\mathrm{T}_{n}, we can construct the polynomialQzQ^{\prime}z).

Let us therefore suppose that

M(|fP|)=μ>μn,(P(z)Tn(z;f)).\mathrm{M}(|f-\mathrm{P}|)=\mu^{\prime}>\mu_{n},\quad\left(\mathrm{P}(z)\equiv\equiv\mathrm{T}_{n}(z;f)\right).

WhetherA,A1,A2\mathrm{A}_{,}\mathrm{A}_{1},\mathrm{~A}_{2}the points that representf(z),P(z),Tn(z;f)f\left(z^{\prime}\right),\mathrm{P}\left(z^{\prime}\right),\mathrm{T}_{n}\left(z^{\prime};f\right)The point.A2\mathrm{A}_{2}is in the circle with center A and radius equal toμn\mu_{n}.

-We

μμn|Tn(z;f)P(z)|μ+μn|silver(Tn(z;f)P(z))silver(f(z)P(z))|Arcsineμnμ<90n\begin{gathered}\mu^{\prime}-\mu_{n}\leq\left|\mathrm{T}_{n}\left(z^{\prime};f\right)-\mathrm{P}\left(z^{\prime}\right)\right|\leq\mu^{\prime}+\mu_{n}\\ \left|\arg\left(\mathrm{~T}_{n}\left(z^{\prime};f\right)-\mathrm{P}\left(z^{\prime}\right)\right)-\arg\left(f\left(z^{\prime}\right)-\mathrm{P}\left(z^{\prime}\right)\right)\right|\leq\operatorname{Arcsin}\frac{\mu_{n}}{\mu^{\prime}}<90^{n}\end{gathered}

So we can takeQ(z)=Tn(z;f)P(z)Q(z)=\mathrm{T}_{n}(z;f)-\mathrm{P}(z).
It remains to show the necessity of the condition.
Suppose there were a polynomialQ(z)Q(z)which satisfies the stated properties and eitherTn(z;f)\mathrm{T}_{n}(z;f)the best-fitting polynomial. We can assumec<μnc<\mu_{n}. For a given positive number,ε<μn\varepsilon<\mu_{n}, corresponds to another positive numberδ\deltaso that the oscillation of the functionsf(z)Tn(z;f)f(z)-\mathrm{T}_{n}(z;f),Q(z)Q(z)to be smaller thanε\varepsilon, in a circle of radiusδ\leq\delta. Then eitherφ(z)=f(z)Tn(z;f)λQ(z),λ\varphi(z)=f(z)-\mathrm{T}_{n}(z;f)-\lambda Q(z),\lambdabeing a positive number.

Let's take nowε\varepsilonsmall enough for us to have

ε<min(cmy90α2,μnmy15)\varepsilon<\min\left(c\sin\frac{90^{\circ}-\alpha^{\prime}}{2},\mu_{n}\sin 15^{\circ}\right)

And let us denote by E the projection of M on OD.
If we take

λ<μnCart(MODE)ε+γc2ε2¯\lambda<\frac{\mu_{n}\cos(\mathrm{MOD})}{\varepsilon+\gamma\overline{c^{2}-\varepsilon^{2}}}

domain (B) is completely inside the triangle MOE. On the other hand

f(z)Tn(z;f)λQ(z)<μnf(z)-\mathrm{T}_{n}(z;f)-\lambda Q(z)\mid<\mu_{n}

in the circles C . In the closed domain obtained by taking out the interior of the circles C we have

|f(z)Tn(z;f)λQ(z)|μ<μn,\left|f(z)-T_{n}(z;f)-\lambda Q(z)\right|\leq\mu^{\prime}<\mu_{n},

so

|f(z)Tn(z;f)λQ(z)|<μn\left|f(z)-T_{n}(z;f)-\lambda Q(z)\right|<\mu_{n}

everywhere, which is in contradiction with the fact thatTn(z;f)\mathrm{T}_{n}(z;f)is the best-fitting polynomial.

The property is therefore completely demonstrated.

1933, 1933-1934

Related Posts