A Halley-Aitken type method for approximating the solutions of scalar equations

Ion Păvăloiu

1 Introduction

Let $f:\left[a,b\right]\rightarrow\mathbb{R},$ where $a,b\in\mathbb{R},$ $a<b$ , and suppose that $f$ has the first order derivative, which is positive: $f^{\prime}\left(x\right)>0,\forall x\in\left[a,b\right]$ . Consider the function $h:\left[a,b\right]\rightarrow\mathbb{R}$

(1.1)

h\left(x\right)=\frac{f\left(x\right)}{\sqrt{f^{\prime}\left(x\right)}}.

In [2] there is shown that the Halley method for solving

(1.2)

f\left(x\right)=0

is in fact the Newton method for solving (1.1). This method consists therefore in generating the sequence $\left(x_{n}\right)_{n\geq 0}$ by

(1.3)

x_{n+1}=x_{n}-\frac{h\left(x_{n}\right)}{h^{\prime}\left(x_{n}\right)},\;x_{0}% \in\left[a,b\right],\;n=0,1,...,.

The first and second order derivatives of $h$ are given by

(1.4)

h^{\prime}\left(x\right)=\frac{2\left[f^{\prime}\left(x\right)\right]^{2}-f^{% \prime\prime}\left(x\right)f\left(x\right)}{2\left[f^{\prime}\left(x\right)% \right]^{3/2}},\;x\in\left[a,b\right]

and

(1.5)

h^{\prime\prime}\left(x\right)=\frac{\left[3\left[f^{\prime\prime}\left(x% \right)\right]^{2}-2f^{\prime\prime\prime}\left(x\right)f^{\prime}\left(x% \right)\right]f\left(x\right)}{4\left[f^{\prime}\left(x\right)\right]^{5/2}},% \;x\in\left[a,b\right].

These relations imply

(1.6)

h^{\prime}\left(\overline{x}\right)=\left[f^{\prime}\left(\overline{x}\right)% \right]^{1/2}

and

(1.7)

h^{\prime\prime}\left(\overline{x}\right)=0

where $\overline{x}\in\left[a,b\right]$ denotes the solution of (1.2). As shown in [1], equality (1.7) characterizes the Halley method, in the sense that ensures its convergence order $3 .$ The authors of [4], analyzing an algorithm of Heron for approximating $\sqrt[3]{100}$ , give a general algorithm which can be used for approximating the cubic root of any real positive number.

In [7] it is shown that the algorithm from [4] is nothing else than the chord method applied to equation $h\left(x\right)=0$ where $h\left(x\right)=\frac{f\left(x\right)}{\sqrt{f^{\prime}\left(x\right)}}$ , with $f\left(x\right)=x^{3}-N$ . In this case the equation $h\left(x\right)=0$ has the form $x^{2}-\frac{N}{x}=0$ , when $N>0$ , $N\in\mathbb{R}.$

It is clear that between the Heron algorithm and the Halley method there exists a connection, in the sense that the transformed equation to which we apply the Newton or the chord method is the same. In [7] and [10], the authors study the convergence and error bounds for the Steffensen and Aitken-Steffensen methods applied to (1.1). In this note we shall study a variant of the Aitken-Steffensen method, which differs from those presented in [10] and [11].

We shall consider other two equations, equivalent to (1.2), having the form

(1.8)

x-\varphi_{1}\left(x\right)=0

and

(1.9)

x-\varphi_{2}\left(x\right)=0,

where $\varphi_{1},\varphi_{2}:\left[a,b\right]\rightarrow\left[a,b\right]$ will be convenably chosen.

We shall study the sequence $\left(x_{n}\right)_{n\geq 0}$ given by

(1.10)

x_{n+1}=\varphi_{1}\left(x_{n}\right)-\frac{h\left(\varphi_{1}\left(x_{n}% \right)\right)}{\left[\varphi_{1}\left(x_{n}\right),\varphi_{2}\left(\varphi_{% 1}\left(x_{n}\right)\right);h\right]},\;x_{0}\in\left[a,b\right],\;n=0,1,...\;.

We shall consider the following assumptions on $f,\varphi_{1}$ and $\varphi_{2}$ :

i. $f\in C^{4}\left[a,b\right];$

ii. equation (1.2) has a solution $\overline{x}\in\left(a,b\right);$

iii. $f^{\prime}\left(x\right)>0,$ $\forall x\in\left[a,b\right];$

iv. $\varphi_{1}$ obeys $0<\left[x,y;\varphi_{1}\right]<1,\forall x,y\in\left[a,b\right],$ where $\left[x,y;\varphi_{1}\right]$ denotes the first order divided difference of $\varphi_{1}$ on $x$ and $y$ :

\left[x,y;\varphi_{1}\right]=\left(\varphi_{1}\left(y\right)-\varphi_{1}\left(% x\right)\right)/\left(y-x\right);

v. $\varphi_{2}$ obeys $-1<\left[x,y;\varphi_{2}\right]<0,\;\forall x,y\in\left[a,b\right].$

2 The local convergence and error bounds

We shall use the following identities:

(2.1)		$\displaystyle\varphi_{1}\left(x_{n}\right)-\frac{h\left(\varphi_{1}\left(x_{n}% \right)\right)}{\left[\varphi_{1}\left(x_{n}\right),\varphi_{2}\left(\varphi_{% 1}\left(x_{n}\right)\right);h\right]}$	$\displaystyle=\varphi_{2}\left(\varphi_{1}\left(x_{n}\right)\right)-\frac{h% \left(\varphi_{2}\left(\varphi_{1}\left(x_{n}\right)\right)\right)}{\left[% \varphi_{1}\left(x_{n}\right),\varphi_{2}\left(\varphi_{1}\left(x_{n}\right)% \right);h\right]}$
	$\displaystyle n$	$\displaystyle=0,1,...,\;.$

and also the Newton identity

(2.2)

h\left(x\right)=h\left(y\right)+\left[y,z;h\right]\left(x-y\right)+\left[x,y,z% ;h\right]\left(x-y\right)\left(x-z\right)

where $\left[x,y,z;h\right]$ is the second order divided difference of $h$ on $x, y, z .$

We notice that equality (1.6) and hypothesis i. ensure the existence of $\alpha,\beta\in\mathbb{R},$ $a\leq\alpha<\overline{x}<\beta\leq b$ such that $h^{\prime}\left(x\right)>0$ $\forall x\in\left[\alpha,\beta\right]$ .

The following theorem holds:

Theorem 2.1

Let $\left[\alpha,\beta\right]\subseteq\left[a,b\right]$ be such that $h^{\prime}\left(x\right)>0$ $\forall x\in\left[\alpha,\beta\right]$ . If the functions $f,\varphi_{1},\varphi_{2}$ and the initial approximation $x_{0}$ satisfy:
a) $x_{0}\in\left[\alpha,\beta\right]$ can be chosen such that $\varphi_{1}\left(x_{0}\right)\in\left[\alpha,\beta\right]$ and $\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)\in\left[\alpha,\beta% \right];$
b) the hypotheses i-v an satisfied.
Then the following properties are true:

j. for all $n\in\mathbb{N}$ we have

\left|x_{n+1}-\overline{x}\right|\leq\max\left\{\left|x_{n+1}-\varphi_{1}\left% (x_{n}\right)\right|,\left|x_{n+1}-\varphi_{2}\left(\varphi_{1}\left(x_{n}% \right)\right)\right|\right\};

jj. there exists $k>0$ , $k\in\mathbb{R}$ , which does not depend on $n$ , such that

\left|x_{n+1}-\overline{x}\right|\leq k\left|x_{n}-\overline{x}\right|^{3},\;% \forall n\in\mathbb{N};

jjj. if $x_{0}$ is close enough to $\overline{x}$ to obey $\sqrt{k}\left|\overline{x}-x_{0}\right|<1,$ then the sequences $\left(x_{n}\right)_{n\geq 0},$ $\left(\varphi_{1}\left(x_{n}\right)\right)_{n\geq 0}$ and $\left(\varphi_{2}\left(\varphi_{1}\left(x_{n}\right)\right)\right)_{n\geq 0}$ converge to their common limit $\overline{x}$ .

Proof. We shall analyses two cases.

I. $x_{0}<\overline{x}.$ Then $\varphi_{1}\left(x_{0}\right)-\overline{x}=\varphi_{1}\left(x_{0}\right)-% \varphi_{1}\left(\overline{x}\right)=\left[x_{0},\overline{x};\varphi_{1}% \right]\left(x_{0}-\overline{x}\right)<0,$ i.e., $\varphi_{1}\left(x_{0}\right)<\overline{x}$ . Denote $\Psi\left(x\right)=x-\varphi_{1}\left(x\right),$ and so $\Psi\left(x_{0}\right)-\Psi\left(\overline{x}\right)=\left[x_{0},\overline{x};% \Psi\right]\left(x_{0}-\overline{x}\right)=\left[1-\left[x_{0},\overline{x},% \varphi_{1}\right]\right]\left(x_{0}-\overline{x}\right)<0$ , i.e. $x_{0}<\varphi_{1}\left(x_{0}\right)$ . Now we show that $\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)>\overline{x}.$ From $\varphi_{1}\left(x_{0}\right)<\overline{x}$ it follows $\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)-\overline{x}=\varphi_{2}% \left(\varphi_{1}\left(x_{0}\right)\right)-\varphi_{2}\left(\overline{x}\right% )=\left[\overline{x},\varphi_{1}\left(x_{0}\right);\varphi_{2}\right]\left(% \varphi_{1}\left(x_{0}\right)-\overline{x}\right)>0,$ i.e. $\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)>\overline{x}.$ Next, we show that $x_{1}\in\left[\varphi_{1}\left(x_{0}\right),\varphi_{2}\left(\varphi_{1}\left(% x_{0}\right)\right)\right]$ , where $x_{1}$ is obtained from (1.10) for $n=0$ . Since $h^{\prime}\left(x\right)>0,\forall x\in\left[\alpha,\beta\right]$ , and $\varphi_{1}\left(x_{0}\right)\in\left[\alpha,\beta\right]$ , we get that $h\left(\varphi_{1}\left(x_{0}\right)\right)<0$ (we know that $\varphi_{1}\left(x_{0}\right)<\overline{x}$ ) and so $x_{1}$ satisfies $x_{1}>\varphi_{1}\left(x_{0}\right)$ . We have used the fact that $h^{\prime}\left(x\right)>0\;\forall x\in\left[\alpha,\beta\right]$ implies $\left[\varphi_{1}\left(x_{0}\right),\varphi_{2}\left(\varphi_{1}\left(x_{0}% \right)\right);h\right]>0.$ Now we show that $x_{1}<\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)$ . This inequality follows from $\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)>\overline{x}$ , $h\left(\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)\right)>0$ and from (2.1) for $n=0$ . we have shown that

(2.3)

x_{0}<\varphi_{1}\left(x_{0}\right)<\overline{x}<\varphi_{2}\left(\varphi_{1}% \left(x_{0}\right)\right)

and

(2.4)

x_{1}\in\left(\varphi_{1}\left(x_{0}\right),\varphi_{2}\left(\varphi_{1}\left(% x_{0}\right)\right)\right).

II. $x_{0}>\overline{x}$ . Similarly to the above reason, we get that

(2.5)

x_{0}>\varphi_{1}\left(x_{0}\right)>\overline{x}>\varphi_{2}\left(\varphi_{1}% \left(x_{0}\right)\right)

and

(2.6)

x_{1}\in\left(\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right),\varphi_{1% }\left(x_{0}\right)\right).

Denoting by $I_{0}$ the open interval determined by $\varphi_{1}\left(x_{0}\right)$ and $\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)$ , then obviously relations (2.3) - (2.6) may be sinthetized as

x_{1},\overline{x}\in I_{0}.

It can be easily seen that if we denote by $I_{1}$ the open interval determined by $\varphi_{1}\left(x_{1}\right)$ and $\varphi_{2}\left(\varphi_{1}\left(x_{1}\right)\right)$ then get

I_{1}\subset I_{0}

and

x_{2},\overline{x}\in I_{1}

where $x_{2}$ is obtained from (1.10) for $n=1$ .

Let $I_{n}$ be the open interval determined by $\varphi_{1}\left(x_{n}\right)$ and $\varphi_{2}\left(\varphi_{1}\left(x_{n}\right)\right)$ for some $n\in\mathbb{N}$ . Then repeating the above reason, we may show that

(2.7)

\overline{x},x_{n+1}\in I_{n}

and

I_{n+1}\subset I_{n},

where $I_{n+1}$ is determined by $\varphi_{1}\left(x_{n+1}\right)$ and $\varphi_{2}\left(\varphi_{1}\left(x_{n+1}\right)\right)$ . From the above reason and from (2.7) it follows j., which yields an error bound bound for each iteration step.

For jj. using identity (2.2) we get

	$\displaystyle h\left(\overline{x}\right)$	$\displaystyle=h(\varphi_{1}\left(x_{n}\right))+\left[\varphi_{1}\left(x_{n}% \right),\varphi_{2}\left(\varphi_{1}\left(x_{n}\right)\right);h\right]\left(% \overline{x}-\varphi_{1}\left(x_{n}\right)\right)+$
		$\displaystyle\left[\varphi_{1}\left(x_{n}\right),\varphi_{2}\left(\varphi_{1}% \left(x_{n}\right)\right),\overline{x};h\right]\left(\overline{x}-\varphi_{1}% \left(x_{n}\right)\right)\left(\overline{x}-\varphi_{2}\left(\varphi_{1}\left(% x_{n}\right)\right)\right)$

whence, taking into account (1.10) and $h\left(\overline{x}\right)=0,$ we get

(2.8)

\overline{x}-x_{n+1}=\frac{\left[\varphi_{1}\left(x_{n}\right),\varphi_{2}% \left(\varphi_{1}\left(x_{n}\right)\right),\overline{x};h\right]}{\left[% \varphi_{1}\left(x_{n}\right),\varphi_{2}\left(\varphi_{1}\left(x_{n}\right)% \right);h\right]}\left(\overline{x}-\varphi_{1}\left(x_{n}\right)\right)\left(% \overline{x}-\varphi_{2}\left(\varphi_{1}\left(x_{n}\right)\right)\right).

The mean value formulae for the divided differences lead us to

(2.9)

\left[\varphi_{1}\left(x_{n}\right),\varphi_{2}\left(x_{n}\right);h\right]=h^{% \prime}\left(\theta_{n}\right),\theta_{n}\in I_{n}

and

\left[\varphi_{1}\left(x_{n}\right),\varphi_{2}\left(\varphi_{1}\left(x_{n}% \right)\right),\overline{x};h\right]=\frac{h^{\prime\prime}\left(\eta_{n}% \right)}{2},\;\eta_{n}\in I_{n}.

From i. and using the Lagrange formula it follows

(2.10)

h^{\prime\prime}\left(\eta_{n}\right)=h^{\prime\prime}\left(\eta_{n}\right)-h^% {\prime\prime}\left(\overline{x}\right)=h^{\prime\prime\prime}\left(\xi_{n}% \right)\left(\eta_{n}-\overline{x}\right),\;\eta_{n}\in I_{n}.

Denoting

m_{1}=\inf\limits_{x\in\left[\alpha,\beta\right]}\left|h^{\prime}\left(x\right% )\right|

and

M_{1}=\sup\limits_{x\in\left[\alpha,\beta\right]}\left|h^{\prime\prime\prime}% \left(x\right)\right|,

from (2.8) and taking into account (2.9) and (2.10) we get

\left|\overline{x}-x_{n+1}\right|\leq\frac{M_{1}}{2m_{1}}\left|\overline{x}-% \varphi_{1}\left(x_{n}\right)\right|\left|\overline{x}-\varphi_{2}\left(% \varphi_{1}\left(x_{n}\right)\right)\right|\left|\overline{x}-\eta_{n}\right|.

The property jj. follows easily by denoting $k=\frac{M_{1}}{2m_{1}}$ and taking into account iv, v, and the fact that $\eta_{n}\in I_{n}.$

Property jjj is an immediate consequence of j and jj.

3 Determining the functions $\varphi_{1}$ and $\varphi_{2}$

we shall present a modality of choosing $\varphi_{1}$ and $\varphi_{2}$ in order to obey the assumptions of Theorem 2.1

Suppose that $f$ is strictly convex on $\left[a,b\right]$ , i.e. $f^{\prime\prime}\left(x\right)>0,\forall x\in\left[a,b\right]$ . This assumption, together with $f^{\prime}\left(x\right)>0$ $\forall x\in\left[a,b\right]$ , lead, by (1.4) to $h^{\prime}\left(x\right)>0\;\forall x\in\left[a,\overline{x}\right]$ . Relation (1.4) again and $f^{\prime}\left(x\right)>0$ and $f\left(\overline{x}\right)=0$ imply the existence of $\beta$ , $\overline{x}<\beta\leq b$ such that $h^{\prime}\left(x\right)>0,\,\forall x\in\left[\overline{x},\beta\right].$ These hypotheses ensure the existence of an interval $\left[\alpha,\beta\right]$ for which $h^{\prime}\left(x\right)>0,\,\forall x\in\left[\alpha,\beta\right]$ . Since $f^{\prime\prime}\left(x\right)>0$ it follows that $f^{\prime}\left(x\right)$ is increasing on $\left[a,b\right]$ .

Taking

\varphi_{1}\left(x\right)=x-\frac{1}{\mu}f\left(x\right)

and

\varphi_{2}\left(x\right)=x-\frac{1}{\lambda}f\left(x\right),

with $\mu\geq f_{s}^{\prime}\left(b\right)$ and $0<\lambda\leq f_{d}^{\prime}\left(a\right),$ and assuming that $0<f^{\prime}\left(x\right)<2\lambda,\forall x\in\left[a,b\right]$ , then the functions $\varphi_{1}$ and $\varphi_{2}$ defined above obey hypotheses iv and v of Theorem 2.1. For $a\leq x_{0}\leq\overline{x}$ in Theorem 2.1, hypothesis $\varphi_{1}\left(x_{0}\right)\in\left[\alpha,\beta\right]$ is automatically verified, but the assumption $\varphi_{2}\left(\varphi_{1}\left(x_{0}\right)\right)\in\left[\alpha,\beta\right]$ must be kept.

References

[1] Ben-Israel, A., Newton’s method with modified functions, Contemp. Math., 204, pp. 39–50, 1997.
[2] Brown, G. H., Jr., On Halley’s variation of Newton’s method, Amer. Math. Monthly, 84, pp. 726–728, 1977.
[3] Candela, V. and Marquina, A., Recurrence relations for rational cubic methods I: The Halley’s method, Computing, 44, pp. 169–184, 1990.
[4] Deslauries, G. and Dubuc, S., Le calcul de la racine cubique selon H´eron, El. Math., 51, pp. 28–34, 1996.
[5] Ford, W. F. and Pennline, J. A., Accelerated convergence in Newton method, SIAM Rev., 38, pp. 658–659, 1996.
[6] Gerlach, J., Accelerated convergence in Newton’s method, SIAM Rev., 36, pp. 272–276, 1994.
[7] Luca, D. and Pǎvǎloiu, I., On the Heron’s method for the approximation of the cubic root of a real number, Rev. Anal. Numér. Théor. Approx., 28, pp. 103–108, 1997.
[8] Melman, A., Geometry and convergence of Euler’s and Halley’s methods, SIAM Rev., 39, pp. 728–735, 1997.
[9] Ostrowski, A. M., The Solution of Equations and Systems of Equations, Academic Press, New York–London, 1960.
[10] Pǎvǎloiu, I., On the monotonicity of the sequences of approximations obtained by Steffensen method, Mathematica (Cluj), 35 (58), pp. 171–76, 1993.
[11] Pǎvǎloiu, I., Approximation of the roots of equations by Aitken–Steffensen-type monotonic sequences, Calcolo, 32, pp. 69–82, 1995.
[12] Pǎvǎloiu, I., On a Halley–Steffensen method for approximating the solutions of scalar equations, Rev. Anal. Numér. Théor. Approx., 30, N ${}^{\text{o}}$ .1, 2001, pp.69-74.
[13] Pǎvǎloiu, I., On Some Aitken–Steffensen-Halley-Type Methods for Approximating the Roots of Scalar Equations, Rev. Anal. Numér. Théor. Approx., 30, N ${}^{\text{o}}$ .2, 2001, to appear.
[14] Popoviciu, T., Sur la délimitation de l’erreur dans l’approximation des racines d’une équation par interpolation linéaire ou quadratique, Rev. Roumaine Math. Pures Appl., 13, pp. 75–78, 1968.

A Halley-Aitken-type method for approximating the solutions of scalar equations

Abstract

Authors

Keywords

PDF

Cite this paper as:

About this paper

Journal

Publisher Name

JSTOR permalink

Print ISSN

Online ISSN

References

Paper (preprint) in HTML form

A Halley-Aitken type method for approximating the solutions of scalar equations

1 Introduction

2 The local convergence and error bounds

Theorem 2.1

3 Determining the functions $\varphi_{1}$ and $\varphi_{2}$

References

Related Posts

A Halley-Aitken-type method for approximating the solutions of scalar equations

Abstract

Authors

Keywords

PDF

Cite this paper as:

About this paper

Journal

Publisher Name

JSTOR permalink

Print ISSN

Online ISSN

References

Paper (preprint) in HTML form

A Halley-Aitken type method for approximating the solutions of scalar equations

1 Introduction

2 The local convergence and error bounds

Theorem 2.1

3 Determining the functions φ1 and φ2

References

Related Posts

3 Determining the functions $\varphi_{1}$ and $\varphi_{2}$