Given a nonlinear mapping \(G:D\subseteq \mathbb{R}^n\rightarrow \mathbb{R}^n\) differentiable at a fixed point \(x^\ast\), the Ostrowski theorem offers the sharp sufficient condition \[\rho(G′(x^\ast))<1\] for \(x^\ast\) to be an attraction point, where \(\rho\) denotes the spectral radius. However, no estimate for the size of an attraction ball is known.
We show in this note that such an estimate may be readily obtained in terms of \(\|G^\prime(x^\ast)\|<1\) (with \(\|\cdot \|\) an arbitrary given norm) and of the Hölder (in particular Lipschitz) continuity constant of \(G’\).
An elementary example shows that this estimate may be sharp. Our assumptions do not necessarily require \(G\) to be of contractive type on the whole estimated ball.
Authors
Emil Cătinaş
(Tiberiu Popoviciu Institute of Numerical Analysis, Romanian Academy)
Keywords
fixed point; attraction points; attraction ball; local convergence; convergence order.
Cite this paper as:
E. Cătinaş, Estimating the radius of an attraction ball, Appl. Math. Lett., 22 (2009) no. 5, pp. 712-714.
[1] J. M. Ortega and W. C. Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, Academic Press, New York (1970).
[2] A. M. Ostrowski, Solution of Equations and Systems of Equations, 2nd ed., Academic Press, New York (1966).
[3] W. C. Rheinboldt, Methods for Solving Systems of Nonlinear Equations, 2nd ed., SIAM, Philadelphia (1998).
[4] E. Zeidler, Nonlinear Functional Analysis and Its Applications, I. Fixed Point Theorems, Springer-Verlag, New York (1986).
[5] E. Catinas, On the superlinear convergence of the successive approximations method, J. Optim. Theory Appl. 113 473-485 (2002).
[6] F. A. Potra, On Q-order and R-order of convergence, J. Optim. Theory Appl. 63 415-431 (1989).
[7] F. A. Potra, Q-superlinear convergence of the iterates in primal-dual interior-point methods, Math. Progr. 91 1 ser. A 99-115 (2001).
[8] T. J. Ypma, Local convergence of inexact Newton methods, SIAM J. Numer. Anal. 21 583-590 (1984).
[9] E. Catinas, Inexact perturbed Newton methods and applications to a class of Krylov solvers, J. Optim. Theory Appl. 108 543-570 (2001).
[10] E. Catinas, The inexact, inexact perturbed and quasi-Newton methods are equivalent models, Math. Comp. 74 249 291-301 (2005).
[11] J. E. Dennis, Jr., and R. B. Schnabel, Numerical Methods for Unconstrained Optimization and Nonlinear Equations, Prentice-Hall Series in Computational Mathematics, Englewood Cliffs (1983).
[12] W. C. Rheinboldt, An adaptive continuation process for solving systems of nonlinear equations, Polish Academy of Science, Banach Ctr. Publ. 3 129-142 (1977).
[13] P. Deuflhard and F. A. Potra, Asymptotic mesh independence of Newton-Galerkin methods via a refined Mysovskii theorem, SIAM J. Numer. Anal. 29 1395-1412 (1992).
[14] I. K. Argyros, On the convergence and application of Newton’s method under weak Holder continuity assumptions, Int. J. Comp. Math. 80 6 767-780 (2003).
Paper (preprint) in HTML form
Catinas Estimating the radius of an attraction ball 2009
Estimating the radius of the attraction balls*
Emil Cătinaş ^(†){ }^{\dagger}
Abstract
Given a nonlinear mapping GG differentiable at a fixed point x^(**)x^{*}, the Ostrowski theorem offers the sharp sufficient condition rho(G^(')(x^(**))) < 1\rho\left(G^{\prime}\left(x^{*}\right)\right)<1 for x^(**)x^{*} to be an attraction point, where rho\rho denotes the spectral radius. However, no estimation for the size of an attraction ball is known.
We show in this note that such an estimate may be readily obtained in terms of ||G^(')(x^(**))|| < 1\left\|G^{\prime}\left(x^{*}\right)\right\|<1 (with ||*||\|\cdot\| an arbitrary given norm) and of the Hölder (in particular Lipschitz) continuity constant of G^(')G^{\prime}. An elementary example shows that this estimate may be sharp.
Our assumptions do not necessarily require GG to be of contractive-type on the whole estimated ball.
Let G:D subeR^(n)rarr DG: D \subseteq \mathbb{R}^{n} \rightarrow D be a nonlinear mapping which has a fixed point x^(**)in int Dx^{*} \in \operatorname{int} D :
x^(**)=G(x^(**)).x^{*}=G\left(x^{*}\right) .
This point is an attraction point [1, Def. 10.1.1] if, given a norm on R^(n)\mathbb{R}^{n}, there exists an open ball B_(r):=B_(r)(x^(**))={x inR^(n):||x-x^(**)|| < r}sube DB_{r}:=B_{r}\left(x^{*}\right)=\left\{x \in \mathbb{R}^{n}:\left\|x-x^{*}\right\|<r\right\} \subseteq D, such that for any initial approximation x_(0)inB_(r)x_{0} \in B_{r}, the successive approximations
remain in DD and converge to x^(**)x^{*}. Note that a finite number of iterates are allowed to lie outside B_(r)B_{r}.
The following result is well known.
Theorem 1 (Ostrowski) (see, e.g., [2, Th.22.1], [1, Th.10.1.3] and [3, Th.3.5]). If GG is differentiable at the fixed point x^(**)x^{*} and the spectral radius satisfies
then x^(**)x^{*} is an attraction point.
Remark 1 According to [1, N.R.10.1-2], this result holds also when instead of R^(n)\mathbb{R}^{n} one considers an arbitrary Banach space XX, with the observation that in defining the spectral radius of a linear continuous operator from XX to XX one takes its resolvent and spectrum (see, e.g., [4, p. 795]).
Condition (2) is sharp:
Example 1 [1, E.10.1.2]. The function G:RrarrR,G(x)=x+x^(3)G: \mathbb{R} \rightarrow \mathbb{R}, G(x)=x+x^{3}, is differentiable on R\mathbb{R}, and sigma=1\sigma=1 at the fixed point x^(**)=0x^{*}=0, which is not an attraction point.
The spectral radius also offers some global information regarding the convergence rate of all the sequences of the successive approximations converging toward x^(**)x^{*}, while the spectral elements of G^(')(x^(**))G^{\prime}\left(x^{*}\right) characterize the convergence rate of each individual such sequence. Indeed, sigma\sigma yields in fact the worst convergence rate among all the sequences converging to x^(**)x^{*} (see [1, Th.10.1.4] and [3, Th.3.5]), while the zero eigenvalue and its corresponding eigenvectors characterize the high convergence orders (more precisely, the qq-superlinear convergence and with qq-orders 1+p,p in(0,1])1+p, p \in(0,1]) of a single such sequence [5]. ^(1){ }^{1}
Theorem 1 says nothing about the size rr of the attraction ball. It is interesting to note that such an estimate can be deduced by applying some existing results. Indeed, under some additional differentiability assumptions on GG and denoting F(x)=x-G(x)F(x)=x-G(x), the successive approximations may be regarded as inexact Newton iterates for solving the nonlinear system F(x)=0[5]F(x)=0[5] :
Ypma [8] has obtained an estimation for the radius of attraction of this method, and we can deduce the corresponding one for the successive approximations.
However, instead of conditions in terms of Hölder continuity constant of I-G^(')I-G^{\prime} required by this approach, we obtain below (sharp) estimates in a direct manner, in terms of the Hölder continuity constant of G^(')G^{\prime}. Before doing that, we notice that the successive approximations may also be regarded as quasi-Newton iterates (or, more general, as inexactly solved perturbed Newton iterates [9]) [10], but not as (exact) Newton iterates; in the first case there do not exist estimates for the radius of the convergence, while in the second there exist (see, e.g., [11], [12], [13], [14]).
Theorem 2 Suppose there exist r_(1) > 0,p in(0,1],K_(p) > 0r_{1}>0, p \in(0,1], K_{p}>0 and a norm ||*||\|\cdot\| in R^(n)\mathbb{R}^{n} such that GG is differentiable on B_(r_(1))B_{r_{1}}, with G^(')G^{\prime} Hölder continuous at exponent pp :
||G^(')(x)-G^(')(y)|| <= K_(p)||x-y||^(p),quad AA x,y inB_(r_(1)).\left\|G^{\prime}(x)-G^{\prime}(y)\right\| \leq K_{p}\|x-y\|^{p}, \quad \forall x, y \in B_{r_{1}} .
Proof. The continuity hypothesis on G^(')G^{\prime} implies (see, e.g., [1, 3.2.12]):
||G(x)-G(x^(**))-G^(')(x^(**))(x-x^(**))|| <= (K_(p))/(1+p)||x-x^(**)||^(1+p),quad AA x inB_(r_(1)).\left\|G(x)-G\left(x^{*}\right)-G^{\prime}\left(x^{*}\right)\left(x-x^{*}\right)\right\| \leq \frac{K_{p}}{1+p}\left\|x-x^{*}\right\|^{1+p}, \quad \forall x \in B_{r_{1}} .
The key condition is t < 1t<1, which yields ||x_(0)-x^(**)|| < r_(2)\left\|x_{0}-x^{*}\right\|<r_{2}. The rest of the proof follows by induction.
Remark 2 a) The relationship between condition (2) ( sigma < 1\sigma<1 ) and the existence of a norm such that ( 4)(||G^(')(x^(**))|| < 1))\left(\left\|G^{\prime}\left(x^{*}\right)\right\|<1\right) holds is the following. Condition (4) implies (2), since the spectral radius satisfies
{:(7)rho(G^(')(x^(**))) <= ||G^(')(x^(**))||","quad" for any norm "||*||" on "R^(n).:}\begin{equation*}
\rho\left(G^{\prime}\left(x^{*}\right)\right) \leq\left\|G^{\prime}\left(x^{*}\right)\right\|, \quad \text { for any norm }\|\cdot\| \text { on } \mathbb{R}^{n} . \tag{7}
\end{equation*}
Condition (2) does not imply (4) in any norm, but for any epsi > 0\varepsilon>0 there exists a norm ||*||_((epsi))\|\cdot\|_{(\varepsilon)} on R^(n)\mathbb{R}^{n} such that (see, e.g., [1, 2.2.8], [2, Th.19.3]):
In the case of a Banach space instead of R^(n)\mathbb{R}^{n}, the statements regarding relations (7) and (8) remain true, with the remark that the involved norms must be equivalent to the initial one (see, e.g., [4, p. 795]). Therefore, the relationship between (2) and (4) remains the same.
Corollary 1 Under the hypotheses of Theorem 2, if G^(')G^{\prime} is Lipschitz continuous, i.e., Hölder continuous with p=1,L:=K_(1)p=1, L:=K_{1}, then instead of (5) one may take
and, correspondingly, t=(L)/(2)||x_(0)-x^(**)||+qt=\frac{L}{2}\left\|x_{0}-x^{*}\right\|+q.
Corollary 2 The assumptions of Theorem 2 imply the necessary condition
{:(10)||G^(')(x)|| < 1+p(1-q)","quad AA x inB_(r)",":}\begin{equation*}
\left\|G^{\prime}(x)\right\|<1+p(1-q), \quad \forall x \in B_{r}, \tag{10}
\end{equation*}
regardless of the value of the Hölder or Lipschitz continuity constant.
If G^(')G^{\prime} is Lipschitz continuous, then (10) becomes
{:(11)||G^(')(x)|| < 2-q","quad AA x inB_(r).:}\begin{equation*}
\left\|G^{\prime}(x)\right\|<2-q, \quad \forall x \in B_{r} . \tag{11}
\end{equation*}
Proof. The statement is obtained by taking into account the triangle inequality
The assumptions we have considered do not necessarily require contractivetype nonlinear mappings on the whole estimated ball, as the following example shows.
Example 2 Let G:RrarrR,G(x)=x^(2)G: \mathbb{R} \rightarrow \mathbb{R}, G(x)=x^{2}, having x^(**)=0x^{*}=0 as attraction point; r_(1) > 0r_{1}>0 may be chosen arbitrarily large, the derivative G^(')G^{\prime} is Lipschitz continuous on R\mathbb{R}, with L=2L=2. Since G^(')(0)=0G^{\prime}(0)=0, one obtains from (9) that r=r_(2)=1r=r_{2}=1. Moreover,
||G^(')(x)||=2,quad" for "||x-x^(**)||=r=1.\left\|G^{\prime}(x)\right\|=2, \quad \text { for }\left\|x-x^{*}\right\|=r=1 .
The above example also shows that estimates (9) and (11) are sharp.
Remark 3 We notice that the predicted radius r_(2)r_{2} may vary inverse proportionally with r_(1)r_{1}, since the Hölder continuity constant may increase with r_(1)r_{1}. In the previous example, if we take r_(1)=(1)/(2)r_{1}=\frac{1}{2} we get L=1L=1 and then r_(2)=2r_{2}=2, which is too large. Analogously, we can obtain too small values for r_(2)r_{2} if we take for instance G(x)=x^(3)G(x)=x^{3} and large values for r_(1)r_{1}.
References
[1] J. M. Ortega and W. C. Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, Academic Press, New York (1970).
[2] A. M. Ostrowski, Solution of Equations and Systems of Equations, 2nd ed., Academic Press, New York (1966).
[3] W. C. Rheinboldt, Methods for Solving Systems of Nonlinear Equations, 2nd ed., SIAM, Philadelphia (1998).
[4] E. Zeidler, Nonlinear Functional Analysis and Its Applications, I. FixedPoint Theorems, Springer-Verlag, New York (1986).
[5] E. Cătinas, On the superlinear convergence of the successive approximations method, J. Optim. Theory Appl. 113 473-485 (2002).
[6] F. A. Potra, On QQ-order and RR-order of convergence, J. Optim. Theory Appl. 63 415-431 (1989).
[7] F. A. Potra, QQ-superlinear convergence of the iterates in primal-dual interior-point methods, Math. Progr. 911 ser. A 99-115 (2001).
[8] T. J. Ypma, Local convergence of inexact Newton methods, SIAM J. Numer. Anal. 21 583-590 (1984).
[9] E. Cătinaş, Inexact perturbed Newton methods and applications to a class of Krylov solvers, J. Optim. Theory Appl. 108 543-570 (2001).
[10] E. Cătinaş, The inexact, inexact perturbed and quasi-Newton methods are equivalent models, Math. Comp. 74249 291-301 (2005).
[11] J. E. Dennis, Jr., and R. B. Schnabel, Numerical Methods for Unconstrained Optimization and Nonlinear Equations, Prentice-Hall Series in Computational Mathematics, Englewood Cliffs (1983).
[12] W. C. Rheinboldt, An adaptive continuation process for solving systems of nonlinear equations, Polish Academy of Science, Banach Ctr. Publ. 3 129-142 (1977).
[13] P. Deuflhard and F. A. Potra, Asymptotic mesh independence of NewtonGalerkin methods via a refined Mysovskii theorem, SIAM J. Numer. Anal. 29 1395-1412 (1992).
[14] I. K. Argyros, On the convergence and application of Newton's method under weak Hölder continuity assumptions, Int. J. Comp. Math. 806 767780 (2003).
*This work has been supported by MEdc under grant 2CEEX-06-11-96/19.09.2006. ^(†){ }^{\dagger} Romanian Academy, "T. Popoviciu" Institute of Numerical Analysis, P.O. Box 68-1, Cluj-Napoca, Romania, e-mail: ecatinas@ictp.acad.ro, url: http://www.ictp.acad.ro/catinas.
^(1){ }^{1} For the rigorous definitions of the convergence rates and for different relating results we refer the reader to [1, ch.9] and [6] (see also [3, ch.3] and [7]).