The quasi-Newton method and the inexact Newton method are classical models of Newton iterates which take into account certain error terms. We have recently introduced the inexact perturbed Newton method, which takes into account all the possible sources of perturbations. We show that these models are in fact equivalent, in the sense that each one may be used to characterize the high convergence orders of the other two.
Authors
Emil Catinas, Tiberiu Popoviciu Institute of Numerical Analysis, Romanian Academy
E. Cătinaş, The relationship between the models of perturbed Newton iterations, with applications, PAMM, 5 (2005) no. 1, pp. 785-786, https://doi.org/10.1002/pamm.200510367
[1] E. Catinas, Inexact perturbed Newton methods and applications to a class of Krylov solvers, J. Optim. Theory Appl. 108 (2001),543–570.
[2] E. Catinas, The inexact, inexact perturbed and quasi-Newton methods are equivalent models, Math. Comp., 74 (2005) no. 249, pp.291–301.
[3] R. S. Dembo, S. C. Eisenstat and T. Steihaug, Inexact Newton methods, SIAM J. Numer. Anal. 19 (1982), 400–408.
[4] J. E. Dennis, Jr. and J. J. Mor´e, A characterization of superlinear convergence and its application to quasi-Newton methods, Math.Comp. 28 (1974), 549–560.
[5] E. M. Kasenally, GMBACK: a generalised minimum backward error algorithm for nonsymmetric linear systems, SIAM J. Sci. Comput.16 (1995), 698–719.
[6] E. M. Kasenally and V. Simoncini, Analysis of a minimum perturbation algorithm for nonsymmetric linear systems, SIAM J. Numer.Anal. 34 (1997), 48–66.
[7] C. T. Kelley, Iterative Methods for Linear and Nonlinear Equations, SIAM, Philadelphia, 1995.
[8] J. M. Ortega and W. C. Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, Academic Press, New York, 1970.
[9] F. A. Potra, Q-superlinear convergence of the iterates in primal-dual interior-point methods, Math. Progr., 91 no. 1, Ser. A, pp. 99–115,2001.[10] W. C. Rheinboldt, Methods for Solving Systems of Nonlinear Equations, Second ed., SIAM, Philadelphia, PA, 1998.
The relationship between the models of perturbed Newton iterations, with applications
Emil Cătinaş*"T. Popoviciu" Institute of Numerical Analysis, P.O. Box 68-1, Cluj-Napoca, Romania.
Abstract
The quasi-Newton method and the inexact Newton method are classical models of Newton iterates which take into account certain error terms. We have recently introduced the inexact perturbed Newton method, which takes into account all the possible sources of perturbations. We show that these models are in fact equivalent, in the sense that each one may be used to characterize the high convergence orders of the other two.
1 The equivalence of the inexact and quasi-Newton methods
Let F(x)=0F(x)=0 be a nonlinear system, with F:D subeR^(n)rarrR^(n)F: D \subseteq \mathbb{R}^{n} \rightarrow \mathbb{R}^{n}. The local convergence of the Newton iterates
is studied under the following usual conditions (which will be implicitly assumed hereafter):
there exists x^(**)in int Dx^{*} \in \operatorname{int} D such that F(x^(**))=0F\left(x^{*}\right)=0;
the mapping FF is Fréchet differentiable on int D\operatorname{int} D, with F^(')F^{\prime} continuous at x^(**)x^{*};
the Jacobian F^(')(x^(**))F^{\prime}\left(x^{*}\right) is invertible.
Given a norm ||*||\|\cdot\| on R^(n)\mathbb{R}^{n}, these hypotheses assure the existence of a radius r > 0r>0 such that the Newton iterates converge qq-superlinearly to x^(**)x^{*} for any initial approximation x_(0)x_{0} with ||x_(0)-x^(**)|| < r\left\|x_{0}-x^{*}\right\|<r [8, Th.10.2.2] (see also [10, Th.4.4]):
l i m s u p_(k rarr oo)(||x_(k+1)-x^(**)||)/(||x_(k)-x^(**)||)=0,quad(" assuming "x_(k)!=x^(**)" for all "k >= k_(0)),\limsup _{k \rightarrow \infty} \frac{\left\|x_{k+1}-x^{*}\right\|}{\left\|x_{k}-x^{*}\right\|}=0, \quad\left(\text { assuming } x_{k} \neq x^{*} \text { for all } k \geq k_{0}\right),
also denoted by ||x_(k+1)-x^(**)||=o(||x_(k)-x^(**)||)\left\|x_{k+1}-x^{*}\right\|=o\left(\left\|x_{k}-x^{*}\right\|\right), as k rarr ook \rightarrow \infty.
In the practical applications, there are used mainly two models of perturbed Newton iterates: the quasi-Newton (QN) and the inexact Newton (IN) method
The first model assumes the use of perturbed Jacobians, while the second assumes that at each step the correction s_(k)s_{k} is only approximately determined (i.e., having nonzero residual r_(k)r_{k} ). The superlinear convergence (as well as the qq-orders p in(1,2]p \in(1,2] ) of these methods was characterized by Dennis and Moré, respectively Dembo, Eisenstat and Steihaug.
Theorem 1.1 ([4]) Consider a sequence (B_(k))_(k >= 0)subR^(n xx n)\left(B_{k}\right)_{k \geq 0} \subset \mathbb{R}^{n \times n} of invertible matrices and an initial approximation x_(0)in Dx_{0} \in D. If the QN iterates converge to x^(**)x^{*}, then they converge superlinearly iff
{:(DM)(||(B_(k)-F^(')(x^(**)))(x_(k+1)-x_(k))||)/(||x_(k+1)-x_(k)||)rarr0","quad" as "k rarr oo.:}\begin{equation*}
\frac{\left\|\left(B_{k}-F^{\prime}\left(x^{*}\right)\right)\left(x_{k+1}-x_{k}\right)\right\|}{\left\|x_{k+1}-x_{k}\right\|} \rightarrow 0, \quad \text { as } k \rightarrow \infty . \tag{DM}
\end{equation*}
Theorem 1.2 ([3]) If the IN iterates converge to x^(**)x^{*}, then the convergence is superlinear iff
{:(DES)||r_(k)||=o(||F(x_(k))||)","quad" as "k rarr oo.:}\begin{equation*}
\left\|r_{k}\right\|=o\left(\left\|F\left(x_{k}\right)\right\|\right), \quad \text { as } k \rightarrow \infty . \tag{DES}
\end{equation*}
Denoting Delta_(k)=B_(k)-F^(')(x_(k))\Delta_{k}=B_{k}-F^{\prime}\left(x_{k}\right), the quasi-Newton iterates are transcribed as (QN'):
The following two results show the connection between the conditions for the superlinear convergence of the two methods.
Theorem 1.3 ([2]) If the QN ' iterates are written as IN iterates,
Previous results contained only sufficient conditions. The elements Delta_(k)inR^(n xx n)\Delta_{k} \in \mathbb{R}^{n \times n} represent perturbations to the Jacobians, delta_(k)inR^(n)\delta_{k} \in \mathbb{R}^{n} perturbations to the function evaluations, while hat(r)_(k)inR^(n)\hat{r}_{k} \in \mathbb{R}^{n} are the residuals of the approximate solutions s_(k)s_{k} of the resulted perturbed linear systems. The perturbation terms may contain errors resulted from floating point representation/arithmetical operations, so this model may be conceived to contain all sources of perturbations. In [1] we have assumed that the perturbed Jacobians are invertible. This condition may be relaxed, and one can assume just that the perturbed linear systems are compatible (i.e., the iterations are well defined):
Theorem 2.1 ([2]) If the IPN iterates are well defined and converge to x^(**)x^{*}, then the convergence is superlinear iff
||-Delta_(k)s_(k)+delta_(k)+ hat(r)_(k)||=o(||F(x_(k))||),quad" as "k rarr oo\left\|-\Delta_{k} s_{k}+\delta_{k}+\hat{r}_{k}\right\|=o\left(\left\|F\left(x_{k}\right)\right\|\right), \quad \text { as } k \rightarrow \infty
As shown in [2], the (DM) and (DES) conditions may be retrieved from the above one; on the other hand, the IPN iterates may be regarded as IN iterates, so it follows that the conditions characterizing the high convergence orders of the QN, IN and IPN methods are equivalent.
The above result can be successfully applied to characterize the convergence of some Newton-Krylov methods. Let
Au=b,quad A inR^(n xx n)" nonsingular, "b inR^(n),A u=b, \quad A \in \mathbb{R}^{n \times n} \text { nonsingular, } b \in \mathbb{R}^{n},
and consider an initial approximation u_(0)u_{0} to the exact solution u^(**)=A^(-1)bu^{*}=A^{-1} b. Denote K_(m):=span{r_(0),Ar_(0),dots,A^(m-1)r_(0)}\mathcal{K}_{m}:=\operatorname{span}\left\{r_{0}, A r_{0}, \ldots, A^{m-1} r_{0}\right\}, 1 <= m <= n1 \leq m \leq n, where r_(0):=b-Au_(0)r_{0}:=b-A u_{0}.
The GMBACK algorithm [5] finds u_(m)^(GB)inu_(0)+K_(m)u_{m}^{G B} \in u_{0}+\mathcal{K}_{m} as argmin for (i.e., minimizes the backward error in AA )
where ||*||_(F)\|\cdot\|_{F} denotes the Frobenius norm. Depending on the parameters, the above problem may have a unique, several ones, or no solution at all. Therefore, Theorem 1.1 is not the best choice (whereas Theorem 2.1 is) for characterizing the superlinear convergence of the Newton-GMBACK iterates, when written in the QN form
Acknowledgements I would like to express my gratitude to the organizers, who made possible my participation to the conference.
References
[1] E. Cătinaş, Inexact perturbed Newton methods and applications to a class of Krylov solvers, J. Optim. Theory Appl. 108 (2001), 543-570.
[2] E. Cătinaş, The inexact, inexact perturbed and quasi-Newton methods are equivalent models, Math. Comp., 74 (2005) no. 249, pp. 291-301.
[3] R. S. Dembo, S. C. Eisenstat and T. Steihaug, Inexact Newton methods, SIAM J. Numer. Anal. 19 (1982), 400-408.
[4] J. E. Dennis, Jr. and J. J. Moré, A characterization of superlinear convergence and its application to quasi-Newton methods, Math. Comp. 28 (1974), 549-560.
[5] E. M. Kasenally, GMBACK: a generalised minimum backward error algorithm for nonsymmetric linear systems, SIAM J. Sci. Comput. 16 (1995), 698-719.
[6] E. M. Kasenally and V. Simoncini, Analysis of a minimum perturbation algorithm for nonsymmetric linear systems, SIAM J. Numer. Anal. 34 (1997), 48-66.
[7] C. T. Kelley, Iterative Methods for Linear and Nonlinear Equations, SIAM, Philadelphia, 1995.
[8] J. M. Ortega and W. C. Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, Academic Press, New York, 1970.
[9] F. A. Potra, QQ-superlinear convergence of the iterates in primal-dual interior-point methods, Math. Progr., 91 no. 1, Ser. A, pp. 99-115, 2001.
[10] W. C. Rheinboldt, Methods for Solving Systems of Nonlinear Equations, Second ed., SIAM, Philadelphia, PA, 1998.