Abstract
Let \(x_k\rightarrow x^\ast\in{\mathbb R}^N\); denote the errors by \( e_k: = \|x^\ast – x_k\| \) and the quotient factors by \(Q_p(k)=\frac{e_{k+1}}{{e_k}^p}\) (\(p \geq 1\)).
Consider the following definitions of Q-orders \(p_0>1\) of \(\{x_k\}\):
\begin{equation}\tag{$Q_\varepsilon$}\label{def Qe}
\begin{cases}
Q_p=0, & p \in [1,p_0),
\\
Q_p=+\infty, & p \in (p_0,+\infty)
\end{cases}
\Leftrightarrow
\begin{cases}
Q_{p_0-\varepsilon}=0, & \forall \varepsilon >0, p_0-\varepsilon>1,
\\
Q_{p_0+\varepsilon}=+\infty, & \forall \varepsilon >0
\end{cases}
\end{equation}
\begin{equation}\tag{$Q$}\label{def Q}
q_l=q_u, {\rm where}
\begin{cases}
q_l=\inf \big\{p\in[1,+\infty) : \bar{Q}_p=+\infty\big\}, \quad \bar{Q}_p=\limsup\limits_{k\rightarrow\infty}Q(k)
\\
q_u=\sup \big\{p\in[1,+\infty) : \underset{\bar{}}{Q}{}_p=0\big\}, \quad \underset{\bar{}}{Q}{}_p=\liminf\limits_{k\rightarrow\infty}Q(k)
\end{cases}
\end{equation}
\begin{equation}\tag{$Q_L$}\label{def Q_L}
\lim\limits_{k\rightarrow \infty}
\frac{\ln e_{k+1}}{\ln e_k}=p_0
\end{equation}
\(\forall \varepsilon>0, \exists A,B\geq 0, \ k_0 \geq 0\) s.t.
\begin{equation}\tag{$Q_{I,\varepsilon}$}\label{f. def Q_eps}
A\cdot e_k ^{p_{0}+\varepsilon}\leq e_{k+1} \leq B\cdot e_k^{p_{0}-\varepsilon},\ \; \; \forall k\geq k_0
\end{equation}
\begin{equation}\tag{$Q_\Lambda$}\label{def Q_Lam}
\lim\limits_{k\rightarrow \infty}
\frac{\ln \frac{e_{k+2}}{e_{k+1}}}{\ln \frac{e_{k+1}}{e_k}}=p_0.
\end{equation}
Twenty years after the classical book of Ortega and Rheinboldt was published, the above definitions for the Q-orders were independently and rigorously studied (i.e., some orders characterized in terms of others), by Potra (1989), resp. Beyer, Ebanks and Qualls (1990). The relationship between all the five definitions (only partially analyzed in each of the two papers) was not subsequently followed and, moreover, the second paper slept from the readers attention.
The main aim of this paper is to provide a rigorous, selfcontained, and, as much as possible, a comprehensive picture of the theoretical aspects of this topic, as the current literature has taken away the credit from authors who obtained important results long ago.
Moreover, this paper provides rigorous support for the numerical examples recently presented in an increasing number of papers, where the authors check the convergence orders of different iterative methods for solving nonlinear (systems of) equations.
Authors
Emil Cătinaş
(Tiberiu Popoviciu Institute of Numerical Analysis, Romanian Academy)
Keywords
Cite this paper as
E. Cătinaş, A survey on the high convergence orders and computational convergence orders of sequences, Appl. Math. Comput., 343 (2019) 1-20.
doi: 10.1016/j.amc.2018.08.006
About this paper
Print ISSN
0096-3003
Online ISSN
Google Scholar citations
Errata
The sequence \(z_k\) from Example 2.8 actually does not have Q-order \(2\):
\[
z_k = \begin{cases} 2^{-2^k/k}, & k \text{ odd}\\ 2^{-k2^k}, & k \text{ even.} \end{cases}
\]
Post version
November 11, 2018: post created.
A survey on the high convergence orders and computational convergence orders of sequences
Abstract
Twenty years after the classical book of Ortega and Rheinboldt was published, five definitions for the -convergence orders of sequences were independently and rigorously studied (i.e., some orders characterized in terms of others), by Potra (1989), resp. Beyer, Ebanks and Qualls (1990). The relationship between all the five definitions (only partially analyzed in each of the two papers) was not subsequently followed and, moreover, the second paper slept from the readers attention.
The main aim of this paper is to provide a rigorous, selfcontained, and, as much as possible, a comprehensive picture of the theoretical aspects of this topic, as the current literature has taken away the credit from authors who obtained important results long ago.
Moreover, this paper provides rigorous support for the numerical examples recently presented in an increasing number of papers, where the authors check the convergence orders of different iterative methods for solving nonlinear (systems of) equations. Tight connections between some asymptotic quantities defined by theoretical and computational elements are shown to hold.
Keywords. convergent sequences in ; Q-, C-, and R-convergence orders of sequences; computational convergence orders.
MSC 2010. 65J05.
author address: Tiberiu Popoviciu Institute of Numerical Analysis, Romanian Academy, P.O. Box 68-1, Cluj-Napoca, Romania. e-mail: ecatinas@ictp.acad.ro, www.ictp.acad.ro/catinas
1 Introduction
Consider in the beginning a sequence from , which converges to a finite limit . Its speed of convergence is characterized by several measures, called convergence orders, which are fundamental notions in Mathematical and in Numerical Analysis.
The relation
(1) |
gives the classical definition of the convergence with order , and, in a less rigorous but sufficiently intuitive statement, it can be traced back in 1818, to Fourier 26. Even deeper traces may be found in history: in a letter of Newton dated in 1675, it appears he was aware of the rough doubling of the number of correct significant digits in one step (characteristic to the quadratic convergence) of the iterative method which bears his name 57.
The existence of the nonzero value is a very restrictive condition not only in theory, but also in examples from practice – not to mention that in the multidimensional setting it is a norm-dependent problem. Moreover, as it requires the knowledge of both and , relation (1) is only of theoretical use.
Still, such definition (along with the -superlinear convergence, which requires for ) is extremely important in studying the local convergence of the main iterative methods for nonlinear problems, either of Newton type (see, e.g., 4, 32, 28, 62, 53, 19, 41, 55, 11, 14, 49), or the more general successive approximations (see, e.g., 4, (32, ch. 10), 62, 13, 46) – which, under some supplementary assumptions (including smoothness) may be regarded in fact as equally general (see 13). This definition is also important in the field of the matrix function theory (polar decomposition, matrix sign function, computing the matrix inverse by Schulz or different iterative methods), in a setting where the matrix is an element of a normed space (see, e.g., 43, 15); we should also mention the acceleration of the convergence of sequences (see, e.g., 6), and perhaps other fields too.
From a numerical standpoint, the aim is to define the so-called computational convergence orders, that iteratively approximate the convergence orders using only the elements of the sequence known at the step . To this end (or just for theoretical purposes) new definitions have been introduced and studied. We attempt to follow the origin of each notion in the following section; this is a difficult task, as most of the papers treat the convergence orders as a collateral topic; no survey has been written since two monographs containing entire chapters devoted to the rigorous analysis of the - and -convergence orders. The book of Ortega and Rheinboldt 32, though first published in 1970, is a standard reference in the field of iterative methods and their convergence orders (see also 62); it was republished in 2000 in the Classics in Applied Mathematics series of SIAM. The book of Potra and Pták (1984) 19 contains further, complementary results.
Two important papers were subsequently published, independently and close in time, where the equivalence of different definitions of (computational) - and -convergence orders were given. The complementary results of these two works allow us to form a complete, simplified and rigorous picture of the topic. The first paper was written by Potra (1989) 16; as of November 2017, Scholar Google reported 96 citations of it, which shows that the mathematicians are aware of it. The other paper however, written by Beyer, Ebanks and Qualls (1990) 60, is almost unknown to the mathematical community: as of November 2017, we have found only 6 citations in Google Scholar (out of which two papers in Chinese, two papers in the field of Bioarchaeology, one paper solving an unsolved problem from 60, and one paper dealing with the quadratic convergence in period doubling for trapezoid maps).
Unfortunately, the fact that the paper of Beyer, Ebanks and Qualls remained unknown, allowed some authors to (honestly) reinvent such notions, e.g., in 54 and 1. The very cited paper 54111As of September 2017, a search on the Internet using Google Scholar for (the exact string, quotation marks included) ”computational order of convergence” returned 673 papers, out of which 101 were published in 2016. has the distinctive feature that not only the definition of the computational convergence order proposed there is not original (nor proved, nor in fact entirely computational - as it requires ), but even the iterative method itself (as it was given in the well known book of J.F. Traub (30, formula (8-14), p. 164)222We owe this information to a previous referee, to whom we thank for improving the presentation of the introduction.).
In numerous works, the classical - and -orders defined by Ortega and Rheiboldt lead to statements expressed as ” has -order (at least) and -order (at least) , ”. The -order we study here is a completion to the classical one, and no longer allows such conclusions. Instead, if having at all an order , the sequences will be only in one of the cases (the -order is obtained by taking norms in (1)):
-
a)
has -order (but neither - nor -order);
-
b)
has the same - and -order (but no -order);
-
c)
has the same -, - and -order .
In less words, the relation reads for as
with no converse implication holding in general.
Nonetheless, the comments of Tapia, Dennis Jr. and Schäfermeyer (49, p. 49) remain true: “The distinction between - and -convergence is quite meaningful and useful and is essentially sacred to workers in the area of computational optimization. However, for reasons not well understood, computational scientists who are not computational optimizers seem to be at best only tangentially aware of the distinction.”
The structure of this paper is the following: in Section 2 we review, in chronological order, the definitions of convergence orders. In Section 3 we analyze the high convergence orders (), giving full proofs for - and -orders. We present first the relation (implication, resp. equivalence) between the introduced convergence orders; the results are based on the proofs from 16 and 60, which we can simplify for some cases. Then we review the main properties of the -orders. We analyze next the computational - and -convergence orders based on the corrections (the approach was dealt with in 60, but the authors assumed real and monotone sequences in treating certain cases; a result of Potra and Pták 19 and the Dennis-Moré lemma 28 allow us to complete the proofs). We also consider computational convergence orders based on the nonlinear residuals , when solving nonlinear systems of equations having the same number of equations and unknowns. The section ends with a summary of all the relations obtained, illustrated by a diagram. Finally, in Section 4 we recall, without proofs, the relationships between some convergence orders in the linear case. Some conclusions are presented in the end of the paper.
2 Definitions - a brief historical review
2.1 C- and Q-type (computational) convergence orders.
2.1.1 Convergence orders based on the errors x*-xk
We turn now our attention to a sequence from , which converges to a finite limit ; we prefer the common setting of a normed space— endowed with a given norm —though most of the results from this paper can be presented in a more general setting, of a metric space. We assume that . Since this leads to dealing with positive numbers, we can simplify the notations and consider further only the errors (called such as a short for ”norms of errors”)
As we have mentioned, the oldest definition of convergence with order is that
(2) |
which we call, as in 60, -convergence with order (the linear convergence requires and ) – comes from quotient. If it exists, it is uniquely defined and it implies that such that
(3) |
or, less sharp, the existence of such that
(4) |
Remark 2.1
a) For example, if and is not too large, relation (3) says that the error is approximately squared at each step , for sufficiently large.
c) An inequality of the form (4) still holds if the limit in (2) does not exist, but instead and are finite and nonzero. Denoting
(5) |
relation
(6) |
implies (4): attracts the first inequality in (4), while the second one (see also (32, 9.3.3)).
d) As widely known, the larger the order , the faster the speed of convergence.
The -order is a norm-dependent notion, as the following two examples show for the -orders and .
Example 2.2
35 Let , the sequence
(7) |
Consider the norms
It follows for the -norm, while does not exist for the -norm.
Example 2.3
Let and
(8) |
One obtains when considering the maximum norm , while does not exist for the norm .
The following definition for the convergence order seems to be the second oldest one (we adopt the notation from 60). The sequence has -convergence order if:
(9) |
Remark 2.4
a) As noticed by Brezinski in 8 and 9, is a particular case of a definition used by Bourbaki 42 in order to compare the convergence orders of two sequences.
The measure was then independently considered by Wall in 1956 10 (see also 34, (32, ch. 9), (52, ch. 2) and 20), argued – for instead of – as the limit of the quotient of two consecutive numbers of correct decimal places in resp. (see also (19, p. 90)); in this sense, convergence for example with -order means intuitively that, from a certain step, the number of correct digits are doubled the successive elements of . The logarithm base may be taken as any positive real number ; we shall consider further the natural base for simplicity.
b) In contrast to , the expression does not require the (usually unknown) value of the order, but provides an approximation of it. This may be advantageous in certain situations when evaluating the order, as it turned out, e.g., in the books of Wright (56, p. 252 and Ch. 7), Ye (63, p. 210, p. 226) and in two papers of Potra 18 and 17 (where the authors deal with , a quantity which we analyze in Proposition 3.3).
c) Brent, Winograd and Wolfe noticed in 51 that -order immediately implies -order, of same value.
In 1967, Feldstein and Firestone 3 introduced essentially the following convergence order, which we denote here by .
The sequence has -convergence order if:
(10) | ||||
Condition will be a standing assumption from here on, in all subsequent formulas, while in Section 3 we will implicitely assume that .
The convergence with -convergence order can be expressed in a more intuitive form (see also 60):
Two years later, the same authors 2 expressed the above order in an equivalent form, which in fact will be the same as the -order defined below. Let
(11) | ||||
They noticed that , and when , the sequence was said to have the order .
In their classical book from 1970, Ortega and Rheinboldt (32, Ch. 9) introduced and studied the quantity denoted in (5) by for all :
They shown the following behavior for as a ”function” of .
The quantity
(12) |
(called in 32 as -convergence with order at least and denoted by O) stood there and in many subsequent works for a measure of the convergence order. We call it, as in 60, the lower -order, and denote it by (instead of used there). We shall see that the lower -order needs to be completed by the upper -order for obtaining the ”full” -order; as a matter of fact, , the lower order from (11) defined in 2.
Remark 2.6
The sequence was said in (32, Def. 9.1.4) that converges faster than if for some we have (it is interesting to note that if these two upper bounds are finite and nonzero, the inequality may revert when changing the norm (32, p. 285)). However, in the case of the even stronger relation
numerical evidence reveals that does not necessarily converge faster than , as shown by Ypma and Igarashi 40.
On the other hand, as noticed by Brezinski 9, the sequences given by and , , have the same asymptotical constant, , though they converge with quite different speeds.
Some further comments on this topic will be made in Remark 2.11.
In 1979, Schwetlick (20, B.4.2.3) defined the -convergence order in the following way, which will be used throughout the paper. Let the lower -order be given in (12), recall the notation from (5)
and define the upper -order by
Then the -convergence is with order if This definition is equivalent to the one given by Feldstein and Firestone in (11), as and .
In 1981, Schmidt 33 introduced the following type of convergence order, denoted here, as in 60, by : has -convergence order if
(13) |
The author gave no result relating the - and -orders he introduced, and assumed that is equivalent to an -order (while it is in fact a -order, as we shall see).
In 1982, Beyer and Stein 61 independently introduced the measure (19) below, which is somehow similar to , and in fact equivalent to defined in subsubsection 2.1.2.
In 1985, unaware of 33 and 61, Brezinski 9 considered the -convergence order and, assuming that has the exact -order of convergence (as defined by (4)), he proved that has the -order (and -order as well).
In 1989, Potra introduced in 16 the following type of convergence order (denoted here by , with ”” from ”inequality”), which will be useful in obtaining some simplified proofs.
The sequence has -convergence order if such that
(14) |
This can be regarded as a generalization of inequalities (4).
Potra shown some fundamental results in our study, that -convergence with order is equivalent to - and to -convergence of same order, which we denote here, as in 60, by symbols between curled braces:
Ortega and Rheinboldt (32, N.R. 9.2.2) have previously noticed a weaker property of -convergence with order , namely that it implies (partial) -convergence with order (i.e., in the notations below, that ).
In the same paper, Potra has also considered the lower and upper -orders, when noting that if a sequence has exact -order then .
In 1990, Beyer, Ebanks and Qualls 60, unaware of the definitions and the results mentioned above, introduced the definitions of the - and -orders and shown some other fundamental results, that convergence with -order is equivalent to and to -convergence with same order:
They considered inspired by the similar measure (19), as we have already mentioned.
We end the historical remarks by noting that the references 45, 5 and 36 are cited in certain works as containing aspects referring to convergence orders, but we were not able to consult them.
Lemma 3.1 from Section 3 shows that (which justifies the terminology lower/upper); when this inequality is strict, the sequence does not have a -order, as shown below.
Example 2.7
Let
We get
The graphical illustration of and for leads to the so called convergence profile of the sequence, as termed in 60 (see Fig. 1).
This sequence does not have a -order, as , but it converges with (exact) -order , as defined later. We note the classical statement in this case, ” converges with -order (at least) and with -order (at least) ”, which no longer holds in the setting of this paper.
An elementary example shows that in case of convergence with -order apart of finite nonzero values of one (or both) of the asymptotical constants , any of the following relations may hold:
(15) | ||||
(16) | ||||
(17) |
Example 2.8
Let
It is easy to verify that for the sequences verify respectively (15), (16) and (17) ( was also considered in 60).
Schwetlick (20, Ü.4.2.3, p. 93) also considered an example of the type , , for the errors converging with exact -order .
One may easily find examples when and is finite, nonzero, or, on the other hand, when is finite, nonzero and . Jay 35 considered the sequence , odd, , even, for , , for which one obtains and .
Further examples, to be analyzed either elementary or with the aid of the results from this paper, were given by Potra and Pták (19, p. 93): , , , and also , , .
Remark 2.9
Despite the fact that conditions (15)–(17) may seem rather abstract (or, at least, suitable for scholar examples), they do occur in relevant situations from theory or practice. For a nontrivial illustration, we refer the reader to a computational optimization problem (56, p. 252 and Ch. 7), where one has and .
Remark 2.10
Remark 2.11
Returning to the comments from Remark 2.6, we note that the comparison of the convergence of two sequences , having errors resp. is important in particular in the field of the acceleration of the convergence of sequences, when sequences with (at most) linear convergence are considered; here it is said that converges faster than if (see, e.g., 6):
(18) |
Now we see that this definition makes sense not only if the -orders of the two sequences are different, but even if they are equal, as the above condition is norm-independent.
To show this, we consider the terminology proposed by Jay 35 for a sequence converging with -order , who defined that the convergence is with:
-
•
-suborder if ,
-
•
-superorder if .
The last terminology is in fact widely used - recall the superquadratic convergence (for ), supercubic, etc.).
Now we see that for sequences with -order we can briefly say that:
- •
-
•
exact -convergence is faster than -subconvergence;
-
•
(consequently) -superconvergence is faster than -subconvergence.
This can be illustrated by the following sequences having -order : (-superquadratic), (exact -order ), (-subquadratic).
2.1.2 Computational convergence orders, based on the corrections xk+1-xk and the nonlinear residuals F(xk)
We consider now some quantities which do not require the limit , for which we adopt the terminology computational convergence orders. As some of them require the knowledge of the order itself, we keep in mind that perhaps a more proper terminology for those would be semi-computational.
As a notation, for the corresponding convergence orders based on corrections we shall add a prime mark (e.g., ) as in 60, while for those based on nonlinear residuals, two prime marks (e.g., ).
Corrections.
We introduce another notation, for the corrections (including their norms too):
In 1974 Dennis and Moré 28 obtained a result which shows that if a sequence converges -superlinearly, then the errors and corrections converge precisely at the same rate (Lemma 3.14 from subsubsection 3.2.1).
In 1984, Potra and Pták 19 shown that a sequence converges -superlinearly iff the corrections converge -superlinearly (Proposition 3.13); in 1997 Walker 21 gave a different proof, also presented later.
In 1982, Beyer and Stein 61 considered, for sequences in , the quantity
(19) |
equivalent to defined below.
Three years later, Brezinski 9 proposed the use of the corrections instead of the (unknown) errors in the definitions of the convergence orders studied there ( and exact -order).
In 1990, Beyer, Ebanks and Qualls 60, taking the same approach, considered
with the resulted definitions being denoted correspondingly by and ; we use here the same convention for denoting . These authors proved the following fundamental results: for , the errors and corrections converge simultaneously to zero with the same convergence order, and therefore each convergence order is equivalent to its corresponding computational one. The following extended relation resulted:
However, the considered assumptions were very strong when proving and ””: was assumed as a monotone sequence from (see (60, Sect. 4)). We shall see that for the general setting, some results from 19 and 28 may be used instead to obtain simplified proofs.
In 2010, Grau-Sánchez, Noguera and Gutiérrez 38, while studying connections between - and -orders (in the case of a sequence from with -order ) considered the ”extrapolated convergence order”
(20) |
where . This convergence order was obtained from by replacing the unknown limit with an extrapolation given by the Aitken method (in order to use only the terms known at the step ). However, this convergence order is valid only in the one dimensional case, since the Aitken acceleration process does not have a direct extension to .
We do not pursue this definition as, for high convergence orders, it is equivalent to .
In 2012, Grau-Sánchez, Noguera, Grau and Herrero 37 considered another extrapolated convergence order
(21) |
for a sequence from , having -order . Assuming -order, they shown some connections to the convergence orders (22) and (23) below.
In 38 and 37 the authors used the technique of asymptotic expansions in order to relate different convergence orders of , and for sequences from . However, this approach could not be extended to the case of several dimensions.
Remark 2.12
It is important to keep in mind that the -, -, -orders are semi-computational orders, as they require the (supposed) unknown order, while the logarithm-based - and -orders are (fully) computational orders, i.e., they require neither the limit nor the order in their expressions.
As certain numerical experiments (see 39 and the references therein) have not revealed a clear superiority of over , the later seems to be the most convenient to use in practice (and in theory too: see (56, p. 252 and Ch. 7), (63, p. 210, p. 226), 18 and 17). Moreover, Propositin 3.3 and Theorem 3.15 will show, in terms we will define later, that and .
Nonlinear residuals.
Other computational convergence orders were considered in the context of solving nonlinear systems of equations , , with solution . For simplicity, we shall denote (resp. when ).
In 1981, Păvăloiu 25 defined the exact convergence order corresponding to (4) by
while in 1995, he introduced in 22 the expression
(22) |
and shown its equivalence to . Also, while extending his results from 1999 24, he introduced in an unpublished manuscript 23 (posted on his website) the measure
(23) |
in connection with the above ones. However, since the manuscript was not published as of 2011, the credit of first publishing this measure goes to Petković, who mentioned in passing the above expression in a note published in SIAM J. Numer. Anal. (2011).
Several computational aspects regarding these quantities were subsequently analyzed (see, e.g., 39 and the references therein).
2.2 R-convergence orders.
The root convergence order (-order) requires weaker conditions than the - and -orders, but unfortunately at the price of loosing its theoretical importance and its practical applications. As we have quoted in the Introduction, “The distinction between - and -convergence is […] essentially sacred to workers in the area of computational optimization” (49, p. 49), meaning that the -order is a much less powerful notion.
In Example 3.11 is presented a sequence for which, depending on the parameters, the lower -order is arbitrary close to from above, while the (exact) -order is arbitrary high (and even higher is the upper -order ).
We shall review certain results, but not treat all of them in the same detail as we will for the - and -orders.
In 1970, Ortega and Rheinboldt (32, 9.2) introduced and studied the root convergence factors, which we denote here by :
Unlike the quotient factors, the root factors do not relate each two consecutive terms, but consider some averaged asymptotic quantities.
They proved a Proposition similar to Proposition 2.5, but here takes the role of for .
The lower -order, which we denote here by (instead of in (32, 9.2)), defined by
stood in 32 and in many subsequent works as the definition of the root convergence order.
Remark 2.14
b) The value arises in the analysis of the speed of convergence of sequences with the errors satisfying different recurrence inequalities.
The simplest ones appear in usual circumstances for the secant method (see 4, 41, and also 51):
and they yield the well-known -order . It is important to note that the secant method then attains the same value also for the lower -order (see 4, 41).
For the more general inequalities
Schmidt 33, Burmeister and Schmidt 58, 59 resp. Potra and Pták (19, p. 107) determined in a series of works the lower -order .
It is interesting to note that for such inequalities, Herzberger and Metzner 27 and then Potra 16 gave certain sufficient conditions for the lower bound . Potra 16 has also noted that the above inequalities do not necessarily attract lower order greater than one: take
(24) |
with . This sequence satisfies (which immediately attracts and ), but in fact , (and , as we shall see).
Certain iterative methods have lead to some different inequalities, of the type (32, Th. 9.2.9.)
c) As noted in (32, N.R. 9.2.1), the factor has been used implicitly in much work concerning iterative processes for linear systems of equations, see, e.g., Varga 47. For nonlinear systems, it was used explicitly by Ortega and Rockoff 31.
d) Some connections between the errors and their majorizing sequences were made by Dennis Jr. in 29 (see also (49, p. 49)). Potra 16 has noticed that the lower -order is consistent with the natural ordering of sequences, while the lower -order is not: given with errors , resp. , if and has order then so has ; however, this statement does not hold for the lower -order , as one may take from Example 2.7 and . A more disturbing example was given by Jay 35, for and
(25) |
At the first sight, since , the speed of seems higher than of the (exact) -quadratic , but though, does not attain even -linear convergence, as (we shall see that in this case the upper -order is ). Another example was given in (24) above.
In 1973, Brent, Winograd and Wolfe 51 considered the following expressions, which we denote here by
(26) | ||||
(in 52 Brent considered in the same year only ). The sequence was said that converges with -order if , i.e., in a single condition,
(27) |
They noted that the -order easily implies -order .
In 1979, Schwetlick 20 considered the upper -order
and (in the notations of this paper)
he defined the convergence with -order if
Similarly to the case of the -order, the sequence is said that converges -superlinearly if
Remark 2.15
In 1989, Potra 16 has introduced and studied some further measures for the -convergence order, which we denote here by , resp. :
-
•
converges with -order if
-
•
converges with -order if and such that
The exact -order of convergence was also defined here – analogously to the definition of the exact -order (4) – as being if , such that
(28) |
we see that this definition can be immediately connected to the following one, analogous to (6):
(29) |
Potra has also considered the lower and upper -orders, when noting that if a sequence has exact -order then .
Comparing the list of - and -orders, the definition of was not given before, so we consider it here, by denoting the expressions
and by defining the -order when , i.e., when
(30) |
3 Results relating the high (computational) convergence orders
In this section we consider and present the relationships of different definitions of the (computational) convergence orders, with full proofs in the case of the - and -orders.
3.1 Relationships between the convergence orders
The equivalences and implications mentioned in the previous section can be summarized as:
At this point we can simply merge the equivalences from the two statements above and write the conclusion.
However, as we intend to provide a self-contained material, we include the proofs of the equivalence of the -orders, some of them much simplified.
3.1.1 C- and Q-convergence orders
We shall need the three auxiliary results below.
Lemma 3.1
(60, lemma 2.1)
- (i)
-
If then for all , and so
- (ii)
-
if then for all and so
- (iii)
-
if then
- (iv)
-
if then .
Relations (iii) and (iv) show that and justify the terminology ”lower” and ”upper”.
Proof. 60 For (i), if and then
For (iii), suppose and choose By definition of then, . Now, by (i),
The other statements are proved in a similar manner.
Remark 3.2
Potra obtained the following general result, interesting in its own, which will help us relating some quantities to their corresponding computational versions.
Proposition 3.3
Proof. 16 Let
Then for any with , there exists such that
We may assume that , and we obtain
so that
This proves that and, by Lemma 3.1 (i), we get
Suppose that
It follows by Lemma 3.1 (iii) that so there is such that i.e.,
But then
which leads to
which is a contradiction. Hence, we have proved that
Analogously, implies
Lemma 3.4
Proof. 60 For some some sufficiently small, such that Hence, the numbers all have the same sign . If they were positive, then we would have , contradictory to the convergence of Therefore, they are all negative and
We present now the result relating the - and -orders, and we choose to incorporate Proposition 3.3.
Theorem 3.5
Proof. We give the proof in five steps: A. , B. , C. , D. and E.
A1 (). The simplest approach is (also mentioned in 51) to take logarithms in (2). The proof in 60 used an approach similar to Lemma 3.1.
B. The relation is obvious.
C1 () Let such that Since , it follows that exists such that i.e., the second inequality in the definition of The first inequality is obtained in the same way.
The first inequality (valid for all ) implies and again by Lemma 3.1 (ii), with
The assertion follows since can be taken arbitrarily small.
D. (16, Prop. 1.1) Using the above result, we get and vice versa.
E1 () 60 We notice that, by B and D, we have that , so it suffices to show that .
Considering now instead of we obtain similarly that
(34) |
We prove that the monotone sequence is unbounded: Otherwise, if s.t. for all i.e., then according to Lemma 3.1 (i), this implies in contradiction to the monotony from (34).
Similarly, one can prove that for any with we have for all sufficiently large and we get
It is interesting to notice that -convergence with order implies However, the reverse is not true (i.e., -superlinear convergence does not imply -convergence with order ), as the following examples show.
Example 3.6
a) (32, E 10.1.4) (we follow here (19, p. 94)) Let and consider the sequence
which converges superlinearly to Suppose there exists such that converges with -order (and therefore with order) Then with s.t. Hence,
for all sufficiently large, which contradicts the well known limit , for any . As , we get .
Remark 3.7
We notice that the orders do not have elements to define the strict -superlinear convergence, when and .
The -superlinear convergence implies -superlinear convergence, as in any norm (32, 9.3.1)
In fact, is a lower bound for in all norms from .
It is interesting to see that sequences with infinite -order exist too, as one can take (from the list of exercises in (32, E.9.2.1)).
Remark 3.8
An important aspect we address now is the problem of changing the norms, as we have seen in Example 2.3 that for a given sequence in the existence of the -order is norm-dependent. Ortega and Rheinboldt (32, 9.1.6) proved that the three relations , finite, are norm-independent. The same hold for , as one can easily verify (using also Lemma 3.1), which shows the norm-independence of the -order. By the equivalence from Theorem 3.5, we conclude that the rest of the -orders are norm independent too. This assertion will be further completed by the computational convergence orders, as we shall see in the following subsection.
It is important to recall that for sequences with -order , while this -order is norm-independent, the values of the quantities and however, if finite, are norm-dependent (32, E.9.1-1).
Remark 3.9
One can easily show that the sequence converges with -order iff one of the the following sequences converges with the same order: , or ( arbitrary, given).
3.1.2 R-convergence orders
In 1989 Potra 16 stated the equivalence , which can be easily proved.
In fact we may extend Theorem 3.5 and the above relations. For brevity, we choose to write as a generic form for all the five equivalent -type orders, . The same will be written in the sequel for , , , and .
Theorem 3.10
(cf. 16) In the assumptions of Theorem 3.5, we have:
(36) |
(with the equivalence of requiring the additional assumption ).
Moreover,
(37) |
Proof. (cf. 16) We notice first that -order implies -order (and therefore , as mentioned above); this is obtained from the proofs of Propositions 9.3.2 and 9.3.3 in 32 and taking into account Proposition 2.13 above. A simpler approach results by considering directly (37), with .
The converse ”” is false, even if has exact -convergence order (i.e., (28) or (29) hold), as shown in Examples 2.7 and 3.11. One can easily show however that if in (28), this attracts convergence with -order .
The equivalence of the -order to the , and therefore all remaining -orders, is based on the relation
We note that there exist sequences (see, e.g., (25)), for which , and which can therefore be studied for the -orders.
Relation (37) – which is a completion of the one stated and proved by Potra 16 for the lower orders – can be obtained again from the well known inequalities for positive numbers
The following example shows that one may find sequences with -order () arbitrary high and arbitrary close to 1 from above.
Example 3.11
16 Given any numbers we construct a sequence which has the exact -convergence order and for which Let be given, and take with such that Define
The sequence has exact -convergence order On the other hand, for even we obtain
so that
In fact, one obtains while (, this value representing the exact -order).
This example is also suitable for justifying the mentioned comments from 49.
In (37), any inequality may be strict. Now we see (e.g., Examples 2.7 and 3.11) that the inner inequality may be equality (i.e., obtain -order) while one of the outer inequalities may be strict (i.e., no -order).
This also shows that the sequences with -order and -superlinear convergence (see Example 3.6) cannot have (upper) -order greater than one, i.e., they converge strict -superlinearly. The proof is based on relation (see 32) and on (37).
Remark 3.12
We notice that the -order is also a norm independent notion, as it can be proved by some results of Ortega and Rheinboldt (32, 9.2.2).
3.2 Results relating the computational convergence orders
In this section, the errors are replaced either by the corrections or by the nonlinear residuals , when solving nonlinear systems having the same number of equations and unknowns. We shall assume that and
We present full proofs only for the - and -convergence orders.
3.2.1 Corrections
-, -orders and corrections
As we have mentioned, Beyer, Ebanks and Qualls 60 shown that, for , the errors and corrections converge simultaneously to zero with the same - resp. -type convergence order, and therefore each convergence order is equivalent to its corresponding computational one. The following relation resulted:
(38) |
However, for proving and the considered assumptions were very strong: was considered as a monotone sequence from (see (60, Sect. 4)).
We shall extend relation (38) and show some simplified proofs. For that, we first present two auxiliary results.
Proposition 3.13
(Potra-Pták Lemma) (19, Prop. 6.4) The sequence converges -superlinearly to if and only if the sequence converges -superlinearly to zero.
Proof. Necessity. 19 Suppose converges -superlinearly. Then there exists a positive integer such that
It follows that for , we have:
from which we deduce the -superlinear convergence of the sequence
Sufficiency. 19 If we suppose that the sequence converges -superlinearly, then there exists a positive integer such that
For any we have:
Finally, writing
we infer that converges -superlinearly.
For necessity, an alternative proof in the one dimensional case can be obtained using the Brezinski’s ”l’Hospital’s rule for sequences” (see 8).
The necessity was proved in the general case even before, by Dennis and Moré.
Lemma 3.14
(Dennis-Moré) 28 If the sequence converges -superlinearly then
The assumption of this Lemma also implies that converges -superlinearly to zero too.
The converse of this result does not hold, as an example given by the same authors shows 28: , , .
Another proof of the Potra-Pták lemma was independently given by Walker 21, in 1997.
Proof of Potra-Pták lemma. Necessity 21: the Dennis-Moré Lemma.
Sufficiency. 21 Suppose , -superlinearly. Then
For each , define . Note that and for each . Then
and it follows that -superlinearly.
We are now able to present the main result of this subsubsection, which shows not only equivalence of -type resp. -type orders, but also that the lower and upper -orders are limits of some (truly) computational quantities.
Theorem 3.15
(cf. 60) In the assumptions of Theorem 3.5 and notations above, the following (generic) relations hold:
Moreover,
(39) | ||||
(40) |
Proof. The proof will consist of two steps. At step A, we prove , while at step B we prove the equivalence of the -type and -type orders. Relations (39) and (40) follow along the way.
A1 () Suppose converges with -order Then converges -superlinearly, and by the Potra-Pták and Dennis-Moré Lemmas, we get
(41) |
which shows that converges with -order
A2 () A2 is proved similarly to A1.
B1 ( ) Suppose converges with some -type order Then converges with -order and therefore -superlinearly; by the Potra-Pták and Dennis-Moré Lemmas, it follows that
(42) |
i.e., converges with -order One can view the previous statement from a different viewpoint: the sequence converges with -order so by the results in Subsection 3.1, it converges with any -type order , whence the conclusion.
B2 (-type -type order) The reverse implication is proved similarly.
Remark 3.16
a) Relation (40) shows in particular that if has an exact -order of convergence , then the sequence of corrections has not only the same -order, but even the same asymptotic constants.
b) We note that the norm-independence of the -type orders attract the same property for the -type orders.
-orders and corrections
Potra and Pták 19 have obtained some fundamental results in this direction, which can be easily completed:
Theorem 3.17
The following three results show that the asymptotic constants from the -convergence orders of the (errors of a) sequence and of its corrections are not so tightly connected as in the case of convergence with -order (relation (40)).
Corollary 3.18
(19, p. 111) If and have exact -orders of convergence, then they must have the same exact -order of convergence.
Proposition 3.19
(19, p. 111) If has an exact -order of convergence this does not imply that has an exact -order of convergence, and vice versa.
Example 3.20
(19, p. 111) Let be three real numbers with and set
It is easy to see that is the exact -order of convergence of the sequences and while the sequences and have no exact -order of convergence.
Remark 3.21
One can easily show that the sequence converges with -order iff one of the the following sequences converges with the same order: or ( arbitrary, given). The same holds for , but under the supplementary assumption that .
3.2.2 Nonlinear residuals - relationships
We consider now the solving of a system of nonlinear equations for a nonlinear mapping , having a solution . The quantities - in different formulae - were used not only to control the convergence of a given sequence to , but also to determine at each step the next element, (see, e.g., the well known works of Dembo, Eisenstat and Steihaug 53, Eisenstat and Waker 55 and some extensions we have obtained in 11, 14).
We assume here that
A) is differentiable on the open domain
B) continuous at
C) is invertible at
Replacing the errors by the corresponding residuals, we denote
and similarly, the resulted definitions for the - and all the rest of -type orders.
-, -orders and nonlinear residuals
The following example shows that neither of and implies the other.
Example 3.22
a) Consider , with
and the (linear) mapping , . One can see that conditions A)–C) are satisfied, and further, has -order , while does not have -order (in the max-norm).
b) Let , with
and similarly the mapping . Now has no -order, while has -order (in the max-norm).
One can also view Example 2.3 in the sense that if one takes as the identity operator, , the sequence from (8) has -order , while does not have -order.
Before the result relating the orders, we recall (without proof) an auxiliary result, which is presented in different standard monographs and papers.
Lemma 3.23
53 Assume the mapping is differentiable on the open set , the derivative continuous at the solution , with the Jacobian nonsingular at : . Let , where . Then there exists such that for any with , one has
We notice that . The sharpest bounds seem to be obtained by Rheinboldt (62, Th.4.2): given , there exists s.t. with we have
We obtain the final result from this paper, which shows in fact the equivalence of all -type orders.
Theorem 3.24
Under the assumptions of Theorem 3.5, A)–C) and with the above notations, the following (generic) relations hold:
Proof. Assume converges with some -type order, say -order Then, by Lemma 3.23, we have
which shows that and further the equivalence of the -type orders.
The implication follows in a similar manner.
The -convergence orders are norm-independent, being equivalent to the -orders.
-orders and nonlinear residuals
3.3 Summary of the high (computational) convergence orders
We present the following diagram in order to have a complete picture on the implications/equivalences of the high orders . As in 60, we use the symbol ”” for the conjunction of and , while the equivalence of the -type orders requires the additional assumption .
4 Linear convergence
The -linear convergence is less attractive in numerical applications, but remains an important theoretical topic (e.g., in the field of Dynamical Systems, as noted in 60).
In contrast to the high convergence orders , in the linear case there are fewer equivalences and more negative results, illustrated by counterexamples. Beyer, Ebanks and Qualls 60 did most of the work in providing them, in 60, which we also summarize in the diagram below:
However, it is worth noting that different relations (e.g., ) were obtained under the strong assumption that is a monotone sequence from
Zhang and Wang 48 completed the diagram with two implications. The first one, was proved again under the monotony assumption on the sequence of real numbers. The second one, , was proved under some even further supplementary assumptions (we mark this by dashed line in the diagram): and , where , , . The following completed diagram resulted.
5 Conclusions
As the completion for the definition of the -order of Beyer, Ebanks and Qualls remained unknown, most of the results on the -orders of different iterative methods were given only in terms of the lower -order . Ortega and Rheinboldt used and to measure the speed of convergence of some ”entire” iterative processes both for fixed point problems (the Ostrowski local attraction theorem (32, Th. 10.1.3, Th. 10.1.6, Th. 10.1.7)) and for nonlinear systems of equations (the Newton attraction theorem (32, Th. 10.2.2)). Further results for characterizing the lower bounds for a single sequence (instead of the whole method) were obtained by us 13, 12 – in the case of (perturbed) successive approximations – respectively by Dembo, Eisenstat and Steihaug 53 and us 11, 13, in the case of different Newton-type iterations in presence of error sources.
It is interesting to note that results on the upper -order exist too. One can find them (where else?) in the book of Ortega and Rheinboldt, both for the successive approximations (32, Th.10.1.7), and for the Newton method (32, Th.10.2.2). It seems interesting to clarify the conditions ensuring the upper -order for individual sequences.
The further consideration only of the lower -order may remain a powerful theoretical tool in measuring the convergence rate of sequences. In the case when a -order exists (), the use of the computational quotients may offer approximations to , while otherwise (when ) the use of the computational quotients may hopefully offer just an idea about the value of (and perhaps too).
Acknowledgments. The author is grateful to an anonymous referee for the careful reading of the manuscript and for several constructive remarks, that improved the presentation (specially in Section 3).
References
- [1] A. Cordero, J.R. Torregrosa, Variants of Newton’s method using fifth-order quadrature formulas, Appl. Math. Comput., 190 (2007) no. 1, pp. 686–698.. Cited by: §1.
- [2] A. Feldstein, R.M. Firestone, A study of Ostrowski efficiency for composite iteration algorithms, Proceedings of the 1969 24th national conference, ACM, 1969.. Cited by: §2.1.1, §2.1.1.
- [3] A. Feldstein, R.M. Firestone, Hermite interpolatory iteration theory and parallel numerical theory, Report, Division of Applied Mathematics, Brown University, 1967.. Cited by: §2.1.1.
- [4] A. Ostrowski, Solution of Equations and Systems of Equations, Academic Press, New York, 1960 (2nd ed. 1966). 3rd edition published as Solution of Equations in Euclidean and Banach Spaces, Academic Press, 1973, New York and London.. Cited by: §1, Remark 2.14.
- [5] A.I. Cohen, Rate of convergence and optimality properties of root finding and optimization algorithms, Doctoral dissertation, University of California, Berkeley, 1970.. Cited by: §2.1.1.
- [6] C. Brezinski, Accélération de la Convergence en Analyse Numérique, Springer-Verlag, Berlin, 1977.. Cited by: §1, Remark 2.1, Remark 2.11.
- [7] C. Brezinski, Comparaison des suites convergences, Revue Française d’Informatique et de Recherche Operationelle, 5 (1971) no. 2, pp. 95–99.. Cited by: Remark 2.1.
- [8] C. Brezinski, Limiting relationships and comparison theorems for sequences, Rend. Circolo Mat. Palermo, Ser. II, vol. 28 (1979), pp. 273–280.. Cited by: Remark 2.4, §3.2.1.
- [9] C. Brezinski, Vitesse de convergence d’une suite, Rev. Roumaine Math. Pures Appl., 30 (1985), pp. 403–417.. Cited by: §2.1.1, §2.1.2, Remark 2.1, Remark 2.4, Remark 2.6, Example 3.6.
- [10] D.D. Wall, The order of an iteration formula, Math. Comp., 10 (1956) no. 55, pp. 167–168.. Cited by: Remark 2.4.
- [11] E. Cătinaş, Inexact perturbed Newton methods and applications to a class of Krylov solvers, J. Optim. Theory Appl., 108 (2001) no. 3, pp. 543–570.. Cited by: §1, §3.2.2, §5.
- [12] E. Cătinaş, On accelerating the convergence of the successive approximations method, Rev. Anal. Numér. Théor. Approx., 30 (2001) no. 1, pp. 3–8.. Cited by: §5.
- [13] E. Cătinaş, On the superlinear convergence of the successive approximations method, J. Optim. Theory Appl., 113 (2002) no. 3, pp. 473–485.. Cited by: §1, §5.
- [14] E. Cătinaş, The inexact, inexact perturbed and quasi-Newton methods are equivalent models, Math. Comp., 74 (2005) no. 249, pp. 291–301.. Cited by: §1, §3.2.2.
- [15] F. Soleymani, On finding robust approximate inverses for large sparse matrices, Linear and Multilinear Algebra, 62 (2014) no. 10, 1314-1334.. Cited by: §1.
- [16] F.A. Potra, On -order and -order of convergence, J. Optim. Theory Appl., 63 (1989) no. 3, pp. 415–431.. Cited by: §1, §1, §2.1.1, §2.2, Remark 2.1, Remark 2.14, Remark 2.14, Remark 2.4, Example 2.8, §3.1, §3.1, §3.1.1, §3.1.1, §3.1.2, §3.1.2, §3.1.2, Theorem 3.10, Example 3.11, Proposition 3.3, Theorem 3.5.
- [17] F.A. Potra, Primal-dual affine scaling interior point methods for linear complementarity problems, SIAM J. Optim., 19 (2008) no. 1, pp. 114-–143.. Cited by: Remark 2.12, Remark 2.4.
- [18] F.A. Potra, Q-superlinear convergence of the iterates in primal-dual interior-point methods, Math. Program., Ser. A 91 (2001), pp. 99–-115, DOI 10.1007/s101070100230. Cited by: Remark 2.12, Remark 2.4.
- [19] F.A. Potra, V. Pták, Nondiscrete Induction and Iterative Processes, Pitman, Boston, Massachusetts, 1984.. Cited by: §1, §1, §1, §2.1.2, §2.1.2, Remark 2.1, Remark 2.14, Remark 2.4, Example 2.8, §3.1.2, §3.2.1, §3.2.1, §3.2.1, §3.2.1, Proposition 3.13, Theorem 3.17, Corollary 3.18, Proposition 3.19, Example 3.20, Example 3.6.
- [20] H. Schwetlick, Numerische Lösung Nichtlinearer Gleichungen, VEB, Berlin, 1979.. Cited by: §2.1.1, §2.2, Remark 2.1, Remark 2.4, Example 2.8.
- [21] H.F. Walker, An approach to continuation using Krylov subspace methods, Computational Science in the 21st Century, J. Periaux, ed., John Wiley and Sons, Ltd., 1997, pp. 72–81.. Cited by: §2.1.2, §3.2.1, §3.2.1, §3.2.1.
- [22] I. Păvăloiu, Approximation of the roots of equations by Aitken-Steffensen-type monotonic sequences, Calcolo, 32 (1995) nos. 1-2, pp. 69–82. DOI 10.1007/BF02576543. Cited by: §2.1.2.
- [23] I. Păvăloiu, On the convergence of one step and multistep iterative methods, unpublished manuscript, http://ictp.acad.ro/convergence-one-step-multistep-iterative-methods/, accessed April 1st, 2018.. Cited by: §2.1.2.
- [24] I. Păvăloiu, Optimal algorithms concerning the solving of equations by interpolation, Research on Theory of Allure, Approximation, Convexity and Optimization, Ed. Srima, Cluj-Napoca (1999), pp. 222–248, ISBN 973-98551-4-3. http://ictp.acad.ro/optimal-algorithms-concerning-solving-equations-interpolation/ accessed March 22, 2018.. Cited by: §2.1.2.
- [25] I. Păvăloiu, Sur l’ordre de convergence des méthodes d’itération, Mathématica, 23 (46) (1981) no. 1, pp. 261–272 (in French). http://ictp.acad.ro/convergence-order-iterative-methods/ accessed March 22, 2018.. Cited by: §2.1.2.
- [26] J. Fourier, Question d’analyse algébrique, Bull. des Science par la Societe Philo., 67 (1818), pp. 61–67. http://gallica.bnf.fr/ark:/12148/bpt6k33707/f248.item accessed February 23, 2018.. Cited by: §1.
- [27] J. Herzberger, L. Metzner, On the -orders of convergence of coupled sequences arising in iterative numerical processes, Computing, 57 (1976), pp. 357–363.. Cited by: Remark 2.14.
- [28] J.E. Dennis, Jr. and J.J. Moré, A characterization of superlinear convergence and its application to quasi-Newton methods, Math. Comput., 28 (1974) no. 126, pp. 549–560.. Cited by: §1, §1, §2.1.2, §2.1.2, §3.2.1, §3.2.1, Lemma 3.14.
- [29] J.E. Dennis, On Newton-like methods, Numer. Math., 11 (1968), pp. 324–330.. Cited by: Remark 2.14.
- [30] J.F. Traub, Iterative Methods for the Solution of Equations, Prentice-Hall, Englewood Cliffs, N.J., 1964.. Cited by: §1, Remark 2.14.
- [31] J.M. Ortega, M.L. Rockoff, Nonlinear difference equations and Gauss-Seidel type iterative methods, SIAM J. Numer. Anal., 3 (1966) no. 3, pp. 497–513.. Cited by: Remark 2.14.
- [32] J.M. Ortega, W.C. Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, Academic Press, New York, 1970.. Cited by: §1, §1, §2.1.1, §2.1.1, §2.1.1, §2.2, §2.2, Remark 2.1, Proposition 2.13, Remark 2.14, Remark 2.14, Remark 2.14, Remark 2.4, Proposition 2.5, Remark 2.6, §3.1.1, §3.1.1, §3.1.2, §3.1.2, Remark 3.12, Remark 3.2, Example 3.6, Example 3.6, Remark 3.8, Remark 3.8, §5, §5.
- [33] J.W. Schmidt, On the R-order of coupled sequences, Computing, 26 (1981), pp. 333–-342.. Cited by: §2.1.1, §2.1.1, Remark 2.14.
- [34] L. Tornheim, Convergence of multipoint iterative methods, J. ACM, 11 (1964), pp. 210–220.. Cited by: Remark 2.4.
- [35] L.O. Jay, A note on Q-order of convergence, BIT, 41 (2001) no. 2, pp. 422–429.. Cited by: Remark 2.11, Remark 2.14, Example 2.2, Example 2.8, Example 2.8, Example 3.6.
- [36] Li Wang, Symbolic dynamics for a class of unimodal maps and a metric property of bifurcation in trapezoidal maps (period doubling, contiguity of harmonics, quadratic convergence), PhD thesis, State University of New York at Buffalo, 1986.. Cited by: §2.1.1.
- [37] M. Grau-Sánchez, M. Noguera, À. Grau, J.R. Herrero, On new computational local orders of convergence, Appl. Math. Lett., 25 (2012), pp. 2023–2030.. Cited by: §2.1.2, §2.1.2.
- [38] M. Grau-Sánchez, M. Noguera, J.M. Gutiérrez, On some computational orders of convergence, Appl. Math. Lett., 23 (2010), pp. 472–478.. Cited by: §2.1.2, §2.1.2.
- [39] M. Grau-Sánchez, M. Noguera, On Convergence and efficiency in the resolution of systems of nonlinear equations from a local analysis. In: Amat S., Busquier S. (eds) Advances in Iterative Methods for Nonlinear Equations. SEMA SIMAI Springer Series, vol 10. Springer, 2016.. Cited by: §2.1.2, Remark 2.12.
- [40] M. Igarashi, T. Ypma, Empirical versus asymptotic rate of convergence of a class of methods for solving a polynomial equation, J. Comp. Appl. Math., 82 (1997), 229–237.. Cited by: Remark 2.6.
- [41] M. Raydan, Exact order of convergence of the secant method, J. Optim. Theory Appl., 78 (1993) 3, pp. 541–551.. Cited by: §1, Remark 2.14.
- [42] N. Bourbaki, Fonction d’une Variable Réelle. Livre 4. Chapitre 5. Hermann, Paris, 1951.. Cited by: Remark 2.4.
- [43] N.J. Higham, Functions of Matrices: Theory and Computation, SIAM, Philadelphia, 2008.. Cited by: §1.
- [44] R. Brent, The computational complexity of iterative methods for systems of nonlinear equations, in: Miller R.E., Thatcher J.W., Bohlinger J.D. (eds) Complexity of Computer Computations. The IBM Research Symposia Series. Springer, Boston, MA, 1972.. Cited by: Remark 2.4.
- [45] R. Cavanagh, Difference equations and iterative processes, Doctoral thesis, Univ. of Maryland, College Park, 1970.. Cited by: §2.1.1.
- [46] R. Hauser, J. Nedić, On the relationship between the convergence rates of iterative and continuous processes, SIAM J. Optim. Appl., 18 (2007) no. 1, pp. 52–64.. Cited by: §1.
- [47] R. Varga, Matrix Iterative Analysis, Prentice Hall, Englewood Cliffs, New Jersey, 1962.. Cited by: Remark 2.14.
- [48] R. Zhang, L. Wang, Two convergence problems for monotone sequences, Acta Appl. Math., 47 (1997), pp. 213–220.. Cited by: §4.
- [49] R.A. Tapia, J.E. Dennis, Jr., J.P. Schäfermeyer, Inverse, shifted inverse, and Rayleigh quotient iteration as Newton’s method, SIAM Review, 60 (2018) no. 1, pp. 3–55. DOI 10.1137/15M1049956. Cited by: §1, §1, §2.2, Remark 2.14, §3.1.2.
- [50] R.G. Voigt, Orders of convergence for iterative procedures, SIAM J. Numer. Anal., 8 (1971), pp. 222–243.. Cited by: Remark 2.14.
- [51] R.P. Brent, S. Winograd, P. Wolfe, Optimal iterative processes for root-finding, Numer. Math., 20 (1973), pp. 327–341.. Cited by: §2.2, Remark 2.14, Remark 2.4, §3.1.1.
- [52] R.P. Brent, Some efficient algorithms for solving systems of nonlinear equations, SIAM J. Numer. Anal., 10 (1973), pp. 327–344.. Cited by: §2.2, Remark 2.4, §3.1.2, Example 3.6.
- [53] R.S. Dembo, S.C. Eisenstat and T. Steihaug, Inexact Newton methods, SIAM J. Numer. Anal., 19 (1982) no.2, pp. 400–408.. Cited by: §1, §3.2.2, Lemma 3.23, §5.
- [54] S. Weerakoon, T.G.I. Fernando, A variant of Newton’s method with accelerated third-order convergence, Appl. Math. Lett., 13 (2000) no. 8, pp. 87–93.. Cited by: §1.
- [55] S.C. Eisenstat, H.F. Walker, Choosing the forcing terms in an inexact Newton method, SIAM J. Sci. Comput., 17 (1996) no. 1, pp. 16–32.. Cited by: §1, §3.2.2.
- [56] S.J. Wright, Primal-Dual Interior-Point Methods, SIAM, Philadelphia, 1997.. Cited by: Remark 2.12, Remark 2.4, Remark 2.9.
- [57] T.J. Ypma, Historical development of the Newton-Raphson method, SIAM Review, 37 (1995) no. 4, pp. 531–551.. Cited by: §1.
- [58] W. Burmeister, J.W. Schmidt, On the R-order of coupled sequences II, Computing, 29 1982, pp. 73–81.. Cited by: Remark 2.14.
- [59] W. Burmeister, J.W. Schmidt, On the R-order of coupled sequences III, Computing, 30 1983, pp. 157–169.. Cited by: Remark 2.14.
- [60] W.A. Beyer, B.R. Ebanks, C.R. Qualls, Convergence rates and convergence-order profiles for sequences, Acta Appl. Math., 20 (1990), pp. 267–284.. Cited by: §1, §1, §2.1.1, §2.1.1, §2.1.1, §2.1.1, §2.1.1, §2.1.1, §2.1.1, §2.1.2, §2.1.2, §2.1.2, Example 2.7, Example 2.8, §3.1, §3.1, §3.1.1, §3.1.1, §3.1.1, §3.1.1, §3.1.1, §3.1.1, §3.2.1, §3.2.1, §3.3, Lemma 3.1, Theorem 3.15, Lemma 3.4, Theorem 3.5, Example 3.6, §4, §4.
- [61] W.A. Beyer, P.R. Stein, Period doubling for trapezoid function iteration: metric theory, Adv. Appl. Math., 3 (1982), pp. 1–17.. Cited by: §2.1.1, §2.1.1, §2.1.2.
- [62] W.C. Rheinboldt, Methods for Solving Systems of Nonlinear Equations, 2nd ed., SIAM, Philadelphia, 1998.. Cited by: §1, §1, §3.2.2.
- [63] Y. Ye, Interior Point Algorithms: Theory and Analysis, Wiley, 1997.. Cited by: Remark 2.12, Remark 2.4.
[1] W.A. Beyer, B.R. Ebanks, C.R. Qualls, Convergence rates and convergence-order profiles for sequences, Acta Appl. Math. 20 (1990) 267-284.
CrossRef (DOI)
[2] W.A. Beyer, P.R. Stein, Period doubling for trapezoid function iteration: metric theory, Adv. Appl. Math., 3 (1982), pp. 1-17.
CrossRef (DOI)
[3] W.A. Beyer, P.R. Stein, Further results on periods and period doubling for iterates of the trapezoid function, Adv. Appl. Math., 3 (1982), pp. 265-287.
CrossRef (DOI)
[4] E. Bodewig, On types of convergence and on the behaviour of approximations in the neighborhood of a multiple root of an equation, Quart. Appl. Math., 7 (1949), pp. 325-333
CrossRef (DOI)
[5] N. Bourbaki, Fonction d’une Variable Réelle. Livre 4. Chapitre 5. Hermann, Paris, 1951.
[6] R. Brent, The computational complexity of iterative methods for systems of nonlinear equations, in: Miller R.E., Thatcher J.W., Bohlinger J.D. (eds) Complexity of Computer Computations. The IBM Research Symposia Series. Springer, Boston, MA, 1972
[7] R.P. Brent, Some efficient algorithms for solving systems of nonlinear equations, SIAM J. Numer. Anal., 10 (1973), pp. 327-344.
CrossRef (DOI)
[8] R.P. Brent, Algorithms for minimization without derivatives, Prentice-Hall, Englewood Cliffs, New Jersey, 1973.
[9] R.P. Brent, S. Winograd, P. Wolfe, Optimal iterative processes for root-finding, Numer. Math., 20 (1973), pp. 327-341.
CrossREf (DOI)
[10] C. Brezinski, Comparaison des suites convergences, Revue Française d’Informatique et de Recherche Operationelle, 5 (1971) no. 2, pp. 95-99.
CrossREf (DOI)
[11] C. Brezinski, Accélération de la Convergence en Analyse Numérique, Springer-Verlag, Berlin, 1977.
[12] C. Brezinski, Limiting relationships and comparison theorems for sequences, Rend. Circolo Mat. Palermo, Ser. II, vol. 28 (1979), pp. 273-280.
CrossRef (DOI)
[13] C. Brezinski, Vitesse de convergence d’une suite, Rev. Roumaine Math. Pures Appl., 30 (1985), pp. 403-417.
[14] W. Burmeister, J.W. Schmidt, On the R-Order of Coupled Sequences II, Computing, 29 1982, pp. 73-81.
CrossRef (DOI)
[15] W. Burmeister, J.W. Schmidt, On the R-Order of Coupled Sequences III, Computing, 30 1983, pp. 157-169.
CrossRef (DOI)
[16] E. Cătinaş, Inexact perturbed Newton methods and applications to a class of Krylov solvers, J. Optim. Theory Appl., 108 (2001) no. 3, pp. 543-570.
CrossRef (DOI)
[17] E. Cătinaş, On accelerating the convergence of the successive approximations method, Rev. Anal. Numér. Théor. Approx., 30 (2001) no. 1, pp. 3-8.
Paper on author’s website
[18] E. Cătinaş, On the superlinear convergence of the successive approximations method, J. Optim. Theory Appl., 113 (2002) no. 3, pp. 473-485.
CrossRef (DOI)
[19] E. Cătinaş, The inexact, inexact perturbed and quasi-Newton methods are equivalent models, Math. Comp., 74 (2005) no. 249, pp. 291-301.
Crossref (DOI)
[20] R. Cavanagh, Difference equations and iterative processes, Doctoral thesis, Univ. of Maryland, College Park, 1970.
[21] A.I. Cohen, Rate of convergence and optimality properties of root finding and optimization algorithms, Doctoral dissertation, University of California, Berkeley, 1970.
[22] A. Cordero, J.R. Torregrosa, Variants of Newton’s method using fifth-order quadrature formulas, Appl. Math. Comput. 190 (1) (2007), 686-698.
CrossRef (DOI)
[23] R.S. Dembo, S.C. Eisenstat, T. Steihaug, Inexact Newton methods, SIAM J. Numer. Anal. 19 (2) (1982) 400-408.
CrossRef (DOI)
[24] J.E. Dennis, On Newton-like methods, Numer. Math. 11 (1968) 324-330.
CrossRef (DOI)
[25] J.E. Dennis Jr., J.J. Moré, A characterization of superlinear convergence and its application to quasi-Newton methods, Math. Comput. 28 (126) (1974) 549-560.
CrossRef (DOI)
[26] F. Dubeau, On comparisons of Chebyshev-Halley iteration functions based on their asymptotic constants, Int. J. Pure Appl. Math. 85 (5) (2013) 965-981.
CrossRef (DOI)
[27] S.C. Eisenstat, H.F. Walker, Choosing the forcing terms in an inexact Newton method, SIAM J. Sci. Comput. 17 (1) (1996) 16-32.
CrossRef (DOI)
[28] A. Feldstein, R.M. Firestone, Hermite interpolatory iteration theory and parallel numerical theory, Division of Applied Mathematics, Brown University, 1967 Report.
[29] A. Feldstein, R.M. Firestone, A study of Ostrowski efficiency for composite iteration algorithms, in: Proceedings of the 1969 24th National Conference, ACM, 1969.
CrossRef (DOI)
[30] A. Feldstein, Bounds on order and Ostrowski efficiency for interpolatory iteration algorithms, California University/Lawrence Livermore Lab., Livermore, USA, 1969 Report ucrl-72238.
[31] J. Fourier, Question d’analyse algébrique, Bull. Sci. Soc. Philos. 67 (1818) (2018) 61–67. http://gallica.bnf.fr/ark:/12148/bpt6k33707/f248.item, accessed February 23, 2018
[32] M. Grau-Sánchez, M. Noguera, J.M. Gutiérrez, On some computational orders of convergence, Appl. Math. Lett. 23 (2010) 472-478.
CrossRef (DOI)
[33] M. Grau-Sánchez, M. Noguera, A. Grau, J.R. Herrero, On new computational local orders of convergence, Appl. Math. Lett. 25 (2012) 2023-2030.
CrossRef (DOI)
[34] M. Grau-Sánchez, M. Noguera, On convergence and efficiency in the resolution of systems of nonlinear equations from a local analysis, in: S. Amat, S. Busquier (Eds.), Advances in Iterative Methods for Nonlinear Equations, SEMA SIMAI Springer Series, 10, Springer, 2016.
CrossRef (DOI)
[35] R. Hauser, J. Nedic, On the relationship between the convergence rates of iterative and continuous processes, SIAM J. Optim. Appl. 18 (1) (2007) 52-64.
CrossRef (DOI)
[36] J. Herzberger, L. Metzner, On the q-orders of convergence of coupled sequences arising in iterative numerical processes, Computing 57 (1976) 357–363.
CrossRef (DOI)
[37] N.J. Higham, Functions of Matrices: Theory and Computation, SIAM, Philadelphia, 2008.
[38] M. Igarashi, T. Ypma, Empirical versus asymptotic rate of convergence of a class of methods for solving a polynomial equation, J. Comput. Appl. Math. 82 (1997) 229-237.
CrossREf (DOI)
[39] L.O. Jay, A note on q-order of convergence, BIT 41 (2) (2001) 422-429.
CrossRef (DOI)
[40] J.M. Ortega, W.C. Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, Academic Press, New York, 1970.
CrossRef (DOI)
[41] J.M. Ortega, M.L. Rockoff, Nonlinear difference equations and Gauss-Seidel type iterative methods, SIAM J. Numer. Anal. 3 (3) (1966) 49–513.
CrossRef (DOI)
[42] A. Ostrowski, Solution of Equations and Systems of Equations, Academic Press, New York, 1960 (2nd ed. 1966). 3rd edition published as Solution of Equations in Euclidean and Banach Spaces, Academic Press, 1973, New York and London.
[43] I. Pavaloiu, Sur l’ordre de convergence des méthodes d’itération, Mathematica, 23 46 (1) (1981) 261–272. (in French). https://ictp.acad.ro/convergence-order-iterative-methods/ accessed March 22, 2018.
paper on author’s website
[44] I. Pavaloiu, Approximation of the roots of equations by Aitken-Steffensen-type monotonic sequences, Calcolo 32 (1–2) (1995) 69-82.
CrossRef (DOI), paper on author’s website
[45] I. Pavaloiu, Optimal algorithms concerning the solving of equations by interpolation, in: Research on Theory of Allure, Approximation, Convexity and Optimization, Editura Srima, Cluj-Napoca, Romania, 1999, pp. 222-248. ISBN 973-98551-4-3. https://ictp.acad.ro/optimal-algorithms-concerning-solving-equations-interpolation/ accessed March 22, 2018.
paper on author’s website
[46] I. Pavaloiu, On the convergence of one step and multistep iterative methods, Unpublished manuscript, https://ictp.acad.ro/convergence- one- step- multistep- iterative- methods/, accessed April 1st, 2018.
paper on author’s website
[47] F.A. Potra, On Q-order and R-order of convergence, J. Optim. Theory Appl. 63 (3) (1989) 415-431.
CrossRef (DOI)
[48] F.A. Potra, V. Pták, Nondiscrete Induction and Iterative Processes, Pitman, Boston, Massachusetts, 1984.
[49] F.A. Potra, Q-superlinear convergence of the iterates in primal-dual interior-point methods, Math. Program. Ser. A 91 (2001) 99-115
CrossRef DOI)
[50] F.A. Potra, Primal-dual affine scaling interior point methods for linear complementarity problems, SIAM J. Optim. 19 (1) (2008) 114-143.
CrossRef (DOI)
[51] M. Raydan, Exact order of convergence of the secant method, J. Optim. Theory Appl. 78 (3) (1993) 541-551.
CrossRef (DOI)
[52] W.C. Rheinboldt, Methods for Solving Systems of Nonlinear Equations, 2nd. ed., SIAM, Philadelphia, 1998
[53] J.W. Schmidt, On the r-order of coupled sequences, Computing 26 (1981) 333-342.
CrossREf (DOI)
[54] H. Schwetlick, Numerische Lösung Nichtlinearer Gleichungen, VEB, Berlin, 1979.
[55] F. Soleymani, On finding robust approximate inverses for large sparse matrices, Linear Multilinear Algebra 62 (10) (2014) 1314-1334.
CrossRef (DOI)
[56] R.A. Tapia, J.E. Dennis Jr., J.P. Schäfermeyer, Inverse, shifted inverse, and Rayleigh quotient iteration as Newton’s method, SIAM Rev. 60 (1) (2018) 3-55.
CrossRef (DOI)
[57] L. Tornheim, Convergence of multipoint iterative methods, J. ACM 11 (1964) 210-220.
CrossRef (DOI)
[58] J.F. Traub, Iterative Methods for the Solution of Equations, Prentice-Hall, Englewood Cliffs, N.J., 1964.
[59] R. Varga, Matrix Iterative Analysis, Prentice Hall, Englewood Cliffs, New Jersey, 1962.
[60] R.G. Voigt, Rates of Convergence for Iterative Methods for Nonlinear Systems of Equations, University of Maryland, College Park, Maryland, Ph.D. Diss.
[61] R.G. Voigt, Orders of convergence for iterative procedures, SIAM J. Numer. Anal. 8 (1971) 222-243
CrossRef (DOI)
[62] H.F. Walker, An approach to continuation using Krylov subspace methods, in: J. Periaux (Ed.), Computational Science in the 21st Century, John Wiley and Sons, Ltd., 1997, pp. 72-81
paper on author website
[63] D.D. Wall, The order of an iteration formula, Math. Comput. 10 (55) (1956) 167-168.
CrossRef (DOI)
[64] L. Wang, Symbolic dynamics for a class of unimodal maps and a metric property of bifurcation in trapezoidal maps (period doubling, contiguity of harmonics, quadratic convergence), State University of New York at Buffalo, 1986 Ph.D. thesis.
[65] S. Weerakoon, T.G.I. Fernando, A variant of Newton’s method with accelerated third-order convergence, Appl. Math. Lett. 13 (8) (20 0 0) 87-93.
CrossRef (DOI)
[66] S.J. Wright, Primal-Dual Interior-Point Methods, SIAM, Philadelphia, 1997.
[67] Y. Ye, Interior Point Algorithms: Theory and Analysis, Wiley, 1997.
[68] T.J. Ypma, Historical development of the Newton-Raphson method, SIAM Rev. 37 (4) (1995) 531-551
CrossRef (DOI)
[69] R. Zhang, L. Wang, Two convergence problems for monotone sequences, Acta Appl. Math. 47 (1997) 213-220.
CrossRef (DOI)