In this note we deal with two problems: the first regards the efficiency in approximating the inverse of a matrix by the Schultz-type methods, and the second is the problem of evaluating the errors in the approximation of the inverses of the perturbed matrices.
Author
Ion Păvăloiu
(Tiberiu Popoviciu Institute of Numerical Analysis)
[1] Herzberger J., Explizite Shulz Verfahren hoherer ordrnung zur approximation der reversen matrix, Z. Angew Math. und Mech. 1988, Bd. 68, No. 5, pp. 494-496
[2] Ostrowski M.A., Solution of equations in euclidian and Banach spaces, Academic Press. New York and London (1975)
[3] Stickel E., On a class of high order methods for investing matrices, Z. Angew Math. und Mech. 1987, Bd. 67, No. 7, pp. 334-336
In this note we deal with two problems: the first regards the efficiency in approximating the inverse of a matrix by the Shulz-type methods, and the second is the problem of evaluating the errors in the approximation of the inverses of the perturbed matrices.
1. Introduction
In this note we deal with two problems: the first regards the efficiency in approximating the inverse of a matrix by the Shulz-type methods, and the second is the problem of evaluating the errors in the approximation of the inverses of the perturbed matrices.
As it is well known, given a nonsingular matrix A inR^(m xx m)A \in \mathbb{R}^{m \times m} and a matrix D_(0)inR^(m xx m)D_{0} \in \mathbb{R}^{m \times m} such that
with q inRq \in \mathbb{R} and II the mm-th order unit matrix, then, for k inN,k >= 2k \in \mathbb{N}, k \geq 2 fixed, the sequence of matrices (D_(n))_(n >= 0)\left(D_{n}\right)_{n \geq 0} given by
The methods of type (2) represent generalizations of the well known Shulz method. Relation (3) shows that the convergence order of sequence (D_(n))_(n >= 0)\left(D_{n}\right)_{n \geq 0} is k,k >= 2k, k \geq 2.
We introduce the notion of efficiency index of method (2). We notice that at each iteration step, the number of the matrix sums required is equal to the number of matrix products which appear in (2). Moreover, for computing the sum I+F_(n)+cdots+F_(n)^(k)I+F_{n}+\cdots+F_{n}^{k} we may use a method similar to the Horner scheme, i.e.
In this way the matrix sums required reduce to sums in which one term is the identity matrix. This remark is also valid for the term F_(n)=I-AD_(n)F_{n}=I-A D_{n}. The operation consisting of one matrix product and one matrix sum (regardless of their order) we call it computing unit.
Definition 1. The efficiency index of method (2) is given by
where s inNs \in \mathbb{N} represents the number of computing units required at each iteration step of method (2).
This definition is given by analogy to the efficiency index introduced by A.M. Ostrowski in [2]. The definition may also be motivated by the following reasoning.
Consider now two methods of type (2), having the convergence orders k_(1)k_{1} and k_(2)k_{2} respectively. Assume that, for achieving the same precision, these methods require n_(1)n_{1} respectively n_(2)n_{2} iteration steps. Then (8) implies
Taking into account Definition 1, it follows that among the methods of type (2) for different values of kk, the most efficient is given by the one with high efficiency index.
We shall determine in the following section the optimal method, i.e., having the high efficiency index, when k inN,k >= 2k \in \mathbb{N}, k \geq 2.
2. Optimal efficiency index
Assume that we use (4) at each iteration step in (2). It can be easily seen that for the sum in (4) there are needed k-2k-2 matrix products. Relation (2) shows that 2 more matrix products are required at each iteration step, so in total we need kk matrix products.
Taking into account (5), it follows that the efficiency index of method (2) is given by
Considering the function f:(0,+oo)rarrR,f(x)=x^((1)/(x))f:(0,+\infty) \rightarrow \mathbb{R}, f(x)=x^{\frac{1}{x}}, it can be easily seen that this function attains a maximum value at x=ex=e. Since ff is increasing on ( 0,e0, e ) and decreasing on ( e,+ooe,+\infty ), it follows that bar(E)_(k)\bar{E}_{k} is the largest for k=3k=3.
We have proved the following result.
Theorem 2. Among the methods (2) for k=N,k >= 2k=\mathbb{N}, k \geq 2, the method with highest efficiency index is given by:
In practice, the elements of the matrix AA are usually obtained as results of certain experiments, measurements, approximations etc. Therefore their values are altered by errors. Consequently we replace AA by the approximation widetilde(A)\widetilde{A}. For a rigorous interpretation of the results, it is necessary to know an error bound epsi > 0\varepsilon>0 for which
and (26) attracts 1-|| widetilde(F)_(0)||-epsi|| widetilde(D)_(0)|| > 01-\left\|\widetilde{F}_{0}\right\|-\varepsilon\left\|\widetilde{D}_{0}\right\|>0.
The following inequality can be easily proved
This inequality provides a priori evaluations for the error. If we want to stop the iterations at a certain step bar(n)\bar{n} such that || widetilde(F)_( bar(n))|| <= epsi_(1),epsi_(1) > 0\left\|\widetilde{F}_{\bar{n}}\right\| \leq \varepsilon_{1}, \varepsilon_{1}>0 given, then by (7) and (17) it follows
[1] Herzberger J., Explizite Shulz Verfahren höherer ordrnung zur approximation der reversen matrix, Z. Angew Math. und Mech. 1988, Bd. 68, No. 5, pp. 494-496
[2] Ostrowski M.A., Solution of equations in euclidian and Banach spaces, Academic Press. New York and London (1975)
[3] Stickel E., On a class of high order methods for investing matrices, Z. Angew Math. und Mech. 1987, Bd. 67, No. 7, pp. 334-336
"T. Popoviciu" Institute of Numerical Analysis
Str. Fântânele, nr.57, Bloc B7, sc.II, etaj 5, ap. 67-68
Cluj-Napoca, Romania
E-mail address: pavaloiu@ictp.acad.ro
Received: 28.02.2003; In revised form: 30.10.2003
Key words and phrases. Inverse of a matrix, perturbed matrix, efficiency in approximating the inverse of a matrix.