Multicentric calculus and the Riesz projection

Diana Apetrei\(^\ast \) Olavi Nevanlinna\(^\ast \)

November 9, 2015.

\(^\ast \)Aalto University, Department of Mathematics and Systems Analysis, P.O.Box 11100, Otakaari 1M, Espoo, FI-00076 Aalto, Finland, email: {Diana.Apetrei,
Olavi.Nevanlinna}@aalto.fi@aalto.fi
.

In multicentric holomorphic calculus one represents the function \(\varphi \) using a new polynomial variable \(w=p(z)\) in such a way that when it is evaluated at the operator \(A,\) then \(p(A)\) is small in norm. Usually it is assumed that \(p\) has distinct roots. In this paper we discuss two related problems, the separation of a compact set (such as the spectrum) into different components by a polynomial lemniscate, respectively the application of the Calculus to the computation and the estimation of the Riesz spectral projection. It may then become desirable the use of \(p(z)^n\) as a new variable. We also develop the necessary modifications to incorporate the multiplicities in the roots.

MSC. 30B99, 32E30, 34L16, 47A10, 47A60.

Keywords. multicentric calculus, Riesz projections, spectral projections, sign function of an operator, lemniscates.

A Introduction

Let \(p(z)\) be a polynomial of degree \(d\) with distinct roots \(\lambda _1, \dots , \lambda _d\). In multicentric holomorphic calculus the polynomial is taken as a new variable \(w=p(z)\) and functions \(\varphi (z)\) are represented with the help of a vector-valued function \(f\), mapping \(w\mapsto f(w) \in \mathbb C^d\), [ 9 ] . For example, sets bounded by lemniscates \(|p(z)| = \rho \) are then mapped onto discs \(|w| \le \rho \) and for \(\rho \) small, \(f\) has a rapidly converging Taylor series.

The multicentric representation then yields a functional calculus for operators (or matrices) \(A\), if one has found a polynomial \(p\) such that \(\| p(A)\| \) is small. In fact, denote

\[ V_p(A)=\big\{ z \in \mathbb C \ : \ |p(z)| \le \| p(A)\| \big\} \]

and observe that, by spectral mapping theorem, the spectrum \(\sigma (A)\) satisfies \(\sigma (A) \subset V_p(A)\). If \(f\) has a rapidly converging Taylor series for \(|w| \le \| p(A)\| \), then \(\varphi (A)\) can be written down by a rapidly converging explicit series expansion.

Since \(V_p(A)\) can have several components, one can define \(\varphi =1\) in the neighborhood of some components, while \(\varphi =0\) in a neighborhood of the others. Then \(\varphi (A)\) represents the spectral projection onto the invariant subspace corresponding to the part of the spectrum where \(\varphi =1\). The spectral projection satisfies

\[ \varphi (A)=\tfrac {1}{2\pi i} \int _\gamma (\lambda I-A)^{-1} d\lambda , \]

where \(\gamma \) surrounds the appropriate components of the spectrum, but the computational approach does not need the evaluation of the contour integral. The coefficients for the Taylor series of \(f\) can be computed with explicit recursion from those of \(\varphi \) at the local centers \(\lambda _j\) [ 9 ] . The approach also yields a bound for \(\| \varphi (A)\| \) by a generalization of the von Neumann theorem for contractions, see [ 7 ] . If the scalar function \(\varphi \) is not holomorphic at the spectrum, the multicentric calculus leads to a new functional calculus to deal with, e.g. nontrivial Jordan blocks [ 10 ] .

In this paper we take a closer look at the computation of spectral projections in finding the stable and unstable invariant subspaces of an operator \(A\). The direct approach would be to ask for a polynomial \(p\) with distinct roots such that

\[ V_p(A) \cap i\mathbb R = \emptyset \]

and then apply the calculus with \(\varphi =1\) for \(\textnormal{Re } z{\gt}0,\) respectively \(\varphi =0\) for \(\textnormal{Re } z{\lt}0\). However, we discuss this as two different subjects, one being the separation of the spectrum and the other being the computation of the projection.

In order to discuss the separation, we denote

\[ V(p,\rho ) = \{ z \in \mathbb C \ : |p(z) | \leq \rho \} \]

and we let \(K=\sigma (A),\) then we ask what is the minimal degree of a polynomial such that

\begin{equation} \label{ekakysymys} K \subset V(p,\rho ) \text{ and } V(p,\rho ) \cap i\mathbb R = \emptyset . \end{equation}
A.1

holds. By Hilbert’s lemniscate theorem, see e.g. [ 12 ] , such a polynomial with a minimal degree always exists. We model this question by considering in place of \(\sigma (A)\) two lines parallel to the imaginary axis as follows

\[ K= \big\{ z=x+iy \ : \ x\in \{ -1,1\} , \ |y| \le \tan (\alpha )\big\} \]

and derive a sample of polynomials for which (A.1) holds when \(\alpha \) grows. For \(\alpha {\lt}\pi /4,\) the minimal degree is clearly \(2\) but in general we are not able to prove exact lower bounds.

Whenever (A.1) holds, then the series expansion of \(f\) representing \(\varphi =1\) for \(\textnormal{Re } z{\gt}0\) and \(\varphi =0\) for \(\textnormal{Re } z{\lt}0\) converges in \(p(K),\) and if \(\sigma (A)\subset K\) then we do obtain a convergent expresion for the projection onto to unstable invariant subspace. However, whenever \(p(A)\) would be nonnormal, it could happen that \(V_p(A) \cap i\mathbb R \not=\emptyset \), and then we would not get a bound for the projection by the generalization of von Neumann theorem. To overcome this, note that from the spectral radius formula \(r(B)= \lim \| B^n\| ^{1/n}\) in such a case, there does exist an integer \(n\) such that with \(q=p^n\) we have \( V_q(A)\cap i\mathbb R =\emptyset \).

This leads to the other topic discussed in this paper. With \(q=p^n,\) the new variable there are no longer simple zeros and we shall therefore derive the multicentric representations needed in this case. This is done in Section \(2\) while the model problem is discussed in Section \(3\). In Section \(4\) we present concluding remarks and in the end we discuss a nonnormal small dimensional problem with \(q(z)=(z^2-1)^n\).

B Representations and main estimates

B.1 Formulas and estimates

We need the basic formula for expressing a given function \(\varphi (z)\) as a linear combination of functions \(f_{j,k}(w^n)\) when \(w=p(z),\) for \(n\) a given positive integer.

Let \(p(z)\) be the monic polynomial of degree \(d\) with distinct roots \(\lambda _1, \dots ,\lambda _d.\) We denote by \(\delta _k\in \mathbb {P}_{d-1}\) the Lagrange interpolation basis polynomials at \(\lambda _j\)

\[ \delta _k(\lambda )=\tfrac {1}{p’(\lambda _k)} \prod _{j\neq k} (\lambda -\lambda _j). \]

Then the multicentric representation of \(\varphi \) takes the form

\begin{equation} \varphi (z)= \sum _{j=1}^{d} \delta _j(z)f_j(w) \label{phiA}, \ \ \text{where } \ \ w=p(z) \end{equation}
B.1

and \(f_j\)’s are obtained from \( \varphi \) with the formula [ 7 ]

\begin{equation} \label{kaanteiskaava} f_j(w) = \sum _{l=1}^d \delta _l(\lambda _j,w)\varphi (\zeta _l(w)), \end{equation}
B.2

where \(\zeta _l(w)\) denote the roots of \( p(\lambda )-w=0 \) and

\[ \delta _l(\lambda ,w)=\frac{p(\lambda )-w}{p'(\zeta _l(w))(\lambda -\zeta _l(w))}. \]

When \(\varphi \) is holomorphic, \(f\) can also be computed from the Taylor coefficients of \(\varphi \) at the local centers \(\lambda _j\). In fact,

\[ f_j(w)= \sum _{n=0}^\infty \tfrac {1}{n!} f_j^{(n)}(0) w^n, \]

where \(f_j^{(n)}(0)\) can be computed recursively:

\begin{align} & (p’(\lambda _j))^nf_j^{(n)}(0)= \label{ihanoikein} \\ & = \varphi ^{(n)}(\lambda _j) - \sum _{k=1}^d \sum _{m=0}^{n-1} \tbinom nm \delta _k^{(n-m)}(\lambda _j) \sum _{l=0}^m b_{m l}(\lambda _j) f_k^{(l)} (0) - \sum _{l=0}^{n-1} b_{nl}(\lambda _j)f_j^{(l)}(0). \nonumber \end{align}

Here the polynomials \(b_{nm}\) are determined by

\[ b_{n+1,m}= b_{n,m-1} p' + b_{nm}' \]

with \(b_{n0}=0, b_{1,1}=p'\) and \(b_{nm}=0\) for \(m{\gt}n\), see Proposition 4.3 in [ 9 ] .

Remark

Erratum: the last term on the right of (B.3) is missing from the formula (4.2) of [ 9 ] . â–¡

Since the computations for \(f_j\)’s are done with power series, we can move to \(p(z)^{n}=w^{n},\) because the expansions are done for that variable. Therefore we formulate the next theorem.

Theorem B.1

Suppose \(p\) has simple zeros and assume \(\varphi \) is holomorphic in a neighborhood of \(V(p,\rho )=\{ z\in \mathbb C \ : \ |p(z)|\le \rho \} \) and given in the form

\[ \varphi (z) = \sum _{j=1}^d \delta _j(z) f_j(w), \ \ \text{ where } w=p(z). \]

Then

\[ \varphi (z) = \sum _{j=1}^d \delta _j(z) [f_{j,0}(w^n) + \cdots + w^{n-1} f_{j, n-1}(w^n)], \]

where \(f_{j,k}\) are holomorphic in a neighborhood of the disc \(|w| \le \rho \) and given by

\[ w^k f_{j,k}(w^n) = \tfrac {1}{n } \Big\{ f_j(w)+ e^{-2\pi ik/n}f_j(e^{2\pi i/n}w) +\dots +e^{-2\pi i(n-1)k/n} f_j(e^{2\pi i(n-1)/n} w)\Big\} . \]

In order to prove this, we consider a fixed function \(f_j\) and put \(g=f_j.\) Then we set

\[ w^k g_k(w^{n})= \tfrac {1}{n } \Big\{ g(w)+ e^{-2\pi ik/n}g(e^{2\pi i/n}w) +\dots +e^{-2\pi i(n-1)k/n} g(e^{2\pi i(n-1)/n} w)\Big\} \]

pointwise.

Proposition B.2

Given an arbitrary \(n\in \mathbb {N}\) and the functions \(g_i, \text{ } i=0,\dots ,n-1,\) for all \(w \in \mathbb {C}\) we have:

\[ g(w) = g_0(w^n) + w g_1(w^n) + \dots + w^{n-1} g_{n-1}(w^n). \]
Proof â–¼

Using the above formula of \(g_k(w^{n}),\) for \(k=0,1,\dots ,n-1,\) we have

\begin{align*} g_0(w^{n}) =& \tfrac {1}{n} \Big\{ g(w)+g(e^{2\pi i/n}w)+ g(e^{4\pi i/n}w)+\dots + g(e^{2\pi i (n-1)/n}w)\Big\} \nonumber \\ w g_1(w^{n}) =& \tfrac {1}{n} \Big\{ g(w)+e^{-2\pi i/n}g(e^{2\pi i/n}w)+e^{-4\pi i/n}g(e^{4\pi i/n}w) +\dots \nonumber \\ & +e^{-2\pi i (n-1)/n} g(e^{2\pi i (n-1)/n}w)\Big\} \nonumber \\ w^{2} g_2(w^{n}) =& \tfrac {1}{n} \Big\{ g(w)+e^{-4\pi i/n}g(e^{2\pi i/n}w)+ e^{-8\pi i/n}g(e^{4\pi i/n}w)+\dots \nonumber \\ & +e^{-4\pi i (n-1)/n} g(e^{2\pi i (n-1)/n}w)\Big\} \nonumber \\ & \dots \nonumber \\ w^{n-1} g_{n-1}(w^{n}) =& \tfrac {1}{n} \Big\{ g(w)+e^{-2\pi i(n-1)/n}g(e^{2\pi i/n}w)+e^{-4\pi i (n-1)/n} g(e^{4\pi i/n}w)\nonumber \\ & +\dots +e^{-2\pi i (n-1)^{2}/n} g(e^{2\pi i (n-1)/n}w)\Big\} \nonumber . \end{align*}

Summing up all the terms we get

\begin{eqnarray} & \text{}& \tfrac {1}{n} n g(w) + \tfrac {1}{n} g(e^{2\pi i/n}w) \sum _{k=0}^{n-1} e^{-2\pi ik/n} + \tfrac {1}{n} g(e^{4\pi i/n}w) \sum _{k=0}^{n-1} e^{-4\pi ik/n} + \dots \nonumber \\ & \text{}& +\tfrac {1}{n} g(e^{2\pi i(n-2)/n}w)\sum _{k=0}^{n-1} e^{-2\pi i(n-2)k/n}+ \tfrac {1}{n} g(e^{2\pi i(n-1)/n}w)\sum _{k=0}^{n-1} e^{-2\pi i(n-1)k/n} \nonumber \\ & \text{}& = g(w)\nonumber \end{eqnarray}

since all the other terms sum up to zero.

Proof of Theorem B.1 â–¼

It follows now immediately from Proposition B.2 together with the following proposition.

Proposition B.3

If \(f_j\) is given for \(|p(z)|\leq \rho ,\) then \(f_{j,k},\) \(k=1,\dots , n-1,\) are defined for \(|p(z)^{n}|\leq \rho ^{n}\) and

\begin{equation} \label{fj} f_j(p(z))= \sum _{k=0}^{n-1} p(z)^{k} f_{j,k}(p(z)^{n}), \quad \text{for } |p(z)|\leq \rho . \end{equation}
B.-4

Further, if \(f_j(p(z))\) is analytic for \(|p(z)|\leq \rho \) then so are \(f_{j,k}(p(z)^{n}).\)

Proof â–¼

First part is proved in Proposition B.2.

For the second part, we assume \(f_j\) analytic, thus it can be written as a power series

\begin{equation} f_j(p(z))= \sum _{m=0}^{\infty } \alpha _m p(z)^{m}. \label{fj_series} \end{equation}
B.-3

We know that pointwise we have

\begin{eqnarray} p(z)^k f_{j,k}(p(z)^{n})& =& \tfrac {1}{n } \left[ f_j(p(z))+ e^{-2\pi ik/n}f_j(e^{2\pi i/n}p(z)) +\dots \right. \nonumber \\ & \text{ }& \left. +e^{-2\pi i(n-1)k/n} f_j(e^{2\pi i(n-1)/n} p(z))\right]. \label{wkfjk} \end{eqnarray}

When we substitute (??) in (B.-2), we get

\begin{align*} & f_{j,0}(p(z)^{n}) = \\ & =\tfrac {1}{n} \left[ \alpha _0 + \alpha _1 p(z) +\alpha _2 p(z)^{2}+ \dots + \alpha _n p(z)^{n}+ \dots \right. \nonumber \\ & +\alpha _0 +e^{2\pi i/n} \alpha _1 p(z)+ e^{4\pi i/n} \alpha _2 p(z)^{2} + \dots + \alpha _n p(z)^{n}+\dots \nonumber \\ & + \dots \nonumber \\ & +\left. \alpha _0 + e^{2\pi i(n-1)/n} \alpha _1 p(z)+ e^{4\pi i(n-1)/n} \alpha _2 p(z)^{2} + \dots + \alpha _n p(z)^{n}+\dots \right]. \nonumber \end{align*}

Thus \( \displaystyle f_{j,0}(p(z)^{n})= \alpha _0 + \alpha _n p(z)^{n}+ \alpha _{2n} p(z)^{2n}+\dots , \) since all the other terms vanish.

We continue with \(p(z)f_{j,1}(p(z)^{n}),\) so we get

\begin{align*} p(z)f_{j,1}(p(z)^{n}) =& \tfrac {1}{n} \left[ \alpha _0 + \alpha _1 p(z) +\alpha _2 p(z)^{2}+ \dots + \alpha _n p(z)^{n}+ \dots \right. \nonumber \\ & + e^{-2\pi i/n}\alpha _0 + \alpha _1 p(z)+ e^{2\pi i/n} \alpha _2 p(z)^{2} \nonumber \\ & + \dots + e^{2\pi i(n-1)/n}\alpha _n p(z)^{n}+ \alpha _{n+1} p(z)^{n+1} + \dots \nonumber \\ & + \dots \nonumber \\ & + e^{-2\pi i(n-1)/n}\alpha _0 + \alpha _1 p(z)+ e^{2\pi i(n-1)/n} \alpha _2 p(z)^{2} \nonumber \\ & \left. + \dots +e^{2\pi i(n-1)^{2}/n} \alpha _n p(z)^{n}+\alpha _{n+1}p(z)^{n+1}+\dots \right]. \nonumber \end{align*}

Therefore \(\displaystyle f_{j,1}(p(z)^{n})= \alpha _1 p(z) + \alpha _{n+1} p(z)^{n+1}+ \alpha _{2n+1} p(z)^{2n+1}+\dots .\)

In a similar way it follows that

\begin{equation} p(z)^{k}f_{j,k}(p(z)^{n})= \alpha _k p(z)^{k} + \alpha _{n+k} p(z)^{n+k}+ \alpha _{2n+k} p(z)^{2n+k}+\dots \label{wkfjk_series} \end{equation}
B.-11

Because all the coefficients \(\alpha _{m_k},\) for \(m_k=k,n+k,2n+k,3n+k,\dots ,\) come from \(f_j\) which is analytic, we know that \( \displaystyle \limsup |\alpha _{m_k}|^{1/m_k}\leq \tfrac {1}{\rho },\) therefore we have that \(p(z)^{k}f_{j,k}(p(z)^{n}) \) is analytic, i.e. a converging power series.

Now, if we factor (??)

\[ p(z)^{k}f_{j,k}(p(z)^{n})= p(z)^{k} \left[ \alpha _k + \alpha _{n+k} p(z)^{n}+ \alpha _{2n+k} p(z)^{2n}+\dots \right] \]

we have that

\[ f_{j,k}(p(z)^{n})= \alpha _k + \alpha _{n+k} p(z)^{n}+ \alpha _{2n+k} p(z)^{2n}+\dots \]

is a converging power series, thus is analytic for \(|p(z)^{n}|\leq \rho ^{n}\).

Next we need to be able to bound \(\varphi \) in terms of \(f_{j,k}\)’s and vice versa. The first one is straightforward.

Proposition B.4

Denote \(L(\rho )= \sup _{|p(z)|\le \rho } \sum _{j=1}^d |\delta _j(z)|.\) Then

\[ \sup _{|p(z)|\leq \rho } |\varphi (z)| \leq L(\rho ) \max _{1\le j\le d} \sum _{k=0}^{n-1} \sup _{|w| \le \rho } |w^{k}f_{j,k}(w^{n})|. \]

The other direction is more involved and we formulate it in the following theorem.

Theorem B.5

Assume \(p\) is a monic polynomial of degree \(d\) with distinct roots and \(s(\rho )\) denotes the distance from the lemniscate \(|p(z)|=\rho \) to the nearest critical point of \(p,\) \(z_c,\) such that \(|p(z_c)|{\gt}\rho \) . Then there exists a constant \(C\), depending on \(p\) but not on \(\rho ,\) such that if \(\varphi \) is holomorphic inside and in a neighborhood of the lemniscate, then each \(f_{j,k}\) is holomorphic for \(|w| \le \rho \) and for \(|w| \le \rho \), \(1 \le j \le d\), \(0\le k \le n-1,\) we have

\begin{equation} \label{jfa_bound} \ |w^{k}f_{j,k} (w^n)| \ \le \big(1+ \tfrac {C}{s^{d-1}} \big) \sup _{|p(z)| \le \rho }|\varphi (z)|. \end{equation}
B.-10

For the proof of this statement we need some lemmas. The aim is to bound the representing functions \(f_{j,k}\) in terms of the original function \(\varphi \). To that end we first quote the basic result of bounding \(f_j\) in terms of \(\varphi \) and then proceed bounding \(f_{j,k}\) in terms of \(f_j\).

Lemma B.6

(Theorem 1.1 in [ 7 ] ) Suppose \(\varphi \) is holomorphic in a neighborhood of the set \(\{ \zeta : |p(\zeta )|\le \rho \} \) and let \(s\) be as in Theorem B.5. There exists a constant \(C\), depending on \(p\) but independent of \(\rho \) and \(\varphi \), such that

\[ \sup _{|w| \le \rho }|f_j(w)| \le \big(1+ \tfrac {C}{s^{d-1}} \big) \sup _{|p(z)| \le \rho }|\varphi (z)|, \qquad \text{for all } j=1,\dots ,d. \]

Lemma B.7

In the notation above we have

\[ \sup _{|w| \le \rho } |w^{k}g_k(w^n)| \le \sup _{|w|\le \rho } |g(w)|. \]
Proof â–¼

We have

\[ w^k g_k(w^{n})= \tfrac {1}{n } \{ g(w)+ e^{-2\pi ik/n}g(e^{2\pi i/n}w) +\dots +e^{-2\pi i(n-1)k/n} g(e^{2\pi i(n-1)/n} w)\} . \]

The bound follows by taking the absolute values termwise.

Proof of Theorem B.5 â–¼

The proof follows immediately from these two lemmas.

Remark B.8

Gauss-Lucas theorem asserts that given a polynomial \(p\) with complex coefficients, all zeros of \(p'\) belong to the convex hull of the set of zeros of \(p,\) see [ 3 ] . Thus, as soon as \(\rho \) is large enough, all the critical points will stay inside the lemniscate whenever the lemniscate is just a single Jordan curve. But when we want to make the separation, we start squeezing the level of the lemniscate and this results in leaving at least one critical point outside. Therefore the \(s\) we are measuring is the distance to the boundary from that particular critical point.

If the critical points of \(p\) are simple (as they generically are) then the dependency of the distance is inverse proportional and the coefficient in the theorem takes the form

\[ \big(1+ \tfrac {C}{s} \big). \]

To see how one can find these constants \(C\) and what they describe we will present an example with the computations of \(\delta _l(\lambda _j,w)\) for \(p(z)=z^{4}+1.\) These computations will be used in computing the constants. We also apply them for the Riesz projections.

Example B.9

Let \(p(z)=z^4+1\) with roots \(\lambda _1=(-1)^{1/4},\) \(\lambda _2=(-1)^{3/4},\) \(\lambda _3=-(-1)^{1/4}\) and \(\lambda _4=-(-1)^{3/4}.\) Let \(w=z^4+1.\) Using the formula for \(\delta _l(\lambda _j,w)\) we have:

\begin{eqnarray} \delta _1(\lambda _1,w)& =& 1+\tfrac {3}{8}w+\tfrac {19}{64}w^2+\tfrac {33}{128}w^3+\dots \nonumber \\ \delta _1(\lambda _2,w)& =& -\tfrac {1+i}{8}w-\tfrac {3+4 i}{32}w^2-\tfrac {20+31 i}{256}w^3+\dots \nonumber \\ \delta _1(\lambda _3,w)& =& -\tfrac {1}{8}w-\tfrac {7}{64}w^2-\tfrac {13}{128}w^3+\dots \nonumber \\ \delta _1(\lambda _4,w)& =& -\tfrac {1-i}{8}w-\tfrac {3-4 i}{32}w^2-\tfrac {20-31 i}{256}w^3+\dots \nonumber \\ \delta _2(\lambda _1,w)& =& -\tfrac {1}{8}w-\tfrac {7}{64}w^2-\tfrac {13}{128}w^3+\dots \nonumber \\ \delta _2(\lambda _2,w)& =& -\tfrac {1-i}{8}w-\tfrac {3-4 i}{32}w^2-\tfrac {20-31 i}{256}w^3+\dots \nonumber \\ \delta _2(\lambda _3,w)& =& 1+\tfrac {3}{8}w+\tfrac {19}{64}w^2+\tfrac {33}{128}w^3+\dots \nonumber \\ \delta _2(\lambda _4,w)& =& -\tfrac {1+i}{8}w-\tfrac {3+4 i}{32}w^2-\tfrac {20+31 i}{256}w^3+\dots \nonumber \\ \delta _3(\lambda _1,w)& =& -\tfrac {1-i}{8}w-\tfrac {3-4 i}{32}w^2-\tfrac {20-31 i}{256}w^3+\dots \nonumber \\ \delta _3(\lambda _2,w)& =& 1+\tfrac {3}{8}w+\tfrac {19}{64}w^2+\tfrac {33}{128}w^3+\dots \nonumber \\ \delta _3(\lambda _3,w)& =& -\tfrac {1+i}{8}w-\tfrac {3+4 i}{32}w^2-\tfrac {20+31 i}{256}w^3+\dots \nonumber \\ \delta _3(\lambda _4,w)& =& -\tfrac {1}{8}w-\tfrac {7}{64}w^2-\tfrac {13}{128}w^3+\dots \nonumber \\ \delta _4(\lambda _1,w)& =& -\tfrac {1+i}{8}w-\tfrac {3+4 i}{32}w^2-\tfrac {20+31 i}{256}w^3+\dots \nonumber \\ \delta _4(\lambda _2,w)& =& -\tfrac {1}{8}w-\tfrac {7}{64}w^2-\tfrac {13}{128}w^3+\dots \nonumber \\ \delta _4(\lambda _3,w)& =& -\tfrac {1-i}{8}w-\tfrac {3-4 i}{32}w^2-\tfrac {20-31 i}{256}w^3+\dots \nonumber \\ \delta _4(\lambda _4,w)& =& 1+\tfrac {3}{8}w+\tfrac {19}{64}w^2+\tfrac {33}{128}w^3+\dots \nonumber \end{eqnarray}

where \(\zeta _l(w),\) for \(l=\overline{1,4},\) are given by \(\zeta _1(w)=(w-1)^{1/4},\) \(\zeta _2(w)=-(w-1)^{1/4},\) \(\zeta _3(w)=i(w-1)^{1/4}\) and \(\zeta _4(w)=-i(w-1)^{1/4}.\) â–¡

Remark B.10

From Lemma B.6 we see that

\begin{equation} |\delta _l^{(m)}(\lambda _k,w)| {\sim } \frac{C_m}{s^{m+1}}.\label{constC} \end{equation}
B.-9

where \(m\) is the multiplicity of the nearest critical point of \(p\) outside the lemniscate. â–¡

To be able to compute the constants \(C\) from equation (B.-9) for polynomials of degree \(d\geq 4,\) we need to separate the spectrum by perturbing the roots with an angle \(\varepsilon \) small enough and by dropping the magnitude \(\rho \) below \(1.\) The perturbations for polynomials of degree \(4,6,8,10,12\) and \(14\) are described in the next section.

Note that before the perturbation we have multiple critical points, all inside the lemniscate, except for \(0,\) which is on the level curve. After we perturb the roots, one critical point is left outside the lemniscate, while all the others remain inside. Therefore, in our case, the multiplicity \(m\) is zero.

From now on we will choose a random value for \(\varepsilon \) to make some experimental computations. All the computations below will work properly for any other random value \(\varepsilon \) small enough that ensures the desired separation.

Now, if we choose a random perturbation with, for example, \(\varepsilon =\pi /70,\) we get the following values for the constant \(C\)

Degree

4

6

8

10

12

14

Constant C

576.4344

1.4665

8.0721

2.2754

12.8520

4.0475

Table B.1 The constant \(C\).

The computations for \(\sum |\delta _l(\lambda _k,w)|\) were made with Mathematica and the values for \(s,\) the smallest distance from the lemniscate to the nearest critical point, were computed with the help of Tiina Vesanen that provided a Matlab program. The codes can be found in the appendix of [ 1 ] , which is a preprint version on this article, that contains in appendix material which is not included in this paper. For all these computations one has to choose a value for the level \(\rho \), smaller than \(1\). If one chooses level \(\rho =1,\) then the value for \(s\) will be zero, since the lemniscate passes through origin. Therefore we have chosen the minimum value for the level \(\rho \) such that the lemniscate separates only in two parts when having a perturbation with \(\varepsilon =\pi /70.\) Hence we have registered the following data

Degree

4

6

8

10

12

14

\(\sum |\delta _l(\lambda _k,w)|\)

2293.81

6.5122

46.6599

16.3586

83.4547

16.7046

\(s\)

0.2513

0.2252

0.1730

0.1391

0.1540

0.2423

Table B.2 Experimental values which help in computing the constant \(C\) in Table B.1.

From the table of constants \(C\) one can see that the lemniscate bifurcates differently even with a small \(\varepsilon .\)

Remark B.11

For the quadratic polynomial \(p(z)=z^{2}-1,\) one does not need to perturb the roots, but just to decrease the magnitude of \(\rho \) below \(1.\) In this case, for example, if the level is \(\rho = 0.9\) then one gets \(C=0.2,\) while if the level is \(\rho = 0.99\) then \(C=0.6956 .\) â–¡

B.2 Application to Riesz projection

Let \(A\) be a bounded operator in a Hilbert space such that

\[ V_{p^n}(A)=\{ z \ : \ |p(z)^{n}| \le \| p(A)^n\| \} . \]

In order to compute the Riesz projection we take \(\varphi =1\) in one of the components and \(\varphi =0\) in the other components of \(V_{p^{n}}(A)\) and

\begin{equation} f_j(p(z))=\sum _{k=0}^{n-1} p(z)^{k} f_{j,k}(p(z)^{n}). \nonumber \end{equation}
B.-8

We shall apply the following theorem, if \(\varphi \) is holomorphic in a neighbourhood of the unit disk \(\mathbb {D}\) and \(A\in \mathcal{B}(H),\) then

\begin{equation} \| \varphi (A)\| \leq \sup _{\mathbb {D}} |\varphi |, \label{star} \end{equation}
B.-8

see e.g. [ 11 ] .

From Theorem B.5 we have for \(|p(z)|\leq \rho \) that

\begin{equation} |p(z)^{k}f_{j,k}(p(z)^{n})|\leq \Big(1+\tfrac {C}{s^{d-1}}\Big)\sup _{|p(z)|\leq \rho } |\varphi (z)|, \label{jfk_bound} \end{equation}
B.-7

and since \(f_{j,k}\) are analytic, we can apply (B.-8) to each of them, so with \(\rho =\| p(A)^{n}\| ^{1/n}\)

\begin{equation} \| f_{j,k}(p(A)^{n})\| \leq \sup _{|p(z)|\leq \rho } |f_{j,k}(p(z)^{n})|. \label{vonN} \end{equation}
B.-6

We have,

\[ \| p(A)^{k} f_{j,k} (p(A)^{n}) \| \leq \| p(A)^{k}\| \| f_{j,k} (p(A)^{n})\| \]

and from (B.-6) we get

\[ \| p(A)^{k} f_{j,k} (p(A)^{n}) \| \leq \| p(A)^{k}\| \sup _{|p(z)|\leq \rho } |f_{j,k}(p(z)^{n})| . \]

By the Maximum principle we see that

\[ \| p(A)^{k}\| \sup _{|p(z)|\leq \rho } |f_{j,k}(p(z)^{n})| = \| p(A)^{k}\| \rho ^{-k} \sup _{|p(z)|\leq \rho } | p(z)^{k} f_{j,k}(p(z)^{n})|. \]

Therefore, by (??), we have

\[ \| p(A)^{k} f_{j,k} (p(A)^{n}) \| \leq \| (p(A)/\rho )^{k} \| \left( 1+\tfrac {C}{s^{d-1}}\right) \sup _{|p(z)|\leq \rho } |\varphi (z)|. \]

Substituting now in the decomposition (B.1), \(\varphi (A)\) becomes the Riesz projection and it is bounded by

\begin{equation} \label{RieszBound} \| \varphi (A) \| \leq \left[ \left( 1+\tfrac {C}{s^{d-1}}\right) \tfrac {\| p(A)^{k} \| }{\rho ^{k}} \sum _{j=1}^{d} \| \delta _j(A)\| \right] \sup _{|p(z)|\leq \rho } |\varphi (z)|. \end{equation}
B.-5

Thus we have proven the following theorem.

Theorem B.12

Given a polynomial \(p\) of degree \(d,\) with distinct roots and a bounded operator \(A\) in a Hilbert space, we assume that the "expression"

\[ V_{p^{n}}(A)=\{ z \ : \ |p(z)^{n}| \le \| p(A)^n\| \} \]

has at least two components. Set \(\rho =\| p(A)^{n}\| ^{1/n}\) and let \(\varphi \) be a function such that \(\varphi =1\) in one component of \(V_{p^{n}}(A)\) and \(\varphi =0\) in the others. Let \(s\) be the distance from the nearest outside critical point to the boundary of \(V_{p^{n}}(A)\).

Then, considering \(f_{j,k}\) as given by Theorem B.1, one has

\[ \varphi (A) = \sum _{j=1}^{d}\delta _j(A)\sum _{k=0}^{n-1}p(A)^{k}f_{j,k}(p(A)^{n}), \]

which is the Riesz spectral projection onto the invariant subspace corresponding to the spectrum inside the component where \(\varphi =1.\)

The bound for the norm of \(\varphi (A)\) is given by (B.-5).

Remark B.13

In applications we will consider \(\varphi =1\) in the components of \(V_{p^{n}}(A)\) that are on the right complex half-plane and \(\varphi =-1\) in the components on the left complex half-plane. This way the computations are more symmetrical. In this situation there is only one critical point at the origin, which is simple, so we will have \(\displaystyle \left(1+\tfrac {C}{s}\right)\) in the formula for the bound of the Riesz projection. â–¡

C Separating polynomials

C.1 Separation tasks

In this section we shall discuss separating issues by lemniscates. To that end, given a polynomial \(p\) denote by \(V(p, \rho )\) the set

\[ V(p, \rho )= \{ z \in \mathbb C \ : |p(z)| \le \rho \} . \]

A key result in this context is the following. Let \(K\subset \mathbb C\) be compact and such that \(\mathbb C \setminus K\) is connected. For \(\delta {\gt}0\) denote further

\[ K(\delta ) = \{ z \ : \ \text{dist}( z, K) {\lt} \delta \} . \]

Then there exists a polynomial \(p\) and \(\rho {\gt}0\) such that

\[ K \subset V(p,\rho ) \subset K(\delta ). \]

In particular, if \(K=K_1 \cup K_2\) and \(K_1 \cap K_2= \emptyset \) since \(K\) is compact, then for small enough \(\delta \)

\[ K_1(\delta ) \cap K_2(\delta )=\emptyset \]

as well. Thus, \(V(p,\rho )\) separates the components \(K_1\) and \(K_2\) respectively. Suppose that we have two analytic functions \(\varphi _j\), each analytic in \(K_j(\delta )\). We can view them as just one analytic function

\[ \varphi : K(\delta ) \rightarrow \mathbb C , \]

where \(\varphi \) agrees to \(\varphi _j\) on \(K_j(\delta )\). We are interested in particular in the case where \(\varphi _j\) is constant. Multicentric representation then gives a power series which is simultaneously valid in both components.

So, we can ask, for such a separation task what is the smallest degree of a polynomial achieving this.

We model this as follows: Let

\[ K_1(\delta ) = \{ z=1 +iy \ : |y| \le \tan (\delta ) \} \]

and \(K_2(\delta )\) symmetrically on the other side of the imaginary axis:

\[ K_2(\delta ) = \{ z=-1 +iy \ : |y| \le \tan (\delta ) \} \]

Our first problem concerns the minimal degree of a polynomial \(p\) such that

\[ K_j(\delta ) \subset V(p, \rho ) \]

and

\[ V(p,\rho ) \cap i\mathbb R = \emptyset . \]

This is discussed in the next subsection.

Another natural task is related to existence of a logarithm. So, \(C\) is again compact and we assume \(0 \notin C\). Now there exists a single valued logarithm in \(C\) if and only if \(0\) is in the unbounded component of the complement of \(C\). That is, the set \(C\) does not separate origin from infinity. The natural task here is to find a polynomial \(p\) such that

\[ C \subset V(p,\rho ) \]

and

\[ 0\notin V(p,\rho ). \]

As \(V(p,\rho )\) is polynomially convex, this suffices for representing the logarithm in \(V(p,\rho )\).

C.2 Model problem

Let \(p(z)=z^d-1\) be a monic polynomial of complex variable \(z,\) with \(|p(z)|=1.\) The polynomial \(p\) can be written as

\[ p(z)=\prod _{j=1}^{d}(z-e^{i \theta _j}) \]

where \(e^{i\theta _j}\) are the roots of \(p\) and \(\theta _j\) are the angles of the roots. For \(d=4m,\) \(m\geq 1\) we have

\[ p(z)=\prod _{j=1}^{m} (z-e^{i \theta _j})\prod _{j=1}^{m} (z-e^{-i \theta _j})\prod _{j=1}^{m} (z+e^{i \theta _j})\prod _{j=1}^{m} (z+e^{-i \theta _j}). \]

We are interested to separate the spectrum of \(p,\) as discussed earlier in this paper. In this sense, we first check if there are roots laying on the imaginary axis. If so, a rotation with angle \(\pi /d\) is applied so that no roots touch \(i\mathbb {R}.\)

Next we perturb the roots as follows: the four roots that are closest to the imaginary axis (complex roots) are moved along the unit circle towards the real axis with small angle \(\varepsilon .\) Then the level \(\rho \) is slightly decreased with \(\eta .\) This approach is used for polynomials of degree \(d\geq 4.\) The quadratic polynomial case is shortly discussed below and cubic polynomial case is presented as a first example.

To this end we have to find the maximum \(\eta \) for a chosen \(\varepsilon \) so that the spectrum gets separated in only two parts, one on the right hand side of the imaginary axis and one on the left hand side. Also we can find the maximum angle \(\alpha \) (see Figure 1) such that the spectrum will lay inside our lemniscate.

\includegraphics{lemni3level1-004-general.png}
Figure 1 Level 0.996.

Now we analyze some cases. The quadratic polynomial, \(p(z)=z^{2}-1\) is the classical lemniscate and for this one just has to decrease the magnitude \(\rho \) to below 1. No perturbation of the roots is needed. For a decrease with \(\eta =0.01 \) of the level we get the picture from Figure 2:

\includegraphics{lemni2level099-new}
Figure 2 Level 0.99.

Example C.1

Let \(p(z)=z^{3}-1\) with roots \(e^{2\pi i/3},\) \(e^{-2\pi i/3}\) and \(1.\) We apply a perturbation with \(\varepsilon \) to the complex roots and we get

\[ p_\varepsilon (z)=(z-e^{i(2\pi /3+\varepsilon )})(z-e^{-i(2\pi /3+\varepsilon }))(z-1) . \]

Thus the lemniscate is \(|p_\varepsilon (z)|=1-\eta .\) For \(\varepsilon =\pi /70\) the resulted picture is shown in Figure ??.â–¡

\includegraphics{lemni3level1-new}

Figure 3 Level 1.

 

\includegraphics{lemni3level1-004-new}

Figure 4 Level 0.996.

Example C.2

Let \(p(z)=z^{4}-1.\) Since there are roots on the imaginary axis, we have to apply a rotation with \(\pi /4.\) Thus our polynomial becomes \(p(z)=z^{4}+1.\)

Then

\[ p(z)=(z-e^{i \theta })(z-e^{-i\theta })(z+e^{i\theta })(z+e^{-i\theta }) \]

where \(\theta = \pi /4,\) so we have

\begin{equation} p(z)= (z^{2}-e^{2i\theta })(z^{2}-e^{-2i\theta })=z^{4}-2\cos (2\theta )z^{2}+1. \label{eq3.2} \end{equation}
C.1

Next, we decrease the angle \(\theta \) with \(\varepsilon \) small enough and denote the new angle \(\theta _\varepsilon =\theta -\varepsilon .\) For the polynomial with perturbed \(\theta \) we compute the lemniscate.

We have that

\begin{eqnarray} e^{2i\theta _\varepsilon } & =& e^{2i\theta -2i\varepsilon }=e^{i\pi /2-2i\varepsilon } \nonumber \\ & =& i(1-2i\varepsilon -2\varepsilon ^{2}+\dots )=2\varepsilon +i(1-2\varepsilon ^{2}+\dots ). \nonumber \end{eqnarray}

For \(z=t\in \mathbb {R}\)

\begin{eqnarray} |p_{\theta _\varepsilon }(t)| & =& |t^{2}-2\varepsilon +i(-1+2\varepsilon ^{2}+\dots )|^{2} \nonumber \\ & =& t^{4}-4t^{2}\varepsilon +1+ o(\varepsilon ^{2}). \label{eq3.3} \end{eqnarray}

For \(z=it\in \mathbb {C}\)

\begin{eqnarray} |p_{\theta _\varepsilon }(it)| & =& |(it)^{2}-2\varepsilon +i(-1+2\varepsilon ^{2}+\dots )|^{2} \nonumber \\ & =& t^{4}-4t^{2}\varepsilon +1+ o(\varepsilon ^{2}). \nonumber \end{eqnarray}

We write \(t=z=x+iy\) in (C.2) and compute the lemniscate \(l: |p_{\varepsilon }(z)|=1-\eta ,\) where \(p_{\varepsilon }(z)=z^{4}-2\cos (2\theta _\varepsilon )z^{2}+1\) from (C.1). Thus

\begin{align*} l : & \text{ } (x^{2}+y^{2})^{4}+2(x^{2}+y^{2})^{2}-16x^{2}y^{2}+16\varepsilon ^{2}(x^{2}+y^{2})^{2} \nonumber \\ & -8\varepsilon (x^{6}+x^{4}y^{2}-x^{2}y^{4}-y^{6}+x^{2}-y^{2})=0. \label{eq3.4} \end{align*}

For \(\varepsilon =\pi /70\) the results are shown in Figure ??. â–¡

\includegraphics{lemni4level1-new}

Figure 5 Level 1.

 

\includegraphics{lemni4level1-003-new}

Figure 6 Level 0.997.

Similar computations were made for polynomials of degree \(6,8,10,12\) and \(14,\) and these can be seen in [ 1 ] .

Remark C.3

The goal was to find the maximum angle \(\alpha \) such that the spectrum lays inside the lemniscate. For this, one can compute the ratio \(a/b,\) where \(a\) and \(b\) are the length of the lines \(a\) and \(b\) from Figure 7 below, and hence the angle \(\alpha \) is

\begin{equation} \alpha =\arctan (a/b). \nonumber \end{equation}
C.2
\includegraphics{Ratio_picture.png}
Figure 7 Ratio and angle \(\alpha \).

Note that, for a minimum level one might have that the line \(a\) cuts the lemniscate, situation that happens even for a slightly perturbation of the level in all the cases with polynomials of degree \(d\geq 6.\) Therefore, one has to consider a smaller angle.


For a minimum level \(\rho \) of the lemniscate and a perturbation with \(\varepsilon =\pi /70\) we have the values for the ratio from Table C.1 or from the chart in Fig. 8.

 

Degree 4

Degree 6

Degree 8

Degree 10

Degree 12

Degree 14

a

0.5637

0.9040

0.9905

1.0043

0.9973

0.9846

b

0.3090

0.3767

0.3790

0.3624

0.3402

0.3176

a/b

1.8242

2.3997

2.6134

2.7712

2.9315

3.1001

Table C.1 Ratio.
\includegraphics{Ratio_chart_MinLength}
Figure 8 Ratio chart.

The maximum angle \(\alpha \) that we have found is presented in the chart from Figure 9. â–¡

\includegraphics{MaxAngle_MinLength}
Figure 9 Maximum angle \(\alpha \).

Remark C.4

The quadratic polynomial is a special case since the only perturbation applied is decreasing the level. In this case, the maximum angle is when the level \(\rho \) is unchanged, in this case \(\alpha =45^{\circ }\) and the minimum angle goes to \(0^{\circ }\) when the level is significantly decreased with \(\eta =0,99.\) For a slightly change with \(\eta =0.01\) we have found an angle of \(29.92^{\circ }\) and for a change with \(\eta =0.1\) we have registered an angle of \(26.57^{\circ }.\) â–¡

Remark C.5

For polynomials of degree \(d\geq 4\), it is easy to check the maximum value of \(\eta ,\) i.e. the minimum value that the level can have, such that we get the desired separation. For example, if again the perturbation is \(\varepsilon =\pi /70\) we get the values from Table C.2.

Degree

4

6

8

10

12

14

\(\eta \)

0.008

0.0078

0.0038

0.0021

0.0022

0.0045

Table C.2 Maximum change \(\eta \) of the level \(\rho \).

With these values the lemniscate squeezes next to the closest critical point to the perturbed root. A bigger decrease of the level would force that critical point to get out from the interior of the lemniscate and thus one doesn’t get the desired separation. These estimates may not be the best but they are what we have reached by manipulating the pictures and the resulting pictures can be seen in [ 1 ] . â–¡

D Concluding remarks

In Section \(2\) we presented the closed formula and the bound for the Riesz projection, in Section \(3\) we described the separation process and in this section one can see how the expansions on the Riesz projection look like when taking \(\varphi =1\) for the components on the right side of the imaginary axis and \(\varphi =-1\) for the ones on the left side. We will finish this paper with an example that shows what the effects are on the Riesz projection.

In the appendices of [ 1 ] one can find applications that explicitly compute the series expansions for \(f_j\)’s in the decomposition

\[ \varphi (z)= \sum _{j=1}^d \delta _j(z) f_j(w), \quad w=p(z), \]

when \(\varphi \) is identically \(1\) on the right half plane and \(-1\) on the left half plane, for the quadratic, the quartic, the perturbed quartic polynomial and the polynomial \(q(z)=p(z)^{n}=(z^{2}-1)^{n}\), respectively.

Remark D.1

In using multicentric calculus a central problem is to find a polynomial \(p\) such that \(p(A)\) has small norm and, when aiming for Riesz projection, that the lemniscate on the level of \(\| p(A)\| \) separates the spectrum into different components. This can be done, for example, by minimizing \(\| p(A) \| \) approximately over a set of polynomials, or, by using a suitable \(p\) which has been computed for a neighbouring matrix.

Alternatively, and that is the main topic here, one search for polynomials \(p\) such that it is small in a neighbourhood of the spectrum of \(A.\) And then computes high enough power \(p(A)^{2^{m}}\) such that \(\| p(A)^{2^{m}} \| ^{1/2^{m}} \sim \rho (p(A)).\) â–¡

In the following example we point out with the help of a low-dimensional problem, how the size of coupling can affect on the need of taking a high power of \(p(A).\)

Example D.2

Let

\begin{equation} A=\begin{pmatrix} B & X \\ 0 & -B \end{pmatrix} \end{equation}
D.1

be a \(4 \times 4\) matrix where

\begin{equation} B=\begin{pmatrix} \alpha & 1 \\ 0 & \alpha \end{pmatrix} \end{equation}
D.2

and

\begin{equation} X=\begin{pmatrix} 0 & \gamma \\ \gamma & 0 \end{pmatrix}. \end{equation}
D.3

In this example we could take \(p(z)=z^2-\alpha ^2\) to actually get a closed form for the projection. However, we take \(p(z)=z^2-1\) as our polynomial and then the effect of \(\alpha {\gt}0\) being close or further away from 1 models the lack of exact knowledge on the spectrum. We are interested in having \(\| p(A)^n\| {\lt}1\) and ask how the parameters \(\alpha \) and \(\gamma \) contribute to the value of \(n\) needed. Qualitatively it is clear that such an \(n\) exists if and only if \( \alpha {\lt} \sqrt2\), independently of \(\gamma \).

Substituting \(A\) into \(p\) we have

\[ p(A)= \begin{pmatrix} C & Y \\ 0 & C \end{pmatrix}, \]

where

\[ C=\begin{pmatrix} \alpha ^2-1 & 2\alpha \\ 0 & \alpha ^2 -1 \end{pmatrix} \]

and

\[ Y=\begin{pmatrix} \gamma & 0 \\ 0 & - \gamma \end{pmatrix}. \]

A short computation shows that

\[ p(A)^n = \begin{pmatrix} C^n & n(\alpha ^2-1)^{n-1} Y \\ 0 & C^n \end{pmatrix}. \]

Thus, we have

\[ \| p(A)^n\| \sim |\alpha ^2-1|^{n-1} \Big[ |\alpha ^2-1| + n(|\alpha | + |\gamma |) \Big], \]

so that if \(| \alpha ^2 - 1| \ll 1\) then a small \(n\) shall work. If however, \(|\alpha ^2-1|= 1- \varepsilon \) with \(0{\lt}\varepsilon \ll 1\), modelling the case when e.g. spectrum of \(A\) is scattered inside the lemniscate, then the behavior is of the nature

\[ \| p(A)^n\| \sim (1-\varepsilon )^n(n+1), \]

which becomes below \(1\) only for \(n \gg 1/\varepsilon \). â–¡