Local convergence of a two-step Gauss-Newton Werner-type method for solving least squares problems

Ioannis K. Argyros1, Santhosh George2
(Received: October 19, 2018; accepted: July 9, 2024; published online: July 11, 2024.)
Abstract.

The aim of this paper is to extend the applicability of a two-step Gauss-Newton-Werner-type method (TGNWTM) for solving nonlinear least squares problems. The radius of convergence, error bounds and the information on the location of the solution are improved under the same information as in earlier studies. Numerical examples further validate the theoretical results.

Key words and phrases:
Gauss-Newton method, Werner’s method, local convergence, least squares problem, average Lipschitz condition.
2005 Mathematics Subject Classification:
65G99, 65J15, 65H10, 65N35, 47H17, 49M15.
1Department of Mathematical Sciences, Cameron University, Lawton, OK 73505, USA, e-mail: iargyros@cameron.edu.
2Department of Mathematical and Computational Sciences, National Institute of Technology Karnataka, India-575 025, e-mail: sgeorge@nitk.edu.in.

1. Introduction

Let i, j be natural numbers with i ≥ j. Let also Ω be an open and convex subset of ℝ^j. We are concerned with the solution p of the least squares problem [4, 5, 6, 7, 8, 9]:

(1.1) min_{x∈Ω} f(x) := (1/2)F(x)ᵀF(x),

where F : Ω → ℝ^i is a Fréchet-differentiable mapping. Numerous problems can be brought into the form (1.1) using mathematical modeling [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13]. Closed-form solutions can be found only in special cases, which explains why most solution methods for problem (1.1) are iterative. Let x₀, y₀ ∈ Ω and set z₀ = (x₀ + y₀)/2. In the present study, we provide the local convergence analysis of TGNWTM, defined for each n = 0, 1, 2, … by

(1.2)
xₙ₊₁ = xₙ − AₙF(xₙ),
yₙ₊₁ = xₙ₊₁ − AₙF(xₙ₊₁),
zₙ = (xₙ + yₙ)/2,

where Aₙ = [F′(zₙ)ᵀF′(zₙ)]⁻¹F′(zₙ)ᵀ. If i = j, TGNWTM reduces to a Gauss-Newton-Werner-type method [3, 8, 9]. Notice that in each iteration the matrix F′(zₙ)ᵀF′(zₙ) has to be inverted only once. Therefore, the computational cost is essentially the same as in the Gauss-Newton method. The LLᵀ decomposition of F′(zₙ)ᵀF′(zₙ) costs O(n³) floating-point operations (flops), leading to the computation of xₙ₊₁. It then follows from the second substep of method (1.2) that only O(n²) additional flops are needed for the computation of yₙ₊₁.
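For readers who want to experiment, one iteration of (1.2) can be sketched numerically as follows. This is a minimal Python sketch: the residual F, its Jacobian dF and the test problem are illustrative assumptions, not taken from the paper. The Cholesky factor of F′(zₙ)ᵀF′(zₙ) is computed once per iteration and reused for both substeps, reflecting the O(n³)/O(n²) cost split noted above.

```python
import numpy as np

def tgnwtm_step(F, dF, x, y):
    """One iteration of the two-step Gauss-Newton-Werner-type method (1.2)."""
    z = (x + y) / 2.0                      # z_n = (x_n + y_n)/2
    J = dF(z)                              # Jacobian F'(z_n), shape (i, j)
    C = np.linalg.cholesky(J.T @ J)        # LL^T factorization, done once
    solve = lambda r: np.linalg.solve(C.T, np.linalg.solve(C, r))
    x_new = x - solve(J.T @ F(x))          # first substep: x_{n+1}
    y_new = x_new - solve(J.T @ F(x_new))  # second substep reuses the factor
    return x_new, y_new

# Illustrative overdetermined residual (i = 3 > j = 2), an assumption for
# demonstration only; its least squares solution is p = (0, 0).
F = lambda v: np.array([v[0], v[1], v[0]**2 + v[1]**2])
dF = lambda v: np.array([[1.0, 0.0], [0.0, 1.0], [2*v[0], 2*v[1]]])

x, y = np.array([0.3, -0.2]), np.array([0.25, -0.15])
for _ in range(6):
    x, y = tgnwtm_step(F, dF, x, y)
```

Starting from the (assumed) initial pair above, both iterates converge rapidly to the solution p = (0, 0).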

The local convergence analysis of method (1.2) was given in the elegant paper by Shakhno et al. [9] (see also related work in [3, 8]). Their convergence analysis uses an average Lipschitz continuity condition as well as classical Lipschitz conditions.

Using the concept of average Lipschitz continuity [12] and our new idea of restricted convergence domains, we present a local convergence analysis with the following advantages (A) over works using the same information [3, 4, 8, 9, 11, 12, 13]:

(a) a larger radius of convergence;

(b) tighter error bounds on the distances ‖xₙ − p‖;

(c) at least as precise information on the location of the solution p.

Achieving (a)-(c) is very important in computational sciences, since: (a) We obtain a wider choice of initial guesses; (b) Fewer iterates are required to obtain a desired error tolerance; (c) Better information about the ball of convergence is obtained.

The rest of the paper is structured as follows: Section 2 contains the local convergence analysis of method (1.2), whereas special cases and applications are presented in the concluding Section 3.

2. Local convergence analysis

Set U(w,ρ) = {v ∈ ℝ^j : ‖v − w‖ < ρ} to be the open ball in ℝ^j and denote by U¯(w,ρ) its closure. Let R > 0. Define the parameter R₁ by R₁ := sup{t ∈ [0,R] : U¯(p,t) ⊆ Ω}. The convergence analysis of numerous iterative methods has been given using the following concept due to Wang [12]:

Definition 1.

The mapping F : U¯(p,R₁) → ℝ^i has a Fréchet derivative F′ satisfying the Lipschitz condition with L₁ average on U¯(p,R₁) if

‖F′(x) − F′(y)‖ ≤ ∫₀^{‖x−y‖} L₁(u) du for each x, y ∈ U¯(p,R₁),

where L1 is a positive non-decreasing function.

It turns out that the convergence analysis of iterative methods based on the preceding notion can be improved as follows:

Definition 2.

The mapping F : U¯(p,R₁) → ℝ^i has a Fréchet derivative F′ satisfying the center-Lipschitz condition with L₀ average on U¯(p,R₁) if

‖F′(x) − F′(p)‖ ≤ ∫₀^{‖x−p‖} L₀(u) du for each x ∈ U¯(p,R₁),

where L0 is a positive non-decreasing function.

Clearly, we have that

(2.1) L₀(u) ≤ L₁(u) for each u ∈ [0,R₁],

and L₀/L₁ can be arbitrarily small [2, 3, 4]. Let β > 0 be a parameter. Suppose that the equation

(2.2) β∫₀^{t} L₀(u) du = 1

has positive solutions. Denote by R₀ the smallest such solution. Notice, for example, that R₀ exists if

(2.3) β∫₀^{R₁} L₀(u) du ≥ 1.

Indeed, the function g(t) := β∫₀^{t} L₀(u) du − 1 satisfies g(0) = −1 < 0 and g(R₁) = β∫₀^{R₁} L₀(u) du − 1 ≥ 0. The existence of R₀ follows from the intermediate value theorem.
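The intermediate value argument above can be mimicked numerically by bisection on g. In the sketch below the choice L₀(u) = eᵘ with β = 1 is an illustrative assumption, for which the integral has the closed form eᵗ − 1 and (2.2) can be solved exactly: R₀ = ln 2.

```python
import math

# g(t) = beta * Integral_0^t L0(u) du - 1 for the assumed choice
# L0(u) = e^u, beta = 1, so g(t) = e^t - 1 - 1 and R0 = ln 2 exactly.
beta, R1 = 1.0, 1.0
g = lambda t: beta * (math.exp(t) - 1.0) - 1.0

lo, hi = 0.0, R1
assert g(lo) < 0 < g(hi)            # hypotheses of the IVT argument
for _ in range(60):                 # bisect the sign change down to ~1e-18
    mid = (lo + hi) / 2.0
    lo, hi = (mid, hi) if g(mid) < 0 else (lo, mid)
R0 = (lo + hi) / 2.0                # smallest positive zero of g
```

The computed R0 agrees with the closed-form value ln 2 ≈ 0.6931 for this sample integrand.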

Definition 3.

The mapping F : U¯(p,R₁) → ℝ^i has a Fréchet derivative F′ satisfying the restricted Lipschitz condition with L average on U¯(p,R₀) if

‖F′(x) − F′(y)‖ ≤ ∫₀^{‖x−y‖} L(u) du for each x, y ∈ U¯(p,R₀),

where L is a positive non-decreasing function.

We have that

(2.4) L(u) ≤ L₁(u) for each u ∈ [0,R₀],

since R₀ ≤ R₁. Throughout this paper, we suppose that

(2.5) L₀(u) ≤ L(u) for each u ∈ [0,R₀],

unless otherwise stated. Otherwise, i.e., if

(2.6) L(u) ≤ L₀(u) for each u ∈ [0,R₀],

then the results that follow hold with L₀ replacing L. Moreover, we need the following definitions:

Definition 4 ([12]).

Let F : U¯(p,R₁) → ℝ^i be a twice Fréchet-differentiable mapping. We say that F″ satisfies the Lipschitz condition with M₁ average on U¯(p,R₁) if

‖F″(x) − F″(y)‖ ≤ ∫₀^{‖x−y‖} M₁(u) du for each x, y ∈ U¯(p,R₁),

where M1 is a positive nondecreasing function.

Definition 5.

Let F : U¯(p,R₀) → ℝ^i be a twice Fréchet-differentiable mapping. We say that F″ satisfies the restricted Lipschitz condition with M average on U¯(p,R₀) if

‖F″(x) − F″(y)‖ ≤ ∫₀^{‖x−y‖} M(u) du for each x, y ∈ U¯(p,R₀),

where M is a positive nondecreasing function.

We have that

(2.7) M(u) ≤ M₁(u) for each u ∈ [0,R₀].

It is worth noticing that the definition of the functions L and M (based on L₀ and R₀) was not possible in earlier studies using L₁ and M₁. That is, L = L(L₀,R₀,R₁) and M = M(L₀,R₀,R₁), whereas L₁ = L₁(R₁) and M₁ = M₁(R₁). It turns out that L₀ can replace the less precise L₁ in the computation of the upper bounds on the norms of the inverses of the operators involved, and U¯(p,R₀), L, M can replace U¯(p,R₁), L₁, M₁, respectively, in the proofs of such results. Moreover, notice that the iterates xₙ lie in U¯(p,R₀), which is a more precise location than the set U¯(p,R₁) used in earlier studies [2, 3, 4, 8, 11, 12, 13]. We shall make the paper as self-contained as possible by stating some standard auxiliary concepts and results.

Denote by ℝ^{i×j} the set of all i×j matrices. The Moore-Penrose pseudo-inverse of a full rank matrix A ∈ ℝ^{i×j} is defined by A⁺ = (AᵀA)⁻¹Aᵀ [6].
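As a quick sanity check of this formula, one can compare the normal-equations expression with the SVD-based pseudo-inverse on a full-rank matrix; the matrix below is an arbitrary illustrative choice, not taken from the paper.

```python
import numpy as np

# For a full-rank (rank = j) matrix A in R^{i x j}, the Moore-Penrose
# pseudo-inverse reduces to the normal-equations formula (A^T A)^{-1} A^T.
A = np.array([[1.0, 2.0], [0.0, 1.0], [3.0, -1.0]])   # 3x2, rank 2
A_plus = np.linalg.inv(A.T @ A) @ A.T

ok_pinv = np.allclose(A_plus, np.linalg.pinv(A))      # agrees with SVD-based pinv
ok_left = np.allclose(A_plus @ A, np.eye(2))          # A^+ is a left inverse
```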

Lemma 2.1 ([2, 6]).

Let A, A₁ ∈ ℝ^{m×n}. Assume that A₂ = A + A₁, ‖A⁺‖‖A₁‖ < 1, and rank(A) = rank(A₂). Then,

‖A₂⁺‖ ≤ ‖A⁺‖/(1 − ‖A⁺‖‖A₁‖).

If rank(A) = rank(A₂) = min{m,n}, the following holds:

‖A₂⁺ − A⁺‖ ≤ √2‖A₂⁺‖‖A⁺‖‖A₁‖.
Lemma 2.2 ([5]).

Let A, A₁ ∈ ℝ^{m×n}. Assume that A₂ = A + A₁, ‖A₁‖‖A⁺‖ < 1, and rank(A) = n. Then rank(A₂) = n.

Lemma 2.3 ([12]).

Let φ(t) = (1/t)∫₀^{t} P(u) du, 0 < t ≤ ρ, where P(u) is a positive integrable function, monotonically non-decreasing on [0,ρ]. Then φ(t) is monotonically non-decreasing with respect to t.

Lemma 2.4 ([12]).

Let ψ(t) = (1/t³)∫₀^{t} Q(u)(t−u)² du, 0 < t ≤ ρ, where Q(u) is a positive integrable function, monotonically non-decreasing on [0,ρ]. Then ψ(t) is monotonically non-decreasing with respect to t.
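Lemmas 2.3 and 2.4 can be checked numerically for a concrete non-decreasing integrand. With the illustrative (assumed) choice P(u) = Q(u) = eᵘ, both integrals have closed forms: ∫₀ᵗ eᵘ du = eᵗ − 1 and ∫₀ᵗ eᵘ(t−u)² du = 2eᵗ − t² − 2t − 2.

```python
import math

# phi(t) = (1/t)   * Integral_0^t e^u du
# psi(t) = (1/t^3) * Integral_0^t e^u (t-u)^2 du, via the closed forms above
phi = lambda t: (math.exp(t) - 1.0) / t
psi = lambda t: (2.0 * math.exp(t) - t * t - 2.0 * t - 2.0) / t**3

ts = [0.1 * k for k in range(1, 11)]          # sample grid in (0, 1]
phi_monotone = all(phi(a) <= phi(b) for a, b in zip(ts, ts[1:]))
psi_monotone = all(psi(a) <= psi(b) for a, b in zip(ts, ts[1:]))
```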

As in [9], it is convenient for the local convergence analysis that follows to introduce some functions and parameters:

α = ‖F(p)‖,  β = ‖[F′(p)ᵀF′(p)]⁻¹F′(p)ᵀ‖,
d(x) = ‖x − p‖,  s₀ = max{d(x₀), d(y₀)},
μ(t) = μ(L₀,L,M)(t)
 = (β/8)∫₀^{t} M(u)(t−u)² du
 + βt(∫₀^{3t/2} L(u) du + ∫₀^{t} L₀(u) du) + √2αβ²∫₀^{t} L₀(u) du − t,
γ = γ(L₀,L,M)
 = β∫₀^{d(x₀)} M(u)(d(x₀)−u)² du / [8d(x₀)(1 − β∫₀^{d(x₀)} L₀(u) du)]
 + βd(x₀)∫₀^{(d(x₀)+d(y₀))/2} L(u) du / [((2d(x₀)+d(y₀))/3)(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + √2αβ²∫₀^{d(z₀)} L₀(u) du / [d(z₀)(1 − β∫₀^{d(z₀)} L₀(u) du)] < 1,
δ = β∫₀^{d(x₀)} M(u)(d(x₀)−u)² du / [8d(x₀)³(1 − β∫₀^{d(z₀)} L₀(u) du)],
τ = √2αβ²∫₀^{d(z₀)} L₀(u) du / [d(z₀)(1 − β∫₀^{d(z₀)} L₀(u) du)],
λ = β∫₀^{(d(x₀)+d(y₀))/2} L(u) du / [((2d(x₀)+d(y₀))/3)(1 − β∫₀^{d(z₀)} L₀(u) du)],
e¹ₙ₊₁ = δd(xₙ)³ + λd(xₙ)d(yₙ) + τd(zₙ),
e²ₙ₊₁ = δd(xₙ₊₁)³ + (λ/3)(d(xₙ) + d(yₙ) + d(xₙ₊₁))d(xₙ₊₁) + τd(zₙ),

and

sₙ₊₁ = max{d(xₙ₊₁), d(yₙ₊₁)}.

Notice that if L0=L=L1 and M=M1, then the preceding definitions reduce to the corresponding ones in [9].

The local convergence analysis is based on the conditions (𝒞):

(𝒞1) The mapping F : U¯(p,R₁) → ℝ^i is twice Fréchet-differentiable, F′(p) has full rank and p solves problem (1.1).

(𝒞2) F′ satisfies the center-Lipschitz condition with L₀ average on U¯(p,R₁) and the restricted Lipschitz condition with L average on U¯(p,R₀); F″ satisfies the restricted Lipschitz condition with M average on U¯(p,R₀), where L₀, L and M are positive non-decreasing functions on [0, 3R₀/2].

(𝒞3) The function μ has a minimal zero R in [0,R₀], which also satisfies

β∫₀^{R} L₀(u) du < 1.

Then, we can show the following local convergence result for TGNWTM under the conditions (𝒞) and the preceding notation.

Theorem 2.5.

Suppose that the conditions (𝒞) hold. Then, the sequences {xₙ}, {yₙ}, {zₙ} generated for x₀, y₀ ∈ U¯(p,R)∖{p} by TGNWTM are well defined, remain in U¯(p,R) for each n = 0, 1, 2, … and converge to p. Moreover, the following estimates hold:

(2.8) d(xₙ₊₁) ≤ e¹ₙ₊₁,
(2.9) d(yₙ₊₁) ≤ e²ₙ₊₁,

and

(2.10) sₙ₊₁ ≤ γsₙ ≤ γⁿ⁺¹s₀.

Proof. The proof follows the corresponding one in [9], but there are differences where we use (L₀, L) and M instead of L₁ and M₁, respectively, used in [9]. We shall use mathematical induction to show that the iterates {x_k}, {y_k}, {z_k} are well defined, converge to p, and satisfy the error estimates (2.8)–(2.10). Using TGNWTM for n = 0, we can write

x₁ − p = x₀ − p − [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀF(x₀)
 = [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ[F′(z₀)(x₀ − p) − F(x₀) + F(p)]
 + ([F′(p)ᵀF′(p)]⁻¹F′(p)ᵀ − [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ)F(p)
 = [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀJ(x₀)
 + ([F′(p)ᵀF′(p)]⁻¹F′(p)ᵀ − [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ)F(p),

and

y₁ − p = x₁ − p − [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀF(x₁)
 = [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ[F′(z₀)(x₁ − p) − F(x₁) + F(p)]
 + ([F′(p)ᵀF′(p)]⁻¹F′(p)ᵀ − [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ)F(p)
 = [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀJ(x₁)
 + ([F′(p)ᵀF′(p)]⁻¹F′(p)ᵀ − [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ)F(p),

where

J(x_i) = F′((x_i + p)/2)(x_i − p) − F(x_i) + F(p)
 + (F′(z₀) − F′((x_i + p)/2))(x_i − p), i = 0, 1.

In view of the estimate

F(x) − F(y) − F′((x+y)/2)(x−y) = (1/4)∫₀¹(1−t)[F″((x+y)/2 + (t/2)(x−y)) − F″((x+y)/2 + (t/2)(y−x))](x−y)² dt,

for x=p and y=x0, we obtain in turn

‖F(p) − F(x₀) − F′((x₀+p)/2)(p−x₀)‖
 = ‖(1/4)∫₀¹(1−t)[F″((x₀+p)/2 + (t/2)(p−x₀)) − F″((x₀+p)/2 + (t/2)(x₀−p))](p−x₀)² dt‖
 ≤ (1/4)∫₀¹(1−t)∫₀^{t‖x₀−p‖} M(u) du ‖x₀−p‖² dt
 = (1/8)∫₀^{d(x₀)} M(u)(1 − u/d(x₀))² du d(x₀)²
 = (1/8)∫₀^{d(x₀)} M(u)(d(x₀)−u)² du,

and

‖F′((x₀+y₀)/2) − F′((x₀+p)/2)‖ ≤ ∫₀^{d(y₀)/2} L(u) du.

By the center-Lipschitz condition, we have that

‖(F′(p)ᵀF′(p))⁻¹F′(p)ᵀ‖‖F′(x) − F′(p)‖ ≤ β∫₀^{d(x)} L₀(u) du.

Moreover, by Lemmas 2.1 and 2.2 and (𝒞1), for all x ∈ U(p,R), we get

‖(F′(x)ᵀF′(x))⁻¹F′(x)ᵀ‖ ≤ β/(1 − β∫₀^{d(x)} L₀(u) du)

and

‖(F′(x)ᵀF′(x))⁻¹F′(x)ᵀ − (F′(p)ᵀF′(p))⁻¹F′(p)ᵀ‖ ≤ √2β²∫₀^{d(x)} L₀(u) du / (1 − β∫₀^{d(x)} L₀(u) du).

By the monotonicity of L(u) and M(u), together with Lemmas 2.3 and 2.4, the functions (1/t)∫₀^{t} L(u) du and (1/t³)∫₀^{t} M(u)(t−u)² du are non-decreasing in t. Hence, since d(x₀), d(y₀), d(z₀) ≤ R, we get by (𝒞3) that

γ < (1/R)[β∫₀^{R} M(u)(R−u)² du/(8(1 − β∫₀^{R} L₀(u) du)) + βR∫₀^{3R/2} L(u) du/(1 − β∫₀^{R} L₀(u) du) + √2αβ²∫₀^{R} L₀(u) du/(1 − β∫₀^{R} L₀(u) du)]
 = 1.

Thus, by Lemmas 2.1–2.4, the preceding estimates and condition (𝒞2), we have in turn

‖x₁ − p‖ ≤ ‖[F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ‖ ‖(1/4)∫₀¹(1−t)[F″((x₀+p)/2 + (t/2)(p−x₀)) − F″((x₀+p)/2 + (t/2)(x₀−p))](p−x₀)² dt‖
 + ‖[F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ‖ ‖(F′(z₀) − F′((x₀+p)/2))(x₀−p)‖
 + ‖[F′(p)ᵀF′(p)]⁻¹F′(p)ᵀ − [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ‖ ‖F(p)‖
 ≤ βd(x₀)³∫₀^{d(x₀)} M(u)(d(x₀)−u)² du / [8d(x₀)³(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + βd(x₀)d(y₀)∫₀^{d(y₀)/2} L(u) du / [d(y₀)(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + √2αβ²∫₀^{d(z₀)} L₀(u) du / (1 − β∫₀^{d(z₀)} L₀(u) du)
 ≤ δd(x₀)³ + λd(x₀)d(y₀) + τd(z₀) = e¹₁ ≤ γs₀ < R.

In an analogous way, we get in turn

‖y₁ − p‖ ≤ ‖[F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ‖(‖F′((x₁+p)/2)(x₁−p) − F(x₁) + F(p)‖
 + ‖(F′(z₀) − F′((x₁+p)/2))(x₁−p)‖)
 + ‖[F′(p)ᵀF′(p)]⁻¹F′(p)ᵀ − [F′(z₀)ᵀF′(z₀)]⁻¹F′(z₀)ᵀ‖ ‖F(p)‖
 ≤ βd(x₁)³∫₀^{d(x₁)} M(u)(d(x₁)−u)² du / [8d(x₁)³(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + βd(x₁)d(z₀)∫₀^{d(z₀)/2} L(u) du / [d(z₀)(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + √2αβ²∫₀^{d(z₀)} L₀(u) du / (1 − β∫₀^{d(z₀)} L₀(u) du)
 ≤ δd(x₁)³ + (λ/3)d(x₁)(d(x₀) + d(y₀) + d(x₁)) + τd(z₀)
 < δd(x₀)³ + (λ/3)d(x₀)(2d(x₀) + d(y₀)) + τd(z₀) ≤ γs₀ < R,

hold, where d(z₀) ≤ (d(x₀) + d(y₀) + d(x₁))/2, so x₁, y₁ ∈ U(p,R) and we also have that

s₁ = max{‖x₁ − p‖, ‖y₁ − p‖} ≤ γs₀,

so (2.10) is satisfied for n = 0. Suppose that x_k, y_k ∈ U(p,R) and (2.10) hold for some k > 0. By TGNWTM for k + 1, we get in turn that

‖x_{k+1} − p‖ ≤ βd(x_k)³∫₀^{d(x_k)} M(u)(d(x_k)−u)² du / [8d(x_k)³(1 − β∫₀^{d(x_k)} L₀(u) du)]
 + βd(x_k)d(y_k)∫₀^{d(y_k)/2} L(u) du / [d(y_k)(1 − β∫₀^{d(y_k)} L₀(u) du)]
 + √2αβ²d(z_k)∫₀^{d(z_k)} L₀(u) du / [d(z_k)(1 − β∫₀^{d(z_k)} L₀(u) du)]
 ≤ βd(x_k)³∫₀^{d(x₀)} M(u)(d(x₀)−u)² du / [8d(x₀)³(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + βd(x_k)d(y_k)∫₀^{d(y₀)/2} L(u) du / [d(y₀)(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + √2αβ²d(z_k)∫₀^{d(z₀)} L₀(u) du / [d(z₀)(1 − β∫₀^{d(z₀)} L₀(u) du)]
 ≤ δd(x_k)³ + λd(x_k)d(y_k) + τd(z_k) = e¹_{k+1} ≤ γs_k < R

and

‖y_{k+1} − p‖ ≤ βd(x_{k+1})³∫₀^{d(x_{k+1})} M(u)(d(x_{k+1})−u)² du / [8d(x_{k+1})³(1 − β∫₀^{d(z_k)} L₀(u) du)]
 + βd(x_{k+1})d(z_k)∫₀^{d(z_k)/2} L(u) du / [d(z_k)(1 − β∫₀^{d(z_k)} L₀(u) du)]
 + √2αβ²d(z_k)∫₀^{d(z_k)} L₀(u) du / [d(z_k)(1 − β∫₀^{d(z_k)} L₀(u) du)]
 ≤ βd(x_{k+1})³∫₀^{d(x₀)} M(u)(d(x₀)−u)² du / [8d(x₀)³(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + βd(x_{k+1})d(z_k)∫₀^{d(z₀)} L(u) du / [d(z₀)(1 − β∫₀^{d(z₀)} L₀(u) du)]
 + √2αβ²d(z_k)∫₀^{d(z₀)} L₀(u) du / [d(z₀)(1 − β∫₀^{d(z₀)} L₀(u) du)]
 ≤ δd(x_{k+1})³ + (λ/3)(d(x_k) + d(y_k) + d(x_{k+1}))d(x_{k+1}) + τd(z_k)
 = e²_{k+1} ≤ γs_k < R,

where d(z_k) ≤ (d(x_k) + d(y_k) + d(x_{k+1}))/2. Furthermore, we obtain

s_{k+1} = max{‖x_{k+1} − p‖, ‖y_{k+1} − p‖} ≤ γs_k ≤ γ²s_{k−1} ≤ ⋯ ≤ γ^{k+1}s₀,

so

x_{k+1}, y_{k+1} ∈ U(p,R), and the estimates (2.8)–(2.10) hold,

lim_{k→∞} x_k = p and lim_{k→∞} y_k = p. ∎

Concerning the uniqueness of the solution p we have:

Proposition 2.6.

Under the conditions (𝒞) further suppose that

(2.11) (β/R)∫₀^{R} L₀(u)(R − u) du + (αβ̄/R)∫₀^{R} L₀(u) du < 1,

where β̄ = ‖[F′(p)ᵀF′(p)]⁻¹‖. Then, the limit point p is the only solution of problem (1.1) in U¯(p,R).

The proof follows from the corresponding one in [5], but we use only the center-Lipschitz condition.

3. Special cases and applications

Remark 3.1.
(a) Set α = ‖F(p)‖ = 0 in Theorem 2.5 and Proposition 2.6 to obtain the results in the case of zero residual.

(b) If L₀, L, M are constant functions, then the results for the classical Lipschitz conditions are obtained as special cases.

(c) In the literature, the functions L₁ and M₁ are used instead of L and M, respectively [3, 5, 8, 9, 12]. Let us compare the ratios and balls of convergence. Notice that, in view of (2.1), (2.4), (2.5) and (2.7), we have

(3.1) μ(L₀,L,M)(t) ≤ μ(L₁,L₁,M₁)(t),

and

(3.2) γ(L₀,L,M) ≤ γ(L₁,L₁,M₁),

so

(3.3) R(L₁,L₁,M₁) ≤ R(L₀,L,M).

Therefore, our radius of convergence is larger and our ratio of convergence is smaller. Moreover, the information on the location of the solution p is more precise, since only L₀ is used in (2.11) [9]. Notice that these advantages are obtained under the same computational cost, since in practice the computation of L₁ and M₁ requires the computation of the rest of the functions L₀, L and M as special cases.

Remark 3.2.

In particular, using the error estimates, it follows that for α = 0 we have τ = 0 and

d(x_{k+1}) ≤ d(x_k)(δd(x_k)² + λd(y_k)),
d(y_{k+1}) ≤ d(x_{k+1})[δd(x_{k+1})² + (λ/3)(d(x_k) + d(x_{k+1}) + d(y_k))]
 ≤ d(x_{k+1})[(δd(x_k) + 2λ/3)d(x_k) + λd(y_k)/3]
 ≤ d(x_{k+1})d(x_k)(δR + λ)
 = λ₁d(x_{k+1})d(x_k), λ₁ := δR + λ.

Also, for sufficiently large k,

d(x_{k+1}) ≤ d(x_k)(δd(x_k)² + λd(y_k))
 ≤ d(x_k)(δd(x_k)² + λλ₁d(x_k)d(x_{k−1}))
 ≤ d(x_k)²d(x_{k−1})(δ + λλ₁)
 = λ₂d(x_k)²d(x_{k−1}), λ₂ := δ + λλ₁,

leading to the characteristic equation

x² − 2x − 1 = 0,

so the order of the iterative method (1.2) is the positive root of the preceding equation, namely 1 + √2.
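The order claim can be checked numerically: for a sequence obeying the (model) recurrence d_{k+1} = C d_k² d_{k−1}, the logarithms of the errors satisfy the linear recurrence behind x² − 2x − 1 = 0, so log d_{k+1}/log d_k should approach 1 + √2. The constant C = 1 and the two starting errors below are illustrative assumptions.

```python
import math

# Simulate the error recurrence d_{k+1} = d_k^2 * d_{k-1} and estimate the
# R-order from the ratio of successive log-errors.
d = [0.5, 0.4]
for _ in range(7):                  # stop before the errors underflow
    d.append(d[-1] ** 2 * d[-2])

order = math.log(d[-1]) / math.log(d[-2])
root = 1.0 + math.sqrt(2.0)         # positive root of x^2 - 2x - 1 = 0
```

After a handful of steps the estimated order agrees with 1 + √2 ≈ 2.4142 to about four digits.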

Next, we present an example to show that (3.1)–(3.3) hold as strict inequalities, justifying the advantages claimed in the introduction of this study.

Example 3.3.

Let X = ℝ³, D = U¯(0,1), p = (0,0,0)ᵀ. Define the function F on D for w = (x,y,z)ᵀ by

F(w) = (eˣ − 1, ((e−1)/2)y² + y, z)ᵀ.

Then, the Fréchet derivative is given by

F′(w) = diag(eˣ, (e−1)y + 1, 1).

Notice that, using the conditions (𝒞), we get L₀ = e − 1, L = M = e^{1/L₀}, L₁ = M₁ = e, β = 1, i = j = 3 and α = 0. Then

R(L₁,L₁,M₁) = 0.1468 < R(L₀,L,M) = 0.2263,

which justifies the improvements stated in the introduction of this paper.
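Both radii of the preceding example can be recomputed directly. Assuming that, for constant averages with zero residual (α = 0) and β = 1, the equation μ(t) = 0 reduces to the quadratic (M/24)t² + (3L/2 + L₀)t − 1 = 0 (a reconstruction used here for illustration), its positive root gives the radius of convergence:

```python
import math

# Positive root of (M/24) t^2 + (3L/2 + L0) t - 1 = 0 (constant averages,
# alpha = 0, beta = 1); an assumed reduction of mu(t) = 0.
def radius(L0, L, M):
    a, b = M / 24.0, 1.5 * L + L0
    return (-b + math.sqrt(b * b + 4.0 * a)) / (2.0 * a)

L0 = math.e - 1.0                         # center-Lipschitz average
L = M = math.exp(1.0 / L0)                # restricted averages
R_old = radius(math.e, math.e, math.e)    # classical choice L1 = M1 = e
R_new = radius(L0, L, M)                  # restricted choice
```

Rounded to four decimals this reproduces R(L₁,L₁,M₁) = 0.1468 and R(L₀,L,M) = 0.2263, with R_new > R_old as claimed.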

References