Abstract

The paper extends the well-known Lyusternik-Graves theorem for set-valued mappings to the Hölder framework, offers an affirmative answer to an open problem proposed by Dontchev and improves recent results of He and Ng. Primal and dual necessary and sufficient conditions for Hölder metric regularity are established. The results are applied to convergence analysis of a Newton-type method. Some open problems for future research are also discussed.

1 Introduction↩︎

It is well known that many important problems in variational analysis and optimization, cf. [1]–[4], can be modelled by the generalized equation \[\begin{gather} \label{GE} F(x)\ni y \end{gather}\tag{1}\] where \(F:X\rightrightarrows Y\) is a set-valued mapping between metric spaces. When the mapping is single-valued, inclusion 1 reduces to a conventional equation, but more broadly it can express a mixture of inequalities and equalities. Relation 1 can represent a variational inequality or a system of optimality conditions. An important issue in investigating a generalized equation is to study the behavior of the solution set \(F^{-1}(y)\) with respect to perturbations in \(y\), and this can often be expressed in terms of the property called ‘metric regularity’. The property has its roots in the classical results by Banach and plays a central role in variational analysis both theoretically and numerically [1], [3]–[5].

Definition 1. Let \(X\) and \(Y\) be metric spaces, \(F:X\rightrightarrows Y\), and \((\bar x,\bar y)\in {\rm gph}\,F\). The mapping \(F\) is metrically regular at \((\bar x,\bar y)\) if there exist \(\tau>0\) and \(\delta>0\) such that \[\begin{align} \label{D1461-4} \tau d(x,F^{-1} (y))\le d(y,F(x)) \end{align}\qquad{(1)}\] for all \(x\in B_{\delta}(\bar x)\) and \(y\in B_{\delta}(\bar y)\). The supremum of \(\tau\) such that ?? holds for some \(\delta>0\) is called the modulus of regularity of \(F\) at \((\bar x,\bar y)\) and denoted by rg\(F(\bar x,\bar y)\).

It should be noted that rg\(F(\bar x,\bar y)=\)1/reg\((F;\bar x|\bar y)\) where reg\((F;\bar x|\bar y)\) is the modulus of regularity employed in [1], [6], [7]. This quantity enables one to check how large a perturbation can be before a ‘good behavior’ of the solution mapping breaks down.

Metric regularity necessitates uniformity in estimates involving local perturbations of both \(\bar x\) and \(\bar y\). The term \(d(y,F(x))\) measures the residual when \(y\notin F(x)\). Strictly speaking, inequality ?? provides an estimate of how distant a point \(x\) is from being a solution to the generalized equation 1 . Computing a residual is much easier than finding a solution to the generalized equation. Such an estimate is crucial for numerous optimization problems, especially for computational purposes.

The study of the Hölder metric regularity has attracted considerable attention due to the fact that the conventional (linear) metric regularity fails in many practical situations. The number of publications dedicated to studying Hölder metric regularity is large, see [8]–[13] and the references therein.

Definition 2. Let \(X,Y\) be metric spaces, \(F:X\rightrightarrows Y\), \((\bar x,\bar y)\in {\rm gph}\,F\), and \(q>0\). The mapping \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) if there exist \(\tau>0\) and \(\delta>0\) such that \[\begin{align} \label{D1461-1} \tau d(x,F^{-1} (y))\le d^q(y,F(x)) \end{align}\qquad{(2)}\] for all \(x\in B_{\delta}(\bar x)\) and \(y\in B_{\delta}(\bar y)\). The supremum of \(\tau\) such that ?? holds for some \(\delta>0\) is called the modulus of regularity of order \(q\) of \(F\) at \((\bar x,\bar y)\) and denoted by rg\(^qF(\bar x,\bar y)\).

The next definition recalls the concept of Hölder continuity of set-valued mappings [12], [14].

Definition 3. Let \(X,Y\) be metric spaces, \(\Phi:X\rightrightarrows Y\), \((\bar x,\bar y)\in{\rm gph}\,F\), and \(q>0\). The mapping \(\Phi\) is Hölder continuous of order \(q\) at \((\bar x,\bar y)\) if there exist \(\tau>0\) and \(\delta>0\) such that \[\begin{align} d^q(y,\Phi(x))\le \tau d(x,x') \end{align}\] for all \(x,x'\in B_{\delta}(\bar x)\) and \(y\in \Phi(x')\cap B_{\delta}(\bar y)\). The infimum of \(\tau\) such that the above inequality holds for some \(\delta>0\) is called the modulus of Hölder continuity and denoted by \({\rm{lip}}^q \Phi(\bar x,\bar y)\).

When \(\Phi\) is single-valued, the property in Definition 3 reduces to the conventional Hölder continuity [15], [16], and the modulus is denoted by \({\rm{lip}}^q \Phi(\bar x)\). The absence of the above properties is characterized by \({\rm{rg}}^qF(\bar x,\bar y)=0\) and \({\rm{lip}}^q\Phi(\bar x,\bar y)=+\infty\), respectively. In the case \(q=1\), the properties reduce to the convetional metric regularity and Aubin property [1], [7].

The next statement is straighforward.

Let \(X,Y\) be metric spaces, \(F:X\rightrightarrows Y\), \((\bar x,\bar y)\in {\rm gph}\,F\), and \(q>0\). The mapping \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) if and only if \(F^{-1}\) is Hölder continuous of order \(\frac{1}{q}\) at \((\bar y,\bar x)\). Moreover, \[\begin{gather} {\rm{rg}}^qF(\bar x,\bar y)=({\rm{lip}}^{\frac{1}{q}} F^{-1} (\bar y,\bar x))^{-q}. \end{gather}\]

The property in Definition 2 does not change if one imposes an upper bound on the right-hand side of ?? ; cf. [3] and [17].

Let \(X\) and \(Y\) be metric spaces, \(F:X\rightrightarrows Y\), \((\bar x,\bar y)\in {\rm gph}\,F\), and \(q>0\). The mapping \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) if and only if there exist \(\tau>0\), \(\delta>0\), and \(\mu>0\) such that inequality ?? holds for all \(x\in B_{\delta}(\bar x)\) and \(y\in B_{\delta}(\bar y)\) with \(d^q(y,F(x))<\tau\mu\).

If inequality ?? holds for all \(x\in B_{\delta}(\bar x)\) and \(y\in B_{\delta}(\bar y)\) with \(d^q(y,F(x))<\tau\mu\), then we often say that \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) with \(\tau\), \(\delta\) and \(\mu\). The condition \(d^q(y,F(x))<\tau\mu\) in Proposition [P3461] can be replaced by \(d^q(y,F(x))<\mu\). However, we prefer to keep the current form since it results in ‘neater’ statements in Section 3. In any case, adding such a condition does not affect the constant \(\tau\) in Definition 2, but can have an effect on the value of \(\delta\). We consider in this paper the Hölder metric regularity for the case \(q\in(0,1]\), although some results are also valid when \(q>1\).

In the current paper, we study single-valued additive perturbations of the left-hand side of 1 ; cf. [1]. Set-valued perturbations were considered in [18]–[21]. What effect do such perturbations have on the metric regularity propery? The question is answered by the fundamental estimation arising from the works of Lyusternik [22] and Graves [23]. It shows that metric regularity of a set-valued mapping is preserved if the perturbation function is Lipschitz continuous with a sufficiently small Lipschitz constant. Interested readers are referred to [1], [3], [19], [24]–[26] and the references therein.

Theorem 1. Let \(X\) be a complete metric space, \(Y\) be a linear space with a shift-invariant metric, \(F:X\rightrightarrows Y\), \((\bar x,\bar y)\in{\rm gph}\,F\), \(f: X\rightarrow Y\) with \(f(\bar x)=0\), and \({\rm gph}\,F\) be closed near \((\bar x,\bar y)\). Then \[\begin{align} {\rm{rg}}(F+f)(\bar x,\bar y)\ge {\rm{rg}}F(\bar x,\bar y)-{\rm{lip}}f(\bar x). \end{align}\]

The right-hand side quantity in the above inequality can be undefined when both moduli equal plus infinity. To address this situation, we employ in the current paper the convention that \((+\infty)-(+\infty)=0.\)

In the paper [27], Dontchev posed an open question about a possible generalization of Theorem 1 to the Hölder setting. Recently, He and Ng [15] have proved a Hölder version of Theorem 1 by establishing a relation for the corresponding constants in the definitions of metric regularity and Hölder continuity. In [9], [28], the authors have provided some primal estimations for the modulus of Hölder regularity. In the current paper, we establish an improved version of [15]. The results are applied to convergence analysis of a Newton-type method enhancing [29].

The paper is organized as follows. The next Section 2 provides some preliminary results used throughout the paper. Section 3 studies slope and coderivative necessary and sufficient conditions for Hölder metric regularity. We establish in Section 4 a Hölder version of the extended Lyusternik-Graves theorem. The results are applied in Section 5 to convergence analysis of a Newton-type method. The final Section 6 proposes some open problems for further research.

2 Preliminaries↩︎

Our basic notation is standard, see, e.g., [1], [4], [5]. Throughout the paper, if not explicitly stated otherwise, \(X\) and \(Y\) are metric spaces. Products of metric or normed spaces are assumed to be equipped with the maximum distance or norm. The topological dual of a normed space \(X\) is denoted by \(X^*\), while \(\langle\cdot,\cdot\rangle\) denotes the bilinear form defining the pairing between the two spaces. In a primal space, the open and closed balls with center \(x\) and radius \(\delta>0\) are denoted, respectively, by \(B_\delta(x)\) and \(\overline{B}_\delta(x)\), while \(\mathbb{B}\) and \(\overline{\mathbb{B}}\) stand for, respectively, the open and closed unit balls. The open unit ball in the dual space is denoted by \(\mathbb{B}^*\). A set \(\Omega\) is said to be closed near \(\bar x\in\Omega\) if there exists a \(\delta>0\) such that \(\Omega\cap\overline{B}_\delta(\bar x)\) is closed. Symbols \(\mathbb{R}\), \(\mathbb{R}_+\) and \(\mathbb{N}\) stand for the real line, the set of all nonnegative reals, and the set of all nonnegative integers, respectively.

A metric \(d\) on a vector space \(X\) is called shift-invariant if \(d(x'+z,x+z)=d(x,x')\) for all \(x',x,z\in X\). For subsets \(A,B\) of a metric space \(X\), the excess of \(A\) beyond \(B\) is defined by \(e(A,B):=\sup_{x\in A}d(x,B)\) with the convention that \(e(\emptyset,B):=0\) when \(B\ne\emptyset\) and \(+\infty\) otherwise.

Let \(\{x_k\}_{k\in \mathbb{N}}\) be a sequence in a normed space \(X\) converging to a point \(\bar x\in X\). It is said to converge quadratically to \(\bar x\) if there exist \(\gamma>0\) and \(k_0\in\mathbb{N}\) such that \(\|x_{k+1}-\bar x\|\le \gamma \|x_k-\bar x\|^2\) for all \(k\ge k_0\).

Let \(X\) be a normed space, \(\Omega\subset X\), and \(f:X\to\mathbb{R}\cup\{+\infty\}\). The Fréchet normal cone to \(\Omega\) at \(\bar x\in \Omega\) and the Fréchet subdifferential of \(f\) at \(\bar x\in{\rm dom}\,f:=\{x\in X\mid f(x)< +\infty\}\) are defined, respectively, by \[\begin{gather} N_{\Omega}(\bar x):= \left\{x^\ast\in X^\ast\mid \limsup_{\Omega\ni x\to\bar x,\,x\ne \bar x} \frac{\langle x^\ast,x-\bar x\rangle}{\|x-\bar x\|} \le 0 \right\},\\ \partial f(\bar x):=\left\{x^*\in X^*\mid \liminf_{\substack{x\to \bar x,\,x\ne\bar x}} \dfrac{f(x)-f(\bar x)-\langle x^*,x-\bar x\rangle}{\|x-\bar x\|}\ge 0\right\}. \end{gather}\] By convention, we set \(N_{\Omega}(\bar x) :=\emptyset\) if \(\bar x\notin \Omega\) and \(\partial{f}(\bar x):=\emptyset\) if \(\bar x\notin{\rm dom}\,f\). If \(\Omega\) and \(f\) are convex, the aforementioned concepts reduce to the normal cone and subdifferential in the sense of convex analysis. If \(f\) is Fréchet differentiable with a derivative \(\nabla f(\bar x)\), then \(\partial f(\bar x)=\{\nabla f(\bar x)\}\).

The Fréchet coderivative of a set-valued mapping \(F:X\rightrightarrows Y\) between normed spaces at \((\bar x,\bar y)\in{\rm gph}\,F\) is a set-valued mapping \(D^*F(\bar x,\bar y):Y^*\rightrightarrows X^*\) defined for any \(y^*\in Y^*\) by \[\begin{align} \label{coder} D^*F(\bar x,\bar y)(y^*):=\{x^*\in X^*\mid (x^*,-y^*)\in N_{{\rm gph}\,F}(\bar x,\bar y)\}. \end{align}\tag{2}\]

The following results are well known [5], [30], [31].

Lemma 1. Let \(X\) be a normed space, \(f:X\to\mathbb{R}\cup\{+\infty\}\), and \(\bar x\in{\rm dom}\,f\). The following statements hold.

If \(x\) is a point of local minimum of \(f\), then \(0\in\partial f(\bar x)\).
\(\partial(\lambda f)(\bar x)=\lambda\partial f(\bar x)\) for any \(\lambda>0\).
\(\partial\|\cdot\|(0)=\{x^*\in X^*\mid \|x^*\|\le 1\}\).
\(\partial\|\cdot\|(x)=\{x^*\in X^*\mid \langle x^*,x\rangle=\|x\|\;\; \text{and} \;\; \|x^*\|= 1\}, \;\; x\ne 0\).

Let \(X\) be a metric space, \(f:X\rightarrow\mathbb{R}\cup\{+\infty\}\). The slope [25], [32]–[34] of \(f\) at \(x\in{\rm dom}\,f\) is defined by \[\begin{align} |\nabla f|(x):=\limsup_{u\rightarrow x,u\ne x}\dfrac{ [f(x)-f(u)]_+}{d(x,u)} \end{align}\] where \(\alpha_+:=\max\{0,\alpha\}\) for any \(\alpha\in\mathbb{R}\). If \(x\notin{\rm dom}\,f\), we set \({|\nabla f|(x):=+\infty}\).

The next statement offers chain rules for slopes [35] and Fréchet subdifferentials [36].

Lemma 2. Let \(X\) be a metric space, \(f: X\rightarrow\mathbb{R}\cup\{+\infty\}\), \(\bar x\in{\rm dom}\,f\) with \(f(\bar x)>0\), and \(q>0\). The following statements hold.

\(|\nabla f^q|(\bar x)=qf^{q-1}(\bar x)|\nabla f|(\bar x)\).
If \(X\) is a normed space, then \(\partial f^q(\bar x)=qf^{q-1}(\bar x)\partial f(\bar x)\).

The other fundamental tools for our analysis are the contraction mapping principle for set-valued mappings [37], the Ekeland variational principle [38], and subdifferential sum rules [30], [31], [39].

Lemma 3. Let \(X\) be a complete metric space, \(\Phi:X\rightrightarrows X\), \(x\in X\), \(\theta\in(0,1)\), and \(\delta>0\). Suppose that the following conditions are satisfied:

\({\rm gph}\,\Phi\cap [\overline{B}_\delta(x)\times \overline{B}_\delta(x)]\) is closed;
\(d(x,\Phi(x))<\delta(1-\theta)\);
\(e(\Phi(u)\cap B_\delta(x),\Phi(v))\le\theta d(u,v)\) for all \(u,v\in\overline{B}_\delta(x)\).

Then, there exists an \(\hat{x}\in \overline{B}_\delta(x)\) with \(\hat{x}\in\Phi(\hat{x})\). If \(\Phi\) is single-valued, then \(\hat{x}\) is the unique fixed point in \(\overline{B}_\delta(x)\).

Lemma 4. Let \(X\) be a complete metric space, \(f: X\to \mathbb{R} \cup \{ +\infty\}\) be lower semicontinuous, \(x\in X\), \(\varepsilon>0\) and \(\lambda>0\). If \(f(x)<\inf_{X} f+\varepsilon\), then there exists an \(\hat{x}\in X\) such that

\(d(\hat{x},x)<\lambda\);
\(f(\hat{x})\le f(x)\);
\(f(u)+(\varepsilon/\lambda)d(u,\hat{x})\ge f(\hat{x})\) for all \(u\in X.\)

Lemma 5. Let \(X\) be a normed space, \(f_1,f_2:X\to\mathbb{R}\cup\{+\infty\}\), and \(\bar x\in{\rm dom}\,f_1\cap{\rm dom}\,f_2\).

Suppose \(f_1\) and \(f_2\) are convex, and \(f_1\) be continuous at a point in \({\rm dom}\,f_2\). Then \[\partial(f_1+f_2)(\bar x)=\partial f_1(\bar x)+\partial f_2(\bar x).\]
Suppose \(X\) is Asplund, \(f_1\) is Lipschitz continuous and \(f_2\) is lower semicontinuous in a neighbourhood of \(\bar x\). Then, for any \(x^*\in\partial(f_1+f_2)(\bar x)\) and \(\varepsilon>0\), there exist \(x_1,x_2\in X\) with \(\|x_i-\bar x\|<\varepsilon\), \(|f_i(x_i)-f_i(\bar x)|<\varepsilon\) \((i=1,2)\) such that \[x^*\in\partial f_1(x_1) +\partial f_2(x_2)+\varepsilon\mathbb{B}^\ast.\]

Recall that a Banach space is Asplund if every continuous convex function on an open convex set is Fréchet differentiable on a dense subset [40], or equivalently, if the dual of each its separable subspace is separable. We refer the reader to [5], [40], [41] for discussions about and characterizations of Asplund spaces. All reflexive, particularly, all finite dimensional Banach spaces are Asplund.

3 Necessary and sufficient conditions for Hölder metric regularity↩︎

Along with the standard maximum distance on \(X\times Y\), we also use a metric depending on a parameter \(\gamma>0\) defined by \[\begin{gather} \label{pdist} d_\gamma((u_1,v_1),(u_2,v_2)) :=\max\left\{d(u_1,u_2),\gamma d(v_1,v_2)\right\} \end{gather}\tag{3}\] for any \(u_1,u_2\in X,\;v_1,v_2\in Y\). When \(X\), \(Y\) are normed spaces, the distance 3 yields the definition of the parametric norm \[\begin{gather} \notag \|(x,y)\|_{\gamma}:=\max\{\|x\|,{\gamma}\|y\|\},\quad x\in X,\;y\in Y, \end{gather}\] and the corresponding dual norm \[\begin{align} \label{dnorm} \|(x^*,y^*)\|_{\gamma}=\|x^*\|+\gamma^{-1} \|y^*\|,\quad x^*\in X^*,\;y^*\in Y^*. \end{align}\tag{4}\]

Theorem 2. Let \(X, Y\) be metric spaces, \(F: X\rightrightarrows Y\), \((\bar x,\bar y)\in{\rm gph}\,F\), and \(q\in (0,1]\).

Suppose \(X\) and \(Y\) are complete, and \({\rm gph}\,F\) is closed. If there exist \(\tau>0\), \(\delta>0\), \(\mu>0\), and \(\gamma>0\) such that \[\begin{align} \label{R3461-2} \limsup_{\substack{ u\to x,\,v\to z,\;(u,v)\in{\rm gph}\,F\\(u,v)\ne (x,z),\,d(u,\bar x)<\delta+\mu,\,d(v,y)<(\tau\mu)^{\frac{1}{q}}}} {\dfrac{d^q(z,y)-d^q(v,y)}{d_\gamma((u,v),(x,z))}}\ge\tau \end{align}\qquad{(3)}\] for all \(x\in B_{\delta+\mu}(\bar x)\), \(y\in B_{\delta}(\bar y)\) with \(x\notin F^{-1} (y)\), and \(z\in F(x)\) with \(d(y,z)<(\tau\mu)^{\frac{1}{q}}\), then \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) with \(\tau\), \(\delta\) and \(\mu\).
Suppose \(X,Y\) are normed spaces, and \({\rm gph}\,F\) is convex. If \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) with some \(\tau>0\), \(\delta>0\) and \(\mu>0\), then \[\begin{align} \label{R3461-3} \limsup_{\substack{ u\to x,\,v\to z,\;(u,v)\in{\rm gph}\,F\\(u,v)\ne (x,z),\,\|u-\bar x\|<\delta+\mu,\,\|v-y\|<(\tau\mu)^{\frac{1}{q}}}} {\dfrac{\|z-y\|^q-\|v-y\|^q}{\|(u-z,v-z)\|_\gamma}}\ge\tau \end{align}\qquad{(4)}\] for \(\gamma:=\tau^{-1}\), and all \(x\in B_{\delta}(\bar x)\), \(y\in B_{\delta}(\bar y)\) with \(x\notin F^{-1} (y)\), and \(z\in F(x)\) with \(\|z-y\|<\min\{(\tau\mu)^{\frac{1}{q}},1\}\).

Proof.

Let \(\tau>0\), \(\delta>0\), \(\mu>0\), and \(\gamma>0\). Suppose \(F\) is not metrically regular of order \(q\) at \((\bar x,\bar y)\) with \(\tau\), \(\delta\), and \(\mu\). By Proposition [P3461], there exist \(x\in B_{\delta}(\bar x)\) and \(y\in B_{\delta}(\bar y)\) such that \(d^q(y,F(x))<\tau\mu_0\) with \(\mu_0:=\min\{ d(x,F^{-1} (y)),\mu\}\). Choose a number \(\varepsilon\) such that \(d^q(y,F(x))<\varepsilon<\tau\mu_0\), and a point \(z\in F(x)\) such that \(d^q(z,y)<\varepsilon\). Let \(\psi_y:X\times Y\rightarrow\mathbb{R}_+\cup\{+\infty\}\) be defined by \[\begin{gather} \label{psi} \psi_y(u,v):=d^q(v,y)+i_{{\rm gph}\,F}(u,v), \quad u\in X,\;v\in Y. \end{gather}\tag{5}\] In view of the closedness of \({\rm gph}\,F\), the indicator function in 5 is lower semicontinuous, and consequently, \(\psi_y\) is lower semicontinuous on \(\overline{B}_{\delta+\mu}(\bar x)\times{\overline{B}_{(\tau\mu)^{1/q}}(\bar y)}\). Besides, \[\begin{align} \psi_y(x,z)=d^q(z,y)+i_{{\rm gph}\,F}(x,z)=d^q(z,y)< \inf_{\overline{B}_{\delta+\mu}(\bar x)\times {\overline{B}_{(\tau\mu)^{1/q}}(\bar y)}}\psi_y+\varepsilon. \end{align}\] Applying the Ekeland variational principle (Lemma 4) to the restriction of \(\psi_y\) to the complete metric space \(\overline{B}_{\delta+\mu}(\bar x)\times {\overline{B}_{(\tau\mu)^{1/q}}(y)}\) with the metric 3 , we can find a point \((\hat{x},\hat{z})\in \overline{B}_{\delta+\mu}(\bar x)\times {\overline{B}_{(\tau\mu)^{1/q}}(y)}\) such that \[\begin{gather} \tag{6} d_\gamma((\hat{x},\hat{z}),(x,z))<\mu_0,\\ \tag{7} \psi_y(\hat{x},\hat{z})\le\psi_y(x,z), \\\tag{8} \psi_y(\hat{x},\hat{z})\le\psi_y(u,v) +(\varepsilon/\mu_0) d_\gamma((u,v),(\hat{x},\hat{z})) \end{gather}\] for all \((u,v)\in \overline{B}_{\delta+\mu}(\bar x)\times {\overline{B}_{(\tau\mu)^{1/q}}(y)}\). It is clear from 7 that \((\hat{x},\hat{z})\in{\rm gph}\,F\). By 6 , \(d(\hat{x},x)<d(x,F^{-1} (y))\). Hence, \(\hat{x}\notin F^{-1} (y)\) and \(\hat{z}\ne y\). Besides, \[\begin{gather} d(\hat{x},\bar x)\le d(\hat{x},x)+d(x,\bar x)<\delta+\mu,\;\; d(\hat{z},y)\le d(z,y)<\varepsilon^{\frac{1}{q}}<(\tau\mu)^{\frac{1}{q}}. \end{gather}\] It follows from 8 that \[\begin{align} \sup_{\substack{(u,v)\in{\rm gph}\,F,\, (u,v)\ne(\hat{x},\hat{z}),\\ d(u,\bar x)<\delta+\mu,\,d(v,y)<(\tau\mu)^{\frac{1}{q}}}} \dfrac{d^q(\hat{z},y)-d^q(v,y)}{d_\gamma((u,v),(\hat{x},\hat{z}))} \le\dfrac{\varepsilon}{\mu_0}<\tau. \end{align}\] The last estimate contradicts ?? .
Suppose \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) with some \(\tau>0\), \(\delta>0\) and \(\mu>0\). By Proposition 2, inequality ?? holds for all \(x\in B_{\delta}(\bar x)\), \(y\in B_{\delta}(\bar y)\), and \(d^q(y,F(x))<\tau\mu\). Let \(x\in B_{\delta}(\bar x)\), \(y\in B_{\delta}(\bar x)\) with \(x\notin F^{-1} (y)\), \(z\in F(x)\) with \(\|z-y\|<\min\{(\tau\mu)^{\frac{1}{q}},1\}\), \(\eta>1\), and \(\gamma:=\tau^{-1}\). One can find a \(\xi\in(1,\eta)\) and a point \(\hat{x}\in F^{-1} (y)\) such that \(\xi \|y-z\|^q<\tau\mu\), and \(\tau{\|x-\hat{x}\|}<\xi\|z-y\|^q.\) Thus, \((\hat{x},y)\in{\rm gph}\,F\), \((\hat{x},y)\ne(x,z)\), \[\begin{gather} \|\hat{x}-\bar x\|\le \|\hat{x}-x\|+\|x-\bar x\|<\tau^{-1} \xi \|z-y\|^q+\delta<\delta+\mu, \end{gather}\] and \[\begin{align} \|(x-\hat{x},z-y)\|_\gamma &=\max\{ \|x-\hat{x}\|,\gamma\|z-y\|\}\\ &\le\tau^{-1} \max\{\xi,1\}\|z-y\|^q=\tau^{-1} \xi\|z-y\|^q. \end{align}\] Hence, \[\begin{align} \sup_{\substack{(u,v)\in{\rm gph}\,F,\,(u,v)\ne (x,z)\\ \|u-\bar x\|<\delta+\mu,\,\|v-y\|<(\tau\mu)^{\frac{1}{q}}}} \dfrac{\|z-y\|^q-\|v-y\|^q}{\|(u-x,v-z)\|_\gamma} \ge \dfrac{\|z-y\|^q}{\|(\hat{x}-x,y-z)\|_\gamma} \ge\tau\xi^{-1} >\tau\eta^{-1} . \end{align}\] Letting \(\eta\downarrow 1\), we arrive at ?? . 0◻

◻

In the case \(q=1\), part (i) of Theorem 2 improves [42] and can be seen as a quantitative version of the first part of [3], while part (ii) recaptures [42].
The only difference between the expressions in the left-hand sides of ?? and ?? is that the first one is computed on metric spaces, while the second one is calculated on normed spaces. The two expressions are the slope at \((x,z)\) of the restriction of the function \(\psi_y\), given by 5 , to \({\rm gph}\,F\cap[B_{\delta+\mu}(\bar x)\times B_{(\tau\mu)^{1/q}}(y)]\).
The completeness and closedness assumptions in Theorem 2(i) can be weakened: it suffices to require that \({\rm gph}\,F\cap [\overline{B}_{\delta+\mu}(\bar x)\times \overline{B}_{(\tau\mu)^{1/q}}(y)]\) is complete.

Theorem 3. Let \(X,Y\) be normed spaces, \(F: X\rightrightarrows Y\), \((\bar x,\bar y)\in{\rm gph}\,F\), and \(q\in (0,1]\).

Suppose \(X\) and \(Y\) are Asplund, and \({\rm gph}\,F\) is closed. If there exist \(\tau>0\), \(\delta>0\), \(\mu>0\), \(\eta>0\), and \(\alpha\in(0,1)\) such that \[\begin{align} \label{C3463-3} q\|z-y\|^{q-1} d(0,D^*F(x,z)(y^*))\ge \tau \end{align}\qquad{(5)}\] for all \(x\in B_{\delta+\mu}(\bar x)\), \(y\in B_{\delta}(\bar y)\) with \(x\notin F^{-1} (y)\), and \(z\in F(x)\) with \(\|z-y\|<(\tau\mu)^{\frac{1}{q}}\), \(y^*,z^*\in Y^*\) with \[\begin{gather} \|z^*\|=1,\; \langle z^*,z-y\rangle>\alpha\|z-y\|,\; q\|z-y\|^{q-1}\|y^*-z^*\|<\eta, \end{gather}\] then \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) with \(\tau\), \(\delta\) and \(\mu\).
Suppose \({\rm gph}\,F\) is convex. If \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) with some \(\tau>0\), \(\delta>0\) and \(\mu>0\), then \[\begin{gather} q\|z-y\|^{q-1} d(0,D^*F(x,z)(y^*))\ge\tau(1-\eta) \end{gather}\] for all \(\eta\in(0,1)\), \(x\in B_{\delta}(\bar x)\), \(y\in B_{\delta}(\bar y)\) with \(x\notin F^{-1} (y)\), and \(z\in F(x)\) with \(\|z-y\|<\min\{(\tau\mu)^{\frac{1}{q}},1\}\), \(y^*,z^*\in Y^*\) satisfying \[\begin{align} \|z^*\|=1,\;\; \langle z^*,z-y\rangle=\|z-y\|,\;\; q\|z-y\|^{q-1}\|y^*-z^*\|<\eta. \end{align}\]

Proof.

Let \(\tau>0\), \(\delta>0\), \(\mu>0\), \(\eta>0\), \(\alpha\in (0,1)\), \(\gamma:=\tau^{-1} \eta\), and \(\hat{\tau}\in(0,\tau)\). Suppose \(F\) is not metrically regular of order \(q\) at \((\bar x,\bar y)\) with \(\tau\), \(\delta\), and \(\mu\). By Theorem 2(i), there exist \(x\in B_{\delta+\mu}(\bar x)\), \(y\in B_{\delta}(\bar y)\) with \(x\notin F^{-1} (y)\), \(z\in F(x)\) with \(\|z-y\|<(\hat{\tau}\mu)^{\frac{1}{q}}\), and \(\tau'\in(0,\hat{\tau})\) such that \[\begin{align} \|z-y\|^q-\|v-y\|^q\le\tau'\|(u-x,v-z)\|_\gamma \end{align}\] for all \((u,v)\in{\rm gph}\,F\cap [B_{\delta+\mu}(\bar x)\times B_{(\hat{\tau}\mu)^{1/q}}(y)]\). In other words, \((x,z)\) is a local minimizer of the function \[\begin{align} \label{P5P1} (u,v)\mapsto\psi_y(u,v)+\tau'\|(u-x,v-z)\|_\gamma, \end{align}\tag{9}\] where the function \(\psi_y\) is defined by 5 . By Lemma 1(i), its Fréchet subdifferential at this point contains 0. Observe that 9 is the sum of the function \(\psi_y\) and the Lipschitz continuous convex function \((u,v)\mapsto \tau'\|(u-x,v-z)\|_{\gamma}.\) In view of Lemma 1(iii) and (iv), all subgradients \((u^*,v^*)\) of the latter function at any points satisfy \(\|(u^*,v^*)\|_{\gamma}\le\tau'.\) Let \(\varepsilon>0\) be such that \[\begin{align} \varepsilon<\min\left\{\delta+\mu-\|x-\bar x\|, (\tau\mu)^{\frac{1}{q}}-\|z-y\|, \hat{\tau}-\tau',\frac{1}{2}d(x,F^{-1} (y))\right\}. \end{align}\] By Lemma 5(ii), there exist points \(x'\in B_\varepsilon({x}),z'\in B_\varepsilon({z})\) with \((x',z')\in{\rm gph}\,F\), and \((\hat{x}^*,\hat{z}^*)\in \partial\psi_y(x',z')\) such that \(\|(\hat{x}^*,\hat{z}^*)\|_{\gamma}<\tau'+\varepsilon.\) It is straighforward from the choice of \(\varepsilon\) that \(x'\in{B_{\delta+\mu}(\bar x)}\), \(x'\notin F^{-1} (y)\), \(\|z'-y\|<(\tau\mu)^{\frac{1}{q}}\), and \(\|(\hat{x}^*,\hat{z}^*)\|_{\gamma}<\hat{\tau}\). Recall from 5 that \(\psi_y\) is a sum of two functions: the Lipschitz continuous convex function \(v\mapsto g_y(v):=\|v-y\|^q\) and the indicator function of the closed set \({\rm gph}\,F\). Let \(\lambda:=\frac{\tau-\hat{\tau}}{\tau+\hat{\tau}}\) and choose \(\varepsilon>0\) such that \[\begin{gather} \varepsilon<\min\Big\{\delta+\mu-\|x'-\bar x\|,\dfrac{1}{2}d(x',F^{-1} (y)), (\hat{\tau}\mu)^{\frac{1}{q}}-\|z'-y\|,\\ \hat{\tau}-\|(\hat{x}^*,\hat{z}^*)\|_\gamma, \min\left\{1/2,(1-\alpha)/8,\lambda\right\}\|z'-y\|\Big\}. \end{gather}\] Applying Lemma 5(ii), there exist \(\hat{x}\in B_\varepsilon(x')\), and \(\hat{z},\hat{z}'\in B_\varepsilon(z')\) with \((\hat{x},\hat{z})\in{\rm gph}\,F\), \(w^*\in\partial g(\hat{z}')\), \((u^*,v^*)\in N_{{\rm gph}\,F}(\hat{x},\hat{z})\) such that \(\|(0,z^*)+(u^*,v^*)-(\hat{x}^*,\hat{z}^*)\|_{\gamma}<\varepsilon.\) From the choice of \(\varepsilon\), one obtain \(\hat{x}\in B_{\delta+\mu}(\bar x)\), \(\hat{x}\notin F^{-1} (y)\), \(\|\hat{z}-y\|<(\hat{\tau}\mu)^{\frac{1}{q}}\), \(\|(0,w^*)+(u^*,v^*)\|_{\gamma}<\hat{\tau}\), and \(\|\hat{z}-y\|\ge\max\{\frac{1}{2},1-\lambda\}\|z'-y\|\). It follows from the last estimate that \[\begin{gather} \|\hat{z}'-\hat{z}\|<\frac{1-\alpha}{4}\|z'-y\|\le\frac{1-\alpha}{2} \|\hat{z}-y\|,\\ \|\hat{z}'-y\|\le \|\hat{z}'-\hat{z}\|+\|\hat{z}-y\|\le \dfrac{2\lambda}{1-\lambda}\|\hat{z}-y\|=\dfrac{\tau}{\hat{\tau}}\|\hat{z}-y\|. \end{gather}\] Note that \(\hat{z}'\ne y\), and consequently, \(\partial g(\hat{z}')=q\|\hat{z}'-y\|^{q-1}\partial\|\cdot-y\|(\hat{z}').\) Then, there exists a \(z^*\in Y^*\) such that \(w^*=\theta z^*\) with \(\theta:=q\|\hat{z}'-y\|^{q-1}\), \(\|z^*\|=1\) and \(\langle z^*,\hat{z}'-y\rangle=\|\hat{z}'-y\|\). One has \[\begin{align} \langle z^*,\hat{z}-y\rangle &\ge\langle z^*,\hat{z}'-y\rangle-\|\hat{z}'-\hat{z}\|=\|\hat{z}'-y\|-\|\hat{z}'-\hat{z}\|\\ &\ge\|\hat{z}-y\|-2\|\hat{z}'-\hat{z}\|>\alpha\|\hat{z}-y\|. \end{align}\] Let \(\hat{u}^*:=u^*/\theta\), \(y^*:=-v^*/\theta\). Then \((\hat{u}^*,-y^*)\in N_{{\rm gph}\,F}(\hat{x},\hat{z})\), and \[\begin{gather} \|\hat{u}^*\|+\gamma^{-1} {\|z^*-y^*\|}<\hat{\tau} q^{-1} \|\hat{z}'-y\|^{1-q}. \end{gather}\] Then, \[\begin{align} \|\hat{u}^*\|+\gamma^{-1} {\|z^*-y^*\|} &< \hat{\tau} q^{-1} \left(\dfrac{ \tau}{\hat{\tau}}\right)^{1-q}\|\hat{z}-y\|^{1-q}\\ &=\hat{\tau} q^{-1} \dfrac{\tau}{\hat{\tau}}\|\hat{z}-y\|^{1-q}<\tau q^{-1} \|\hat{z}-y\|^{1-q}. \end{align}\] Then, \(q\|\hat{z}-y\|^{q-1}\|z^*-y^*\|<\eta\) and \(q\|\hat{z}-y\|^{q-1}\|\hat{u}^*\|<\tau.\) The last inequality contradicts the assumption.
Let \(\gamma:=\tau^{-1}\), \(x\in B_{\delta}(\bar x)\), \(y\in B_{\delta}(\bar y)\) with \(x\notin F^{-1} (y)\), and \(z\in F(x)\) with \(\|z-y\|<\min\{(\tau\mu)^{\frac{1}{q}},1\}\). Under the assumptions made, the function \(\psi_y\) is convex. For any \((\hat{x}^*,\hat{z}^*)\in\partial\psi_y(x,z)\), we have \[\begin{align} \|(\hat{x}^*,\hat{z}^*)\|_{\gamma} &=\sup_{\substack{(u,v)\ne(0,0)}} \dfrac{\left\langle (\hat{x}^*,\hat{z}^*),(u,v) \right\rangle} {\|(u,v)\|_{\gamma}}\\ &=\limsup_{\substack{u{\to}x,\, v\to z\\ (u,v)\ne(x,z)}} \dfrac{-\left\langle (\hat{x}^*,\hat{z}^*),(u-x,v-z) \right\rangle}{\|(u-x,v-z)\|_{\gamma}}\\ &\ge\limsup_{\substack{u{\to}x,\, v\to z\\ (u,v)\ne(x,y)}} \dfrac{\psi_y(x,z)-\psi_y(u,v)} {\|(u-x,v-z)\|_{\gamma}}\\ &=\limsup_{\substack{u\to x,\,v\to z\\(u,v)\in{\rm gph}\,F,\,(u,v)\ne(x,z)}} \dfrac{\|z-y\|^q-\|v-y\|^q}{\|(u-x,v-z)\|_\gamma} \ge \tau. \end{align}\] Observe that \(\psi_y\) is the sum of the convex continuous function \(v\mapsto g(v):=\|v-y\|^q\) and the indicator function of the convex set \({\rm gph}\,F\). Note that \(z\ne y\), and consequently, \(\partial g(z)=q\|z-y\|^{q-1}\partial\|\cdot-y\|(z).\) By Lemma 5(i), \(\partial\psi_y(x,z)=\{0\}\times\partial g(z)+N_{{\rm gph}\,F}(x,z)\). Let \(\theta:=q\|z-y\|^{q-1}\). Then, there exist \((u^*,v^*)\in N_{{\rm gph}\,F}(x,z)\) and \(z^*\in Y^*\) satisfying \(\|z^*\|=1\), \(\langle z^*,z-y\rangle=\|z-y\|\), and \(\|u^*\|+\tau\|\theta z^*+v^*\|\ge \tau.\) Let \(\hat{u}^*:= u^*/\theta\), \(y^*:=-v ^*/\theta\). Then \((\hat{u}^*,-y^*)\in N_{{\rm gph}\,F}(x,z)\), and \(\|\hat{u}^*\|+\tau\|z^*-y^*\|\ge \tau/\theta.\) Thus, \(q\|z-y\|^{q-1}\|\hat{u}^*\|>\tau(1-\eta)\) if \(q\|z-y\|^{q-1}\|y^*-z^*\|<\eta\) for any \(\eta\in (0,1)\).

The proof is complete. 0◻ ◻

Parts (i) and (ii) of Theorem 3 improve [10], respectively. In the case \(q=1\), Theorem 3 recaptures [42].

4 The Hölder version of the Lyusternik-Graves theorem↩︎

The next theorem establishes the Hölder version of Theorem 1.

Theorem 4. Let \(X\) be a complete metric space, \(Y\) be a linear space with a shift-invariant metric, \(F:X\rightrightarrows Y\), \((\bar x,\bar y)\in{\rm gph}\,F\), \(f: X\rightarrow Y\) with \(f(\bar x)=0\), \({\rm gph}\,F\) be closed near \((\bar x,\bar y)\), and \(q\in(0,1]\). Then \[\begin{align} \label{T5462-4} {\rm{rg}}^q(F+f)(\bar x,\bar y)\ge {\rm{rg}}^qF(\bar x,\bar y)-{\rm{lip}}^qf(\bar x). \end{align}\qquad{(6)}\]

Proof. If either \({\rm{rg}}^qF(\bar x,\bar y)=0\) or \({\rm{lip}}^{q}f(\bar x)=+\infty\), then ?? holds automatically. Suppose \(F\) is metrically regular of order \(q\) at \((\bar x,\bar y)\), and \(f\) is Hölder continuous of order \(q\) at \(\bar x\). Let \(\mu:={\rm{lip}}^{q}f(\bar x)\). If \(\mu\ge {\rm{rg}}^qF(\bar x,\bar y)\), then there is nothing to prove. Suppose \({\mu}< {\rm{rg}}^qF(\bar x,\bar y)\) and let \(\tau\in (\mu, {\rm{rg}}^qF(\bar x,\bar y))\). Choose a \(\gamma>0\) such that \(\tau-\mu>\gamma^{-1}\). Under the assumptions made, there exists a \(\delta>0\) such that the following conditions are satisfied:

inequality ?? holds for all \(x\in B_{\delta}(\bar x)\) and \(y\in B_{\delta}(\bar y)\);
the set \({\rm gph}\,F\cap[\overline{B}_{\delta}(\bar x)\times\overline{B}_{\delta}(\bar y)]\) is closed;
\(d^q(f(x),f(x'))\le\mu d(x,x')\) for all \(x,x'\in\overline{B}_{\delta}(\bar x)\).

Choose a \(\hat{\delta}>0\) such that \(\gamma(4\hat{\delta})^q+\hat{\delta}<\delta\), \(\mu^{\frac{1}{q}}(\gamma(4\hat{\delta})^q+\hat{\delta})^{{\frac{1}{q}}}+\hat{\delta}<\delta\), and \(\hat{\delta}^{1-q}<\gamma(2^q-1)\). Let \(x\in B_{\hat{\delta}}(\bar x)\) and \(y\in B_{\hat{\delta}}(\bar y)\). We first show that \[\begin{gather} \label{T1461-1} d(x,(F+f)^{-1} (y))\le \gamma d^q(y,y') \end{gather}\tag{10}\] for all \(y'\in (F(x)+f(x))\cap B_{3\hat{\delta}}(\bar y)\). If \(y'=y\), then inequality 10 is satisfied. Suppose \(y'\ne y\). Define \(\Phi: X\rightrightarrows X\) by \(\Phi(u):=F^{-1} (-f(u)+y)\) for all \(u\in B_{\hat{\delta}}(\bar x)\). We are going to prove that three conditions (i)-(iii) in Lemma 3 are satisfied with \(\delta':=\gamma d^q(y,y')\) and \(\theta:=\tau^{-1} \mu\).

Observe that \(\delta'\le \gamma(d(y,\bar y)+d(\bar y,y'))^q< \gamma(4\hat{\delta})^q\). Let \(\{(x_n,z_n)\}_{n\in\mathbb{N}}\) be a sequence in \({\rm gph}\,\Phi\cap[\overline{B}_{\delta'}(x)\times \overline{B}_{\delta'}(x)]\) and suppose that it converges to a point \((\hat{x},\hat{z})\in X\times Y\). For all \(n\in\mathbb{N}\), it holds that \((z_n,-f(x_n)+y)\in{\rm gph}\,F\), \[\begin{gather} d(z_n,\bar x)\le d(z_n,x)+d(x,\bar x)<\delta'+\hat{\delta}<\gamma(4\hat{\delta})^q+\hat{\delta}<\delta, \end{gather}\] and \[\begin{align} d(-f(x_n)+y,\bar y) &\le d(f(x_n),0)+d(y,\bar y) \le \mu^{\frac{1}{q}} d^{{\frac{1}{q}}}(x_n,\bar x)+\hat{\delta}\\ &<\mu^{\frac{1}{q}} (\delta'+\hat{\delta})^{{\frac{1}{q}}}+\hat{\delta}<\mu^{\frac{1}{q}} (\gamma(4\hat{\delta})^q+\hat{\delta})^{{\frac{1}{q}}}+\hat{\delta}<\delta. \end{align}\] Thus, \((z_n,-f(x_n)+y)\in{\rm gph}\,F\cap[\overline{B}_{\delta}(\bar x)\times\overline{B}_{\delta}(\bar y)]\) for all \(n\in\mathbb{N}\). Note that \(\overline{B}_{\delta'}(x)\subset \overline{B}_{\delta}(\bar x)\) since for any \(x'\in \overline{B}_{\delta'}(x)\), one has \(d(x',\bar x)\le d(x',x)+d(x,\bar x)\le\delta'+\hat{\delta}<\delta.\) The continuity of \(f\) on \(\overline{B}_{\delta'}(x)\) implies \((\hat{z},-f(\hat{x})+y)\in{\rm gph}\,F\cap[\overline{B}_{\delta'}(x)\times\overline{B}_{\delta}(\bar y)]\). Thus, \((\hat{x},\hat{z})\in{\rm gph}\,\Phi\cap[\overline{B}_{\delta'}(x)\times\overline{B}_{\delta'}(x)]\), and consequently, \({\rm gph}\,\Phi\cap[\overline{B}_{\delta'}(x)\times \overline{B}_{\delta'}(x)]\) is closed.
One has \[d(-f(x)+y,\bar y)\le d(f(x),0)+d(y,\bar y) <\mu^{\frac{1}{q}}\hat{\delta}^{{\frac{1}{q}}}+\hat{\delta}<\delta.\] Hence, \(-f(x)+y\in B_{\delta}(\bar y)\). One has \[\begin{align} d(x,\Phi(x)) &= d(x,F^{-1} (-f(x)+y)) \le\tau^{-1} d^q(-f(x)+y,F(x))\\ &\le\tau^{-1} d^q(-f(x)+y,-f(x)+y')= \tau^{-1} d^q(y,y')\\ &<\gamma d^q(y,y')(1-\tau^{-1} \mu)=\delta'(1-\theta). \end{align}\]
Observe that \[d(-f(v )+y,\bar y) \le d(f(v),0)+d(y,\bar y)<\mu^{\frac{1}{q}}\hat{\delta}^{{\frac{1}{q}}}+\hat{\delta} <\delta\] for any \(v\in\overline{B}_{\delta'}(x)\). For all \(u,v\in\overline{B}_{\delta'}(x)\), one has \[\begin{align} e(\Phi(u)\cap\overline{B}_{\delta'}(x),\Phi(v)) &=\sup_{z\in F^{-1} (-f(u)+y)\cap \overline{B}_{\delta'}(x)}d(z,F^{-1} (-f(v)+y))\\ &\le\tau^{-1} \sup_{z\in F^{-1} (-f(u)+y)\cap \overline{B}_{\delta'}(x)} d^q(-f(v)+y,F(z))\\ &\le\tau^{-1} d^q(f(u),f(v)) \le\tau^{-1} \mu d(u,v)=\theta d(u,v). \end{align}\]

By Lemma 3, there exists an \(\hat{x}\in \overline{B}_{\delta'}(x)\) such that \(\hat{x}\in\Phi(\hat{x})\). In other words, \(y\in F(\hat{x})+f(\hat{x})\) and \(d(\hat{x},x)\le \gamma d^q(y,y')\), and consequently, 10 holds.

We now arrive at the final stage of showing that \[\begin{gather} \label{T1461-2} d(x,(F+f)^{-1} (y))\le \gamma d^q(y,F(x)+f(x)). \end{gather}\tag{11}\] If \(F(x)+f(x)=\emptyset\), then there is nothing to prove. Let \(\varepsilon>0\) and choose a point \(\hat{y}\in F(x)+f(x)\) with \(d^q(y,\hat{y})< d^q(y,F(x)+f(x))+\varepsilon\). If \(\hat{y}\in B_{3\hat{\delta}}(\bar y)\), then in view of 10 , one obtains \[\begin{gather} d(x,(F+f)^{-1} (y))\le\gamma d^q(y,\hat{y})\le \gamma(d^q(y,F(x)+f(x))+\varepsilon). \end{gather}\] Letting \(\varepsilon\downarrow 0\), one arrives at 11 . If \(\hat{y}\notin B_{3\hat{\delta}}(\bar y)\), then \(d(\hat{y},y)\ge d(\hat{y},\bar y)-d(y,\bar y)>2\hat{\delta},\) and consequently, \[\begin{align} d(x,(F+f)^{-1} (y)) &\le e(B_{\hat{\delta}}(\bar x),(F+f)^{-1} (y))=\sup_{x'\in B_{\hat{\delta}}(\bar x)}d(x',(F+f)^{-1} (y))\\ &<\hat{\delta}+d(\bar x,(F+f)^{-1} (y))\le {\hat{\delta}}+\tau^{-1} d^q(y,\bar y) < {\hat{\delta}}+\gamma d^q(y,\bar y)\\ &<\hat{\delta}+\gamma\hat{\delta}^q\le \gamma(2^q-1)\hat{\delta}^q+\gamma\hat{\delta}^q =\gamma(2\hat{\delta})^q\\ &<\gamma d^q(\hat{y},y)< \gamma(d^q(y,F(x)+f(x))+\varepsilon). \end{align}\] Letting \(\varepsilon\downarrow 0\), one arrives at 11 . The inequality ?? is obtained by letting \(\gamma\downarrow (\tau-\mu)\). The proof is complete. 0◻ ◻

Theorem 4 offers an affirmative answer to the question posed by Dontchev [27] and enhances [15] providing the exact quantitative estimate for the modulus of regularity of the perturbed mapping.

Let \(f,g:X\rightarrow Y\) be functions between metric spaces, and \(q>0\). We say that \(g\) is a strict approximation of order \(q\) to \(f\) at \(\bar x\in X\) if \(f(\bar x)=g(\bar x)\) and \({\rm{lip}}^q(f-g)(\bar x)=0\). In the case \(q=1\), the function \(g\) is called a strict first-order approximation to \(f\) at \(\bar x\); cf. [1].

Corollary 1. Let \(X\) be a complete metric space, \(Y\) be a linear space with a shift-invariant metric, \(F:X\rightrightarrows Y\), \((\bar x,\bar y)\in{\rm gph}\,F\), \(f,g:X\rightarrow Y\) with \(f(\bar x)=0\), \({\rm gph}\,F\) be closed near \((\bar x,\bar y)\), and \(q\in(0,1]\). Suppose that \(g\) is a strict approximation of order \(q\) to \(f\) at \(\bar x\). The mapping \(F+f\) is metrically regular of order \(q\) at \((\bar x,\bar y)\) if and only if \(F+g\) is metrically regular of order \(q\) at \((\bar x,\bar y)\). Moreover, \[\begin{align} {\rm{rg}}^q(F+f)(\bar x,\bar y)={\rm{rg}}^q(F+g)(\bar x,\bar y). \end{align}\]

Proof. Observe that \(F+f=F+g+(f-g)\) and \({\rm{lip}}^{q}(f-g)(\bar x)=0\). The statement is a direct consequence of Theorem 4. 0◻ ◻

In the case \(q=1\), Corollary 1 yields the following result; cf. [43], [6].

Corollary 2. Let \(X,Y\) be Banach spaces, \(F:X\rightrightarrows Y\), \({\rm gph}\,F\) be closed, \(f:X\rightarrow Y\) be continuously Fréchet differentiable at \(\bar x\in X\), and \(\bar y\in F(\bar x)+f(\bar x)\). The mapping \(F+f\) is metrically regular at \((\bar x,\bar y)\) if and only if \(F(\cdot)+f(\bar x)+\nabla f(\bar x)(\cdot-\bar x)\) is metrically regular at \((\bar x,\bar y)\). Moreover, \[\begin{align} {\rm{rg}}(F+f)(\bar x,\bar y)={\rm{rg}}(F(\cdot)+f(\bar x)+\nabla f(\bar x)(\cdot-\bar x))(\bar x,\bar y). \end{align}\]

Proof. Let \(g(x):=f(\bar x)+\nabla f(\bar x)(x-\bar x)\) for all \(x\in X\). The continuous differentiability of \(f\) implies that \({\rm{lip}}(f-g)(\bar x)=0\). The statement follows from Corollary 1 with \(q=1\). 0◻ ◻

5 Convergence analysis of a Newton-type method↩︎

Let \(X,Y\) be Banach spaces, \(F:X\rightrightarrows Y\), and \(f: X\rightarrow Y\). Consider the problem

‘find \(x\in X\) such that \(f(x)+F(x)\ni 0\)’.

The aforementioned model has been used to describe in a unified way various problems [1], [44], [45]. The classical case of a nonlinear equation corresponds to \(F(x)=0\), whereas by taking \(Y=\mathbb{R}^n\) and \(F=\mathbb{R}^n_+\) one has a system of inequality constraints. The case of \(F\) being the normal cone mapping associated with a closed convex subset of a normed space \(X\) and \(Y=X^*\) results in a variational inequality.

Let \(S:=\{x\in X\mid f(x)+F(x)\ni 0\}\) be the solution set. From now on, we assume that \(\bar x\) is a given point in \(S\). Consider the Newton sequence \(\{x_k\}_{k\in\mathbb{N}}\) given by \[\begin{gather} f(x_k)+\nabla f(x_k)(x_{k+1}-x_k)+F(x_{k+1})\ni 0 \;\; \text{for} \;\; k=0,1,2,\ldots. \end{gather}\]

Theorem 5. Suppose \({\rm gph}\,F\) is closed, \(f\) is continuously Fréchet differentiable near \(\bar x\), the derivative mapping \(\nabla f\) is Lipschitz continuous at \(\bar x\), and \(F+f\) is metrically regular at \((\bar x,0)\). Then there exists a \(\delta>0\) such that for any \(x^\star\in S\cap B_{\frac{\delta}{2}}(\bar x)\) and \(u\in B_{\delta}(\bar x)\), there exists a Newton sequence \(\{x_k\}_{k\in\mathbb{N}}\) in \(B_\delta(\bar x)\) with \(x_0:=u\) converging quadratically to \(x^\star\).

Proof. For each \(u\in X\), define \(\Phi_u: X\rightrightarrows Y\) by \(\Phi_u(x):=f(u)+\nabla f(u)(x-u)+F(x)\) for all \(x\in X\). We first show that there exist \(\tau>0\) and \(\delta'>0\) such that \[\begin{gather} \label{phi} \tau d(x,\Phi_u^{-1} (0))\le d(0,\Phi_u(x)) \end{gather}\tag{12}\] for all \(u,x\in B_{\delta'}(\bar x)\). By Corollary 2, the mapping \(\Phi_{\bar x}\) is metrically regular at \((\bar x,0)\), and \({\rm{rg}}\Phi_{\bar x}(\bar x,0)={\rm{rg}}(F+f)(\bar x,0)\). Let \(\gamma>0\) be such that \(\frac{1}{2}{\rm{lip}}\nabla f(\bar x)/{\rm{rg}}(F+f)(\bar x,0)<\gamma.\) Choose positive numbers \(\tau,\tau',\mu,\mu'\) such that \(\tau<\tau'-\mu'<\tau'<{\rm{rg}}(F+f)(\bar x,0)\), \(\mu>{\rm{lip}}\nabla f(\bar x)\), and \(\frac{1}{2}\mu\tau^{-1} <\gamma.\) The Lipschitz condition guarantees the existence of some constants \(\delta_1>0\) and \(\lambda>0\) such that \(\|\nabla f(x)-\nabla f(x')\|\le \lambda \|x-x'\|\) for all \(x,x'\in B_{\delta_1}(\bar x).\) Taking \(\delta_1\) smaller if necessary, one can ensure that \(\Phi_{\bar x}\) is metrically regular at \((\bar x,0)\) with constants \(\tau\) and \(\delta_1\), and \(\|\nabla f(u)-\nabla f(\bar x)\|\le\mu'\) for all \(u\in B_{\delta_1}(\bar x).\) Let \(u\in B_{\delta_1}(\bar x)\). Define \(\psi_u:B_{\delta_1}(\bar x)\rightarrow Y\) by \[\begin{gather} \psi_u(x):=f(u)+\nabla f(u)(x-u)-f(\bar x)-\nabla f(\bar x)(x-\bar x) \;\; \text{for all} \;\; x\in B_{\delta_1}(\bar x). \end{gather}\] We obtain \(\|\psi_u(x)-\psi_u(x')\|\le \|\nabla f(u)-\nabla f(\bar x)\|\|x-x'\|\le\mu'\|x-x'\|\) for all \(x,x'\in B_{\delta'}(\bar x)\). Thus, \(\psi_u\) is Lipschitz continuous near \(\bar x\) with \({\rm{lip}}\psi_u(\bar x)\le\mu'\). In view of Theorem 4 with \(q=1\), the mapping \(\Phi_u(x)=\psi_u+\Phi_{\bar x}\) is metrically regular at \((\bar x,\psi_u(\bar x))\) with constant \(\tau'-\mu'\) (and consequently, with constant \(\tau\)) for some \(\delta_2\in(0,\delta_1]\). In other words, \(\tau d(x,\Phi_u^{-1} (y))\le d(y,\Phi_u(x))\) for all \(x\in B_{\delta_2}(\bar x)\) and \(y\in B_{\delta_2}(\psi_u(\bar x)).\) Choose a \(\delta'\in (0,\delta_2]\) such that \(\frac{\gamma}{2}\delta'^2\le\delta_2\). For any \(u\in B_{\delta'}(\bar x)\), \[\begin{align} \|\psi_u(\bar x)\| &=\|f(u)+\nabla f(u)(\bar x-u)-f(\bar x)\|\\ &=\left\|\int_{0}^{1}\nabla f(\bar x+t(u-x))(u-\bar x)dt-\nabla f(u)(u-\bar x)\right\|\\ &\le\gamma\|u-\bar x\|^{2}\int_{0}^{1}(1-t)dt=\dfrac{\gamma}{2}\|u-\bar x\|^{2}<\dfrac{\gamma}{2}\delta'^2\le\delta_2. \end{align}\] Thus, 12 holds for all \(u,x\in B_{\delta'}(\bar x)\). Choose \(\delta\in(0,\delta']\) such that \(\frac{9}{2}\gamma \delta\le1\), and let \(x^\star\in S\cap B_{\frac{\delta}{2}}(\bar x),\) \(u\in B_{\delta}(\bar x)\). We are going to prove the existence of \(x_1\) satisfying \[\begin{gather} \label{T5463-6} \Phi_u(x_1)\ni 0,\;\;\; \|x_1-x^\star\|\le \gamma\|u-x^\star\|^{2},\;\;\; x_1\in B_{\delta}(\bar x). \end{gather}\tag{13}\] If \(d(0,\Phi_u(x^\star))=0\), then 13 holds with \(x_1:=x^\star\). Suppose \(d(0,\Phi_u(x^\star))>0\). One has \[\begin{align} d(x^\star,\Phi_u^{-1} (0)) &\le \tau^{-1} d(0,\Phi_u(x^\star))<2\gamma\mu^{-1} d(0,\Phi_u(x^\star))\\ &\le 2\gamma\mu^{-1} \|f(u)+\nabla f(u)(x-u)-f(x^\star)\|\le \gamma\|u-x^\star\|^{2}. \end{align}\] Thus, there exists an \(x_1\in \Phi_u^{-1} (0)\) such that \(\|x_1-x^\star\|\le \gamma\|u-x^\star\|^{2}.\) Besides, \[\begin{align} \|x_1-\bar x\| &\le \|x_1-x^\star\|+\|x^\star-\bar x\|<\gamma\|u-x^\star\|^{2}+\dfrac{\delta}{2}\\ &\le \gamma (\|u-\bar x\|+\|x^\star-\bar x\|)^{2}+\dfrac{\delta}{2}< \dfrac{9}{4}\gamma \delta^{2}+\dfrac{\delta}{2}<\dfrac{\delta}{2}+\dfrac{\delta}{2}=\delta. \end{align}\] Applying the same argument with \(u:=x_1\), one obtains the existence of \(x_2\) such that \[\begin{gather} \Phi_u(x_2)\ni 0,\;\;\; \|x_2-x^\star\|\le \gamma\|x_1-x^\star\|^{2},\;\;\; x_2\in B_{\delta}(\bar x). \end{gather}\] By this procedure, one can find a Newton sequence \(\{x_k\}_{k\in\mathbb{N}}\) in \(B_\delta(\bar x)\) satisfying \(\|x_{k+1}-x^\star\|\le\gamma\|x_k-x^\star\|^{2}\) for all \(k\in\mathbb{N}\). Let \(\theta:=\gamma\|u-x^\star\|\). Then, \(\theta\le\gamma(\|u-\bar x\|+\|x^\star-\bar x\|) \le\dfrac{3}{2}\delta\gamma \le \dfrac{3}{2}\cdot\dfrac{2}{9} <1.\) One has \(\|x_{k}-x^\star\|\le\theta^{2^{k}-1}\|u-x^\star\|\) for all \(k\in\mathbb{N}\), and consequently, \(\{x_k\}_{k\in\mathbb{N}}\) converges quadratically to \(x^\star\). The proof is complete. 0◻ ◻

In the particular case \(x^\star=\bar x\), Theorem 5 recaptures [29].

6 Conclusions↩︎

Primal and dual necessary and sufficient conditions for Hölder metric regularity have been established. The Hölder version of the extended Lyusternik-Graves theorem providing an affirmative answer to the open question posed by Donchev [27] has been proved. The results have been applied to convergence analysis of a Newton-type method. The following problems are going to be studied in the future research.

Establishing radius results for regularity properties is, of course, an important topic of variational analysis. However, up to now, most of the results have been for the linear setting, and there are very few publications studying perturbations of metric regularity properties in the Hölder framework. Among them, let us mention the papers by He and Ng [15] for metric regularity; Mordukhovich and Ouyang [46], Ouyang and Li [47], and Cibulka et al. [48] for strong subregularity. Hence, it would be good to have a (hopefully) complete picture of stability results under various types of single-valued and set-valued perturbations in the Hölder setting for properties such as semiregularity [49]; subregularity, strong subregularity, regularity, strong regularity [1]; pseudo-regularity [50]; and others.
Formulating parameterized versions of Theorem 4 for other regularity properties. In the case of metric regularity, we expect to recapture [1] when \(q=1\).
Is the estimate ?? sharp? In other words, if \({\rm{rg}}^qF(\bar x,\bar y)<+\infty\) and \(\mu\in [0,{\rm{rg}}^qF(\bar x,\bar y)]\), does there exist a single-valued mapping \(f:X\rightarrow Y\) with \(f(\bar x)=0\), \[\begin{gather} {\rm{lip}}^{q}f(\bar x)=\mu, \;\; \text{and} \;\; {\rm{rg}}^q(F+f)(\bar x,\bar y)= {\rm{rg}}^qF(\bar x,\bar y)-\mu^q. \end{gather}\] In the case \(q=1\), the aforementioned problem can be traced back to the open question raised by Ioffe [51] to which Gfrerer and Kruger [52] have recently provided an affimative answer in the Asplund setting.
It is well known that metric regularity properties of set-valued mappings have strong connections with transversality properties of collections of sets [8], [53], [54]. While radius results for regularity properties have been investigated, there are no available results for transversality properties. It would be good to establish radius theorems for models involving collections of sets.

Nguyen Duy Cuong has been supported by the Postdoctoral Scholarship Programme of Vingroup Innovation Foundation (VinIF) code VINIF.2022.STS.40. The author wishes to thank Alexander Kruger for comments and suggestions.

The author has no competing interests to declare that are relevant to the content of this article.

Data sharing is not applicable to this article as no datasets have been generated or analysed during the current study.

References↩︎

[1]

Dontchev, A.L., Rockafellar, R.T.: Implicit Functions and Solution Mappings. A View from Variational Analysis, 2 edn. Springer Series in Operations Research and Financial Engineering. Springer, New York (2014).

[2]

Mordukhovich, B.S.: Variational Analysis and Generalized Differentiation. II: Applications, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 331. Springer, Berlin (2006).

[3]

Ioffe, A.D.: Variational Analysis of Regular Mappings. Theory and Applications. Springer Monographs in Mathematics. Springer (2017).

[4]

Rockafellar, R.T., Wets, R.J.B.: Variational Analysis. Springer, Berlin (1998).

[5]

Mordukhovich, B.S.: Variational Analysis and Generalized Differentiation. I: Basic Theory, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 330. Springer, Berlin (2006).

[6]

Dontchev, A.L., Rockafellar, R.T.: Regularity and conditioning of solution mappings in variational analysis. Set-Valued Anal. 12(1-2), 79–109 (2004).

[7]

Dontchev, A.L., Lewis, A.S., Rockafellar, R.T.: The radius of metric regularity. Trans. Amer. Math. Soc. 355(2), 493–517 (2003).

[8]

Cuong, N.D., Kruger, A.Y.: Transversality properties: Primal sufficient conditions. Set-Valued Var. Anal. 29(2), 221–256 (2021).

[9]

Frankowska, H., Quincampoix, M.: Hölder metric regularity of set-valued maps. Math. Program., Ser. A 132(1-2), 333–354 (2012).

[10]

Chuong, T.D.: Metric regularity of a positive order for generalized equations. Appl. Anal. 94(6), 1270–1287 (2015).

[11]

Chuong, T.D., Kim, D.S.: Hölder-like property and metric regularity of a positive-order for implicit multifunctions. Math. Oper. Res. 41(2), 596–611 (2016).

[12]

Yen, N.D., Yao, J.C., Kien, B.T.: Covering properties at positive-order rates of multifunctions and some related topics. J. Math. Anal. Appl. 338(1), 467–478 (2008).

[13]

Lee, J.H., Pham, T.S.: Openness, hölder metric regularity, and hölder continuity properties of semialgebraic set-valued maps. SIAM Journal on Optimization 32(1), 56–74 (2022).

[14]

Klatte, D., Kruger, A.Y., Kummer, B.: From convergence principles to stability and optimality conditions. J. Convex Anal. 19(4), 1043–1072 (2012).

[15]

He, Y., Ng, K.F.: Stability of \(p-\)order metric regularity. Vietnam Journal of Mathematics 46(2), 285–291 (2018).

[16]

Kirk, W.A.: Hölder continuity and minimal displacement. Numerical Functional Analysis and Optimization 19(1-2), 71–79 (1998).

[17]

Penot, J.P.: Calculus Without Derivatives, Graduate Texts in Mathematics, vol. 266. Springer, New York (2013).

[18]

Adly, S., Ngai, H.V., Vu, N.V.: Stability of metric regularity with set-valued perturbations and application to Newton’s method for solving generalized equations. Set-Valued and Variational Analysis 25(3), 543–567 (2017).

[19]

Dontchev, A.L., Frankowska, H.: Lyusternik-Graves theorem and fixed points II. J. Convex Anal. 19(4), 955–973 (2012).

[20]

He, Y., Xu, W.: An improved stability result on the metric regularity under Lipschitz set-valued perturbations. J. Math. Anal. Appl. 514(1), article no. 126253 (2022).

[21]

Ioffe, A.D.: On perturbation stability of metric regularity. Set-Valued Anal. 9(1-2), 101–109 (2001).

[22]

Lyusternik, L.A.: On the conditional extrema of functionals. Mat. Sbornik 41(3), 390–401 (1934).

[23]

Graves, L.M.: Some mapping theorems. Duke Math. J. 17, 111–114 (1950).

[24]

Dmitruk, A.V., Milyutin, A.A., Osmolovsky, N.P.: Lyusternik’s theorem and the theory of extrema. Russian Math. Surveys 35, 11–51 (1980).

[25]

Ioffe, A.D.: Metric regularity and subdifferential calculus. Russian Math. Surveys 55, 501–558 (2000).

[26]

Dontchev A.L., Frankowska, H.: Lyusternik-Graves theorem and fixed points. Proceedings of the American Mathematical Society 139(2), 521–534 (2011).

[27]

Dontchev, A.L.: A proof of the Lyusternik-Graves theorem. Optimization 64(1), 41–48 (2015).

[28]

Xu, W.: Estimation of the Modulus of Hölder Metric Regularity. Functional Analysis and Its Applications 56(2), 138–143 (2022).

[29]

Dontchev, A.L.: Lectures on Variational Analysis. Applied Mathematical Sciences. Springer Cham (2022).

[30]

Kruger, A.Y.: On Fréchet subdifferentials. J. Math. Sci. (N.Y.) 116(3), 3325–3358 (2003).

[31]

Zălinescu, C.: Convex Analysis in General Vector Spaces. World Scientific Publishing Co. Inc., River Edge, NJ (2002).

[32]

Ngai, H.V., Théra, M.: Error bounds in metric spaces and application to the perturbation stability of metric regularity. SIAM J. Optim. 19(1), 1–20 (2008).

[33]

Azé, D., Corvellec, J.N., Lucchetti, R.E.: Variational pairs and applications to stability in nonsmooth analysis. Nonlinear Anal., Ser. A: Theory Methods 49(5), 643–670 (2002).

[34]

Kruger, A.Y.: Error bounds and metric subregularity. Optimization 64(1), 49–79 (2015).

[35]

Cuong, N.D., Kruger, A.Y.: Primal necessary characterizations of transversality properties. Positivity 25(2), 531–558 (2021).

[36]

Cuong, N.D., Kruger, A.Y.: Dual sufficient characterizations of transversality properties. Positivity 24(5), 1313–1359 (2020).

[37]

Dontchev, A.L., Hager, W.W.: An inverse mapping theorem for set-valued maps. Proc. Amer. Math. Soc. 121(2), 481–489 (1994).

[38]

Ekeland, I.: On the variational principle. J. Math. Anal. Appl. 47, 324–353 (1974).

[39]

Fabian, M.: Subdifferentiability and trustworthiness in the light of a new variational principle of Borwein and Preiss. Acta Univ. Carolinae 30, 51–56 (1989).

[40]

Phelps, R.R.: Convex Functions, Monotone Operators and Differentiability, Lecture Notes in Mathematics, vol. 1364, 2nd edn. Springer-Verlag, Berlin (1993).

[41]

Borwein, J.M., Zhu, Q.J.: Techniques of Variational Analysis. Springer, New York (2005).

[42]

Cuong, N.D., Kruger, A.Y.: Uniform regularity of set-valued mappings and stability of implicit multifunctions. 2(2021).

[43]

Dontchev, A.L.: Lectures on Variational Analysis. Applied Mathematical Sciences. Springer, Cham (2021).

[44]

Klatte, D., Kummer, B.: Nonsmooth Equations in Optimization. Regularity, Calculus, Methods and Applications, Nonconvex Optimization and its Applications, vol. 60. Kluwer Academic Publishers, Dordrecht (2002).

[45]

Alexey F. Izmailov, M.V.S.: Newton-Type Methods for Optimization and Variational Problems. Springer Series in Operations Research and Financial Engineering. Springer Cham (2014).

[46]

Mordukhovich, B.S., Ouyang, W.: Higher-order metric subregularity and its applications. J. Global Optim. 63(4), 777–795 (2015).

[47]

Ouyang, W., Li, L.: Hölder strong metric subregularity and its applications to convergence analysis of inexact newton methods (2021).

[48]

Cibulka, R., Dontchev, A.L., Kruger, A.Y.: Strong metric subregularity of mappings in variational analysis and optimization. J. Math. Anal. Appl. 457(2), 1247–1282 (2018).

[49]

Cibulka, R., Fabian, M., Kruger, A.Y.: On semiregularity of mappings. J. Math. Anal. Appl. 473(2), 811–836 (2019).

[50]

Gfrerer, H.: On metric pseudo-(sub)regularity of multifunctions and optimality conditions for degenerated mathematical programs. Set-Valued Var. Anal 22(1), 79–115 (2014).

[51]

Ioffe, A.D.: On robustness of the regularity property of maps. Control Cybernet. 32, 543–554 (2003).

[52]

Gfrerer, H., Kruger, A.Y.: The radius of metric regularity revisited. Set-Valued and Variational Analysis 31(3), 20 (2023).

[53]

Cuong, N.D., Kruger, A.Y.: Nonlinear transversality of collections of sets: Dual space necessary characterizations. J. Convex Anal. 27(1), 287–308 (2020).

[54]

Kruger, A.Y., Thao, N.H.: Regularity of collections of sets and convergence of inexact alternating projections. J. Convex Anal. 23(3), 823–847 (2016).

Lyusternik-Graves Theorem for Hölder Metric Regularity