Banach fixed-point theorem: Difference between revisions
imported>Tito Omburo →Generalizations: compact case |
imported>Ascchrvalstr m →Statement: Rewrite math expressions using LaTeX |
||
| Line 3: | Line 3: | ||
==Statement== | ==Statement== | ||
''Definition.'' Let <math>(X, d)</math> be a [[metric space]]. Then a map <math>T : X \to X</math> is called a [[contraction mapping]] on | ''Definition.'' Let <math>(X, d)</math> be a [[metric space]]. Then a map <math>T : X \to X</math> is called a [[contraction mapping]] on <math>X</math> if there exists <math>q \in [0, 1)</math> such that | ||
:<math>d(T(x),T(y)) \le q d(x,y)</math> | :<math>d(T(x),T(y)) \le q d(x,y)</math> | ||
for all <math>x, y \in X.</math> | for all <math>x, y \in X.</math> | ||
<blockquote> | <blockquote> | ||
'''Banach fixed-point theorem.''' Let <math>(X, d)</math> be a non-[[Empty set|empty]] [[complete metric space]] with a contraction mapping <math>T : X \to X.</math> Then | '''Banach fixed-point theorem.''' Let <math>(X, d)</math> be a non-[[Empty set|empty]] [[complete metric space]] with a contraction mapping <math>T : X \to X.</math> Then <math>T</math> admits a unique [[Fixed point (mathematics)|fixed-point]] <math>x^*</math> in <math>X</math> (i.e. <math>T(x^*) = x^*</math>). Furthermore, <math>x^*</math> can be found as follows: start with an arbitrary element <math>x_0 \in X</math> and define a [[sequence]] <math>(x_n)_{n\in\mathbb N}</math> by <math>x_n = T(x_{n-1})</math> for <math>n \geq 1.</math> Then <math>\lim_{n \to \infty} x_n = x^*</math>.</blockquote> | ||
''Remark 1.'' The following inequalities are equivalent and describe the [[Rate of convergence|speed of convergence]]: | ''Remark 1.'' The following inequalities are equivalent and describe the [[Rate of convergence|speed of convergence]]: | ||
<math display="block"> | |||
\begin{align} | \begin{align} | ||
d(x^*, x_n) & \leq \frac{q^n}{1-q} d(x_1,x_0), \\[5pt] | d(x^*, x_n) & \leq \frac{q^n}{1-q} d(x_1,x_0), \\[5pt] | ||
| Line 20: | Line 19: | ||
</math> | </math> | ||
Any such value of | Any such value of <math>q</math> is called a ''[[Lipschitz constant]]'' for <math>T</math>, and the smallest one is sometimes called "the best Lipschitz constant" of <math>T</math>. | ||
''Remark 2.'' <math>d(T(x),T(y))<d(x,y)</math> for all <math>x \neq y</math> is in general not enough to ensure the existence of a fixed point, as is shown by the map | ''Remark 2.'' <math>d(T(x),T(y))<d(x,y)</math> for all <math>x \neq y</math> is in general not enough to ensure the existence of a fixed point, as is shown by the map | ||
<math display="block">T : [1,\infty) \to [1,\infty), \,\, T(x)=x+\tfrac{1}{x}\,,</math> | |||
which lacks a fixed point. However, if <math>X</math> is [[Compact space|compact]], then this weaker assumption does imply the existence and uniqueness of a fixed point, that can be easily found as a minimizer of <math>d(x,T(x))</math>, indeed, a minimizer exists by compactness, and has to be a fixed point of <math>T.</math> It then easily follows that the fixed point is the limit of any sequence of iterations of <math>T | which lacks a fixed point. However, if <math>X</math> is [[Compact space|compact]], then this weaker assumption does imply the existence and uniqueness of a fixed point, that can be easily found as a minimizer of <math>d(x,T(x))</math>, indeed, a minimizer exists by compactness, and has to be a fixed point of <math>T.</math> It then easily follows that the fixed point is the limit of any sequence of iterations of <math>T</math>. | ||
''Remark 3.'' When using the theorem in practice, the most difficult part is typically to define <math>X</math> properly so that <math>T(X) \subseteq X | ''Remark 3.'' When using the theorem in practice, the most difficult part is typically to define <math>X</math> properly so that <math>T(X)\subseteq X</math>. | ||
==Proof== | ==Proof== | ||
| Line 58: | Line 57: | ||
As a contraction mapping, <math>T</math> is continuous, so bringing the limit inside <math>T</math> was justified. Lastly, <math>T</math> cannot have more than one fixed point in <math>(X, d)</math>, since any pair of distinct fixed points <math>p_1</math> and <math>p_2</math> would contradict the contraction of <math>T</math>: | As a contraction mapping, <math>T</math> is continuous, so bringing the limit inside <math>T</math> was justified. Lastly, <math>T</math> cannot have more than one fixed point in <math>(X, d)</math>, since any pair of distinct fixed points <math>p_1</math> and <math>p_2</math> would contradict the contraction of <math>T</math>: | ||
<math display="block">d(p_1,p_2)=d(T(p_1),T(p_2))\leq qd(p_1, p_2)<d(p_1,p_2),</math> | |||
where the equality is due to <math>p_1,p_2</math> being fixed points of <math>T</math>, the first inequality is due to <math>T</math> being a contraction mapping, and the last inequality is due to <math>q<1</math> and <math>d(p_1,p_2)>0</math> as <math>p_1\neq p_2</math>. | |||
==Applications== | ==Applications== | ||
* A standard application is the proof of the [[Picard–Lindelöf theorem]] about the existence and uniqueness of solutions to certain [[ordinary differential equation]]s. The sought solution of the differential equation is expressed as a fixed point of a suitable integral operator on the space of continuous functions under the [[uniform norm]]. The Banach fixed-point theorem is then used to show that this integral operator has a unique fixed point. | * A standard application is the proof of the [[Picard–Lindelöf theorem]] about the existence and uniqueness of solutions to certain [[ordinary differential equation]]s. The sought solution of the differential equation is expressed as a fixed point of a suitable integral operator on the space of continuous functions under the [[uniform norm]]. The Banach fixed-point theorem is then used to show that this integral operator has a unique fixed point. | ||
* One consequence of the Banach fixed-point theorem is that small Lipschitz perturbations of the identity are [[Lipschitz continuity#Definitions|bi-lipschitz]] homeomorphisms. Let | * One consequence of the Banach fixed-point theorem is that small Lipschitz perturbations of the identity are [[Lipschitz continuity#Definitions|bi-lipschitz]] homeomorphisms. Let <math>\Omega</math> be an open set of a Banach space <math>E</math>; let <math>I:\Omega\rightarrow E</math> denote the identity (inclusion) map and let <math>g:\Omega\rightarrow E</math> be a Lipschitz map of constant <math>k<1</math>. Then | ||
# | # <math>\Omega':=(I+g)\Omega</math> is an open subset of <math>E</math>: precisely, for any <math>x</math> in <math>\Omega</math> such that <math>B(x,r)\subset\Omega</math> one has <math>B((I+g)(x),r(1-k))\subset\Omega'</math>; | ||
# | # <math>I+g:\Omega\rightarrow\Omega'</math> is a bi-Lipschitz homeomorphism; | ||
: precisely, ( | : precisely, <math>(I+g)^{-1}</math> is still of the form <math>I+h:\Omega\rightarrow\Omega'</math> with <math>h</math> a Lipschitz map of constant <math>k/(1-k)</math>. A direct consequence of this result yields the proof of the [[inverse function theorem]]. | ||
* It can be used to give sufficient conditions under which Newton's method of successive approximations is guaranteed to work, and similarly for Chebyshev's third-order method. | * It can be used to give sufficient conditions under which Newton's method of successive approximations is guaranteed to work, and similarly for Chebyshev's third-order method. | ||
* It can be used to prove existence and uniqueness of solutions to integral equations. | * It can be used to prove existence and uniqueness of solutions to integral equations. | ||
| Line 75: | Line 76: | ||
Several converses of the Banach contraction principle exist. The following is due to [[Czesław Bessaga]], from 1959: | Several converses of the Banach contraction principle exist. The following is due to [[Czesław Bessaga]], from 1959: | ||
Let | Let <math>f:X\rightarrow X</math> be a map of an abstract [[set (mathematics)|set]] such that each [[iterated function|iterate]] <math>f^n</math> has a unique fixed point. Let <math>q \in (0, 1)</math>, then there exists a complete metric on <math>X</math> such that <math>f</math> is contractive, and <math>q</math> is the contraction constant. | ||
Indeed, very weak assumptions suffice to obtain such a kind of converse. For example if <math>f : X \to X</math> is a map on a [[T1 space|''T''<sub>1</sub> topological space]] with a unique [[fixed point (mathematics)|fixed point]] | Indeed, very weak assumptions suffice to obtain such a kind of converse. For example if <math>f : X \to X</math> is a map on a [[T1 space|''T''<sub>1</sub> topological space]] with a unique [[fixed point (mathematics)|fixed point]] <math>a</math>, such that for each <math>x \in X</math> we have <math>f^n(x)\rightarrow a</math>, then there already exists a metric on <math>X</math> with respect to which <math>f</math> satisfies the conditions of the Banach contraction principle with contraction constant <math>1/2</math>.<ref>{{cite journal |first1=Pascal |last1=Hitzler | author-link1=Pascal Hitzler|first2=Anthony K. |last2=Seda |title=A 'Converse' of the Banach Contraction Mapping Theorem |journal=Journal of Electrical Engineering |volume=52 |issue=10/s |year=2001 |pages=3–6 }}</ref> In this case the metric is in fact an [[ultrametric]]. | ||
==Generalizations== | ==Generalizations== | ||
There are a number of generalizations (some of which are immediate [[Corollary|corollaries]]).<ref name=Latif2014>{{cite book |first=Abdul |last=Latif |title=Topics in Fixed Point Theory |pages=33–64 |chapter=Banach Contraction Principle and its Generalizations |publisher=Springer |year=2014 |doi=10.1007/978-3-319-01586-6_2 |isbn=978-3-319-01585-9 }}</ref> | There are a number of generalizations (some of which are immediate [[Corollary|corollaries]]).<ref name=Latif2014>{{cite book |first=Abdul |last=Latif |title=Topics in Fixed Point Theory |pages=33–64 |chapter=Banach Contraction Principle and its Generalizations |publisher=Springer |year=2014 |doi=10.1007/978-3-319-01586-6_2 |isbn=978-3-319-01585-9 }}</ref> | ||
Let | Let <math>T:X\rightarrow X</math> be a map on a complete non-empty metric space. Then, for example, some generalizations of the Banach fixed-point theorem are: | ||
*Assume that some iterate | *Assume that some iterate <math>T^n</math> of <math>T</math> is a contraction. Then <math>T</math> has a unique fixed point. | ||
*Assume that for each | *Assume that for each <math>n</math>, there exist <math>c_n</math> such that <math>d(T^n(x),T^n(y))\leq c_nd(x,y)</math> for all <math>x</math> and <math>y</math>, and that | ||
::<math>\sum\nolimits_n c_n <\infty.</math> | ::<math>\sum\nolimits_n c_n <\infty.</math> | ||
:Then | :Then <math>T</math> has a unique fixed point. | ||
In applications, the existence and uniqueness of a fixed point often can be shown directly with the standard Banach fixed point theorem, by a suitable choice of the metric that makes the map | In applications, the existence and uniqueness of a fixed point often can be shown directly with the standard Banach fixed point theorem, by a suitable choice of the metric that makes the map <math>T</math> a contraction. Indeed, the above result by Bessaga strongly suggests to look for such a metric. See also the article on [[fixed point theorems in infinite-dimensional spaces]] for generalizations. | ||
In a non-empty [[compact metric space]], any function <math>T</math> satisfying <math>d(T(x),T(y))<d(x,y)</math> for all distinct <math>x,y</math>, has a unique fixed point. The proof is simpler than the Banach theorem, because the function <math>d(T(x),x)</math> is continuous, and therefore assumes a minimum, which is easily shown to be zero. | In a non-empty [[compact metric space]], any function <math>T</math> satisfying <math>d(T(x),T(y))<d(x,y)</math> for all distinct <math>x,y</math>, has a unique fixed point. The proof is simpler than the Banach theorem, because the function <math>d(T(x),x)</math> is continuous, and therefore assumes a minimum, which is easily shown to be zero. | ||
| Line 95: | Line 96: | ||
==Example== | ==Example== | ||
An application of the Banach fixed-point theorem and fixed-point iteration can be used to quickly obtain an approximation of | An application of the Banach fixed-point theorem and fixed-point iteration can be used to quickly obtain an approximation of <math>\pi</math> with high accuracy. Consider the function <math>f(x)=\sin(x)+x</math>. It can be verified that <math>\pi</math> is a fixed point of <math>f</math>, and that <math>f</math> maps the interval <math>\left[3\pi/4,5\pi/4\right]</math> to itself. Moreover, <math>f'(x)=1+\cos(x)</math>, and it can be verified that | ||
:<math>0\leq1+\cos(x)\leq1-\frac{1}{\sqrt{2}}<1</math> | :<math>0\leq1+\cos(x)\leq1-\frac{1}{\sqrt{2}}<1</math> | ||
on this interval. Therefore, by an application of the [[mean value theorem]], | on this interval. Therefore, by an application of the [[mean value theorem]], <math>f</math> has a Lipschitz constant less than 1 (namely <math>1-1/\sqrt{2}</math>). Applying the Banach fixed-point theorem shows that the fixed point <math>\pi</math> is the unique fixed point on the interval, allowing for fixed-point iteration to be used. | ||
For example, the value 3 may be chosen to start the fixed-point iteration, as <math>3\pi/4\leq3\leq5\pi/4</math>. The Banach fixed-point theorem may be used to conclude that | For example, the value 3 may be chosen to start the fixed-point iteration, as <math>3\pi/4\leq3\leq5\pi/4</math>. The Banach fixed-point theorem may be used to conclude that | ||
| Line 105: | Line 106: | ||
: <math>\pi=f(f(f(\cdots f(3)\cdots)))).</math> | : <math>\pi=f(f(f(\cdots f(3)\cdots)))).</math> | ||
Applying ''f'' to 3 only three times already yields an expansion of | Applying ''f'' to 3 only three times already yields an expansion of <math>\pi</math> accurate to 33 digits: | ||
: <math>f(f(f(3)))=3.141592653589793238462643383279502\ldots\,.</math> | : <math>f(f(f(3)))=3.141592653589793238462643383279502\ldots\,.</math> | ||
| Line 139: | Line 140: | ||
[[Category:Articles containing proofs]] | [[Category:Articles containing proofs]] | ||
[[Category:Fixed-point theorems]] | [[Category:Fixed-point theorems]] | ||
[[Category:Metric geometry]] | [[Category:Metric geometry]] | ||
[[Category:Topology]] | [[Category:Topology]] | ||
Latest revision as of 19:51, 14 November 2025
Template:Short description In mathematics, the Banach fixed-point theorem (also known as the contraction mapping theorem or contractive mapping theorem or Banach–Caccioppoli theorem) is an important tool in the theory of metric spaces; it guarantees the existence and uniqueness of fixed points of certain self-maps of metric spaces and provides a constructive method to find those fixed points. It can be understood as an abstract formulation of Picard's method of successive approximations.[1] The theorem is named after Stefan Banach (1892–1945) who first stated it in 1922.[2][3]
Statement
Definition. Let be a metric space. Then a map is called a contraction mapping on if there exists such that
for all
Banach fixed-point theorem. Let
be a non-empty complete metric space with a contraction mapping
Then
admits a unique fixed-point
in
(i.e.
). Furthermore,
can be found as follows: start with an arbitrary element
and define a sequence
by
for
Then
.
Remark 1. The following inequalities are equivalent and describe the speed of convergence:
Any such value of is called a Lipschitz constant for , and the smallest one is sometimes called "the best Lipschitz constant" of .
Remark 2. for all is in general not enough to ensure the existence of a fixed point, as is shown by the map which lacks a fixed point. However, if is compact, then this weaker assumption does imply the existence and uniqueness of a fixed point, that can be easily found as a minimizer of , indeed, a minimizer exists by compactness, and has to be a fixed point of It then easily follows that the fixed point is the limit of any sequence of iterations of .
Remark 3. When using the theorem in practice, the most difficult part is typically to define properly so that .
Proof
Let be arbitrary and define a sequence by setting . We first note that for all we have the inequality
This follows by induction on , using the fact that is a contraction mapping. Then we can show that is a Cauchy sequence. In particular, let such that :
Let be arbitrary. Since , we can find a large so that
Therefore, by choosing and greater than we may write:
This proves that the sequence is Cauchy. By completeness of , the sequence has a limit Furthermore, must be a fixed point of :
As a contraction mapping, is continuous, so bringing the limit inside was justified. Lastly, cannot have more than one fixed point in , since any pair of distinct fixed points and would contradict the contraction of :
where the equality is due to being fixed points of , the first inequality is due to being a contraction mapping, and the last inequality is due to and as .
Applications
- A standard application is the proof of the Picard–Lindelöf theorem about the existence and uniqueness of solutions to certain ordinary differential equations. The sought solution of the differential equation is expressed as a fixed point of a suitable integral operator on the space of continuous functions under the uniform norm. The Banach fixed-point theorem is then used to show that this integral operator has a unique fixed point.
- One consequence of the Banach fixed-point theorem is that small Lipschitz perturbations of the identity are bi-lipschitz homeomorphisms. Let be an open set of a Banach space ; let denote the identity (inclusion) map and let be a Lipschitz map of constant . Then
- is an open subset of : precisely, for any in such that one has ;
- is a bi-Lipschitz homeomorphism;
- precisely, is still of the form with a Lipschitz map of constant . A direct consequence of this result yields the proof of the inverse function theorem.
- It can be used to give sufficient conditions under which Newton's method of successive approximations is guaranteed to work, and similarly for Chebyshev's third-order method.
- It can be used to prove existence and uniqueness of solutions to integral equations.
- It can be used to give a proof to the Nash embedding theorem.[4]
- It can be used to prove existence and uniqueness of solutions to value iteration, policy iteration, and policy evaluation of reinforcement learning.[5]
- It can be used to prove existence and uniqueness of an equilibrium in Cournot competition,[6] and other dynamic economic models.[7]
Converses
Several converses of the Banach contraction principle exist. The following is due to Czesław Bessaga, from 1959:
Let be a map of an abstract set such that each iterate has a unique fixed point. Let , then there exists a complete metric on such that is contractive, and is the contraction constant.
Indeed, very weak assumptions suffice to obtain such a kind of converse. For example if is a map on a T1 topological space with a unique fixed point , such that for each we have , then there already exists a metric on with respect to which satisfies the conditions of the Banach contraction principle with contraction constant .[8] In this case the metric is in fact an ultrametric.
Generalizations
There are a number of generalizations (some of which are immediate corollaries).[9]
Let be a map on a complete non-empty metric space. Then, for example, some generalizations of the Banach fixed-point theorem are:
- Assume that some iterate of is a contraction. Then has a unique fixed point.
- Assume that for each , there exist such that for all and , and that
- Then has a unique fixed point.
In applications, the existence and uniqueness of a fixed point often can be shown directly with the standard Banach fixed point theorem, by a suitable choice of the metric that makes the map a contraction. Indeed, the above result by Bessaga strongly suggests to look for such a metric. See also the article on fixed point theorems in infinite-dimensional spaces for generalizations.
In a non-empty compact metric space, any function satisfying for all distinct , has a unique fixed point. The proof is simpler than the Banach theorem, because the function is continuous, and therefore assumes a minimum, which is easily shown to be zero.
A different class of generalizations arise from suitable generalizations of the notion of metric space, e.g. by weakening the defining axioms for the notion of metric.[10] Some of these have applications, e.g., in the theory of programming semantics in theoretical computer science.[11]
Example
An application of the Banach fixed-point theorem and fixed-point iteration can be used to quickly obtain an approximation of with high accuracy. Consider the function . It can be verified that is a fixed point of , and that maps the interval to itself. Moreover, , and it can be verified that
on this interval. Therefore, by an application of the mean value theorem, has a Lipschitz constant less than 1 (namely ). Applying the Banach fixed-point theorem shows that the fixed point is the unique fixed point on the interval, allowing for fixed-point iteration to be used.
For example, the value 3 may be chosen to start the fixed-point iteration, as . The Banach fixed-point theorem may be used to conclude that
Applying f to 3 only three times already yields an expansion of accurate to 33 digits:
See also
- Brouwer fixed-point theorem
- Caristi fixed-point theorem
- Contraction mapping
- Fichera's existence principle
- Fixed-point iteration
- Fixed-point theorems
- Infinite compositions of analytic functions
- Kantorovich theorem
Notes
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "Citation/CS1".
- ↑ Script error: No such module "Citation/CS1".
- ↑ Script error: No such module "Citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "Citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "Citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "citation/CS1".
- ↑ Script error: No such module "Citation/CS1".
References
- Script error: No such module "citation/CS1".
- Script error: No such module "citation/CS1".
- Script error: No such module "citation/CS1".
- Script error: No such module "citation/CS1". See chapter 7.
- Script error: No such module "citation/CS1".
This article incorporates material from Banach fixed point theorem on PlanetMath, which is licensed under the Creative Commons Attribution/Share-Alike License.