Functional derivative: Difference between revisions

Latest revision as of 03:49, 1 November 2025

In the calculus of variations, a field of mathematical analysis, the functional derivative (or variational derivative)^[1] relates a change in a functional (a functional in this sense is a function that acts on functions) to a change in a function on which the functional depends.

In the calculus of variations, functionals are usually expressed in terms of an integral of functions, their arguments, and their derivatives. In an integrand $L$ of a functional, if a function $f$ is varied by adding to it another function $δf$ that is arbitrarily small, and the resulting integrand is expanded in powers of $δf$, the coefficient of $δf$ in the first order term is called the functional derivative.

For example, consider the functional $J [f] = \int_{a}^{b} L (x, f (x), f^{'} (x)) d x,$ where $f'(x) \equiv df / dx$. If $f$ is varied by adding to it a function $δf$, and the resulting integrand $L (x, f + δf, f'+ δf')$ is expanded in powers of $δf$, then the change in the value of $J$ to first order in $δf$ can be expressed as follows:^[1]^{[Note 1]} $\begin{aligned} δ J & = \int_{a}^{b} \left( \frac{\partial L}{\partial f} δ f (x) + \frac{\partial L}{\partial f^{'}} \frac{d}{d x} δ f (x) \right) d x \\ & = \int_{a}^{b} \left( \frac{\partial L}{\partial f} - \frac{d}{d x} \frac{\partial L}{\partial f^{'}} \right) δ f (x) d x + \frac{\partial L}{\partial f^{'}} (b) δ f (b) - \frac{\partial L}{\partial f^{'}} (a) δ f (a) \end{aligned}$ where the variation in the derivative, $δf'$, was rewritten as the derivative of the variation $(δf)'$, and integration by parts was used in these derivatives.
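The first-order formula can be checked numerically. The sketch below is only an illustration under assumed choices that do not come from the text: the Lagrangian $L = f'^{2}/2 + f^{2}/2$ on $[0, 1]$, the function $f = \sin$, and a variation $η (x) = x (1 - x)$ that vanishes at both endpoints so that the boundary terms drop out. It compares a central difference of $J [f + ε η]$ in $ε$ with the integral of $(\partial L / \partial f - \frac{d}{d x} \partial L / \partial f') η$.

```python
import math

def integrate(g, a, b, n=2000):
    """Composite midpoint rule for the integral of g over [a, b]."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

# Illustrative choices (assumptions, not from the text):
# L = f'^2/2 + f^2/2 on [0, 1], f = sin, eta(0) = eta(1) = 0.
f, df = math.sin, math.cos
eta = lambda x: x * (1.0 - x)
deta = lambda x: 1.0 - 2.0 * x

def J(eps):
    """J[f + eps*eta] for the Lagrangian above."""
    return integrate(
        lambda x: 0.5 * (df(x) + eps * deta(x)) ** 2
        + 0.5 * (f(x) + eps * eta(x)) ** 2,
        0.0, 1.0,
    )

eps = 1e-4
first_variation_numeric = (J(eps) - J(-eps)) / (2.0 * eps)

# dL/df - d/dx dL/df' = f - f'' = 2 sin(x); boundary terms vanish.
first_variation_formula = integrate(
    lambda x: 2.0 * math.sin(x) * eta(x), 0.0, 1.0
)

print(first_variation_numeric, first_variation_formula)
```

The two printed numbers should agree up to the quadrature error of the midpoint rule.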

Definition

In this section, the functional differential (or variation or first variation)^{[Note 2]} is defined. Then the functional derivative is defined in terms of the functional differential.

Functional differential

Suppose $B$ is a Banach space and $F$ is a functional defined on $B$ . The differential of $F$ at a point $ρ \in B$ is the linear functional $δ F [ρ, \cdot]$ on $B$ defined^[2] by the condition that, for all $ϕ \in B$ , $F [ρ + ϕ] - F [ρ] = δ F [ρ; ϕ] + ε ‖ ϕ ‖$ where $ε$ is a real number that depends on $‖ ϕ ‖$ in such a way that $ε \to 0$ as $‖ ϕ ‖ \to 0$ . This means that $δ F [ρ, \cdot]$ is the Fréchet derivative of $F$ at $ρ$ .

However, this notion of functional differential is so strong it may not exist,^[3] and in those cases a weaker notion, like the Gateaux derivative, is preferred. In many practical cases, the functional differential is defined^[4] as the directional derivative $\begin{aligned} δ F [ρ, ϕ] & = \lim_{ε \to 0} \frac{F [ρ + ε ϕ] - F [ρ]}{ε} \\ & = \left[ \frac{d}{d ε} F [ρ + ε ϕ] \right]_{ε = 0} . \end{aligned}$ Note that this notion of the functional differential can even be defined without a norm.
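The directional-derivative definition translates directly into a finite-difference estimate. The following is a minimal sketch under assumed choices (the functional $F [ρ] = \int_{0}^{1} ρ^{3} d x$, the point $ρ = \exp$ and the direction $ϕ = \cos$ are hypothetical examples, and `gateaux` is a helper name introduced here): it approximates $\frac{d}{d ε} F [ρ + ε ϕ]$ at $ε = 0$ and compares it with the exact value $\int 3 ρ^{2} ϕ \, d x$.

```python
import math

def integrate(g, a=0.0, b=1.0, n=2000):
    """Composite midpoint rule for the integral of g over [a, b]."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def gateaux(F, rho, phi, eps=1e-4):
    """Central-difference estimate of d/d(eps) F[rho + eps*phi] at eps = 0."""
    plus = lambda x: rho(x) + eps * phi(x)
    minus = lambda x: rho(x) - eps * phi(x)
    return (F(plus) - F(minus)) / (2.0 * eps)

# Hypothetical example functional: F[rho] = integral of rho(x)^3 over [0, 1].
F = lambda rho: integrate(lambda x: rho(x) ** 3)
rho, phi = math.exp, math.cos

numeric = gateaux(F, rho, phi)
exact = integrate(lambda x: 3.0 * rho(x) ** 2 * phi(x))

# The differential is linear in the direction phi.
doubled = gateaux(F, rho, lambda x: 2.0 * phi(x))

print(numeric, exact, doubled)
```

Note that no norm on the function space is needed for this computation, only the ability to evaluate $F$ along the line $ρ + ε ϕ$.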

In a more general case the function space $B$ appearing as the domain of $F$ is not a vector space, and therefore variations of the form $ρ + ε ϕ$ do not make sense. In this case we consider a variation $α_{?} : (- ε_{0}, ε_{0}) \to B$ of $ρ$ to be a $C^{1}$ -family of functions such that $α_{0} = ρ$ .^{[Note 3]} Denoting the space of all such variations as $𝒱_{ρ}$ , the functional differential $δ F [ρ] : 𝒱_{ρ} \to ℝ$ is the functional $\begin{aligned} δ F [ρ; α] = δ F [ρ] [α] = \lim_{ϵ \to 0} \frac{F [α_{ϵ}] - F [ρ]}{ϵ} = F [α_{?}]^{'} (0) \end{aligned}$

where $F [α_{?}] (ϵ) = F [α_{ϵ}]$ . The directional-derivative definition above is then the special case $α_{ϵ} = ρ + ϵ ϕ$ .^[5]

Functional derivative

In many applications, the domain of the functional $F$ is a space of differentiable functions $ρ$ defined on some space $Ω$ and $F$ is of the form $F [ρ] = \int_{Ω} L (x, ρ (x), D ρ (x)) d x$ for some function $L (x, ρ (x), D ρ (x))$ that may depend on $x$ , the value $ρ (x)$ and the derivative $D ρ (x)$ . If this is the case and, moreover, $δ F [ρ, ϕ]$ can be written as the integral of $ϕ$ times another function (denoted $δF / δρ$) $δ F [ρ, ϕ] = \int_{Ω} \frac{δ F}{δ ρ} (x) ϕ (x) d x$ then this function $δF / δρ$ is called the functional derivative of $F$ at $ρ$.^[6]^[7] If $F$ is restricted to only certain functions $ρ$ (for example, if there are some boundary conditions imposed) then $ϕ$ is restricted to functions such that $ρ + ε ϕ$ continues to satisfy these conditions.

Heuristically, $ϕ$ is the change in $ρ$ , so we 'formally' have $ϕ = δ ρ$ , and then this is similar in form to the total differential of a function $F (ρ_{1}, ρ_{2}, \dots, ρ_{n})$ , $d F = \sum_{i = 1}^{n} \frac{\partial F}{\partial ρ_{i}} d ρ_{i},$ where $ρ_{1}, ρ_{2}, \dots, ρ_{n}$ are independent variables. Comparing the last two equations, the functional derivative $δ F / δ ρ (x)$ has a role similar to that of the partial derivative $\partial F / \partial ρ_{i}$ , where the variable of integration $x$ is like a continuous version of the summation index $i$ .^[8] One thinks of $δF / δρ$ as the gradient of $F$ at the point $ρ$, so the value $δF / δρ(x)$ measures how much the functional $F$ will change if the function $ρ$ is changed at the point $x$. Hence the formula $\int \frac{δ F}{δ ρ} (x) ϕ (x) d x$ is regarded as the directional derivative at point $ρ$ in the direction of $ϕ$ . This is analogous to vector calculus, where the inner product of a vector $v$ with the gradient gives the directional derivative in the direction of $v$ .

Properties

Like the derivative of a function, the functional derivative satisfies the following properties, where $F [ρ]$ and $G [ρ]$ are functionals:^{[Note 4]}

Linearity:^[9] $\frac{δ (λ F + μ G) [ρ]}{δ ρ (x)} = λ \frac{δ F [ρ]}{δ ρ (x)} + μ \frac{δ G [ρ]}{δ ρ (x)},$ where $λ, μ$ are constants.
Product rule:^[10] $\frac{δ (F G) [ρ]}{δ ρ (x)} = \frac{δ F [ρ]}{δ ρ (x)} G [ρ] + F [ρ] \frac{δ G [ρ]}{δ ρ (x)},$
Chain rules:
- If $F$ is a functional and $G$ another functional, then^[11] $\frac{δ F [G [ρ]]}{δ ρ (y)} = \int d x {\frac{δ F [G]}{δ G (x)}}_{G = G [ρ]} \cdot \frac{δ G [ρ] (x)}{δ ρ (y)} .$
- If $G$ is an ordinary differentiable function (local functional) $g$, then this reduces to^[12] $\frac{δ F [g (ρ)]}{δ ρ (y)} = \frac{δ F [g (ρ)]}{δ g [ρ (y)]} \frac{d g (ρ)}{d ρ (y)} .$
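As a small worked illustration of the product rule, consider two functionals chosen only for this example (they do not appear in the text): $F [ρ] = \int ρ (x') d x'$ and $G [ρ] = \int ρ (x')^{2} d x'$, for which $δF / δρ(x) = 1$ and $δG / δρ(x) = 2 ρ (x)$. The product rule then gives:

```latex
\frac{\delta (F G)[\rho]}{\delta \rho(x)}
  = 1 \cdot G[\rho] + F[\rho] \cdot 2\rho(x)
  = \int \rho(x')^{2} \, dx' + 2 \rho(x) \int \rho(x') \, dx' .
```

The same expression follows from differentiating $F [ρ + ε ϕ] \, G [ρ + ε ϕ]$ with respect to $ε$ at $ε = 0$ and reading off the coefficient of $ϕ (x)$ under the integral.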

Determining functional derivatives

For a common class of functionals, namely those that can be written as the integral of a function and its derivatives, the functional derivative can be determined from a formula. This is a generalization of the Euler–Lagrange equation: indeed, the functional derivative was introduced in physics within the derivation of the Lagrange equation of the second kind from the principle of least action in Lagrangian mechanics (18th century). The first three examples below are taken from density functional theory (20th century), the fourth from statistical mechanics (19th century).

Formula

Given a functional $F [ρ] = \int f (r, ρ (r), \nabla ρ (r)) d r,$ and a function $ϕ (r)$ that vanishes on the boundary of the region of integration, the definition given in the section Definition yields $\begin{aligned} \int \frac{δ F}{δ ρ (r)} ϕ (r) d r & = \left[ \frac{d}{d ε} \int f (r, ρ + ε ϕ, \nabla ρ + ε \nabla ϕ) d r \right]_{ε = 0} \\ & = \int \left( \frac{\partial f}{\partial ρ} ϕ + \frac{\partial f}{\partial \nabla ρ} \cdot \nabla ϕ \right) d r \\ & = \int \left[ \frac{\partial f}{\partial ρ} ϕ + \nabla \cdot \left( \frac{\partial f}{\partial \nabla ρ} ϕ \right) - \left( \nabla \cdot \frac{\partial f}{\partial \nabla ρ} \right) ϕ \right] d r \\ & = \int \left[ \frac{\partial f}{\partial ρ} ϕ - \left( \nabla \cdot \frac{\partial f}{\partial \nabla ρ} \right) ϕ \right] d r \\ & = \int \left( \frac{\partial f}{\partial ρ} - \nabla \cdot \frac{\partial f}{\partial \nabla ρ} \right) ϕ (r) d r . \end{aligned}$

The second line is obtained using the total derivative, where $\partial f / \partial \nabla ρ$ is a derivative of a scalar with respect to a vector.^{[Note 5]}

The third line was obtained by use of a product rule for divergence. The fourth line was obtained using the divergence theorem and the condition that $ϕ = 0$ on the boundary of the region of integration. Since $ϕ$ is also an arbitrary function, applying the fundamental lemma of calculus of variations to the last line, the functional derivative is $\frac{δ F}{δ ρ (r)} = \frac{\partial f}{\partial ρ} - \nabla \cdot \frac{\partial f}{\partial \nabla ρ}$

where $ρ = ρ (r)$ and $f = f (r, ρ, \nabla ρ)$. This formula is for the case of the functional form given by $F [ρ]$ at the beginning of this section. For other functional forms, the definition of the functional derivative can be used as the starting point for its determination. (See the example Coulomb potential energy functional.)
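To see the formula at work on a simple case, take the integrand $f = \frac{1}{2} \nabla ρ \cdot \nabla ρ + v (r) ρ$, where $v (r)$ is a fixed function. Both the integrand and the symbol $v$ are assumptions made for this illustration and are not taken from the text.

```latex
\frac{\partial f}{\partial \rho} = v(r), \qquad
\frac{\partial f}{\partial \nabla \rho} = \nabla \rho
\quad \Longrightarrow \quad
\frac{\delta F}{\delta \rho(r)} = v(r) - \nabla \cdot \nabla \rho = v(r) - \nabla^{2} \rho .
```

Setting this functional derivative to zero gives the Poisson-type equation $\nabla^{2} ρ = v$, which is the Euler–Lagrange equation of this particular functional.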

The above equation for the functional derivative can be generalized to the case that includes higher dimensions and higher order derivatives. The functional would be, $F [ρ (r)] = \int f (r, ρ (r), \nabla ρ (r), \nabla^{(2)} ρ (r), \dots, \nabla^{(N)} ρ (r)) d r,$

where the vector $r \in ℝ^{n}$, and $\nabla^{(i)}$ is a tensor whose $n^{i}$ components are partial derivative operators of order $i$, ${[\nabla^{(i)}]}_{α_{1} α_{2} \dots α_{i}} = \frac{\partial^{i}}{\partial r_{α_{1}} \partial r_{α_{2}} \dots \partial r_{α_{i}}}$ where $α_{1}, α_{2}, \dots, α_{i} = 1, 2, \dots, n$ .^{[Note 6]}

An analogous application of the definition of the functional derivative yields $\begin{aligned} \frac{δ F [ρ]}{δ ρ} & = \frac{\partial f}{\partial ρ} - \nabla \cdot \frac{\partial f}{\partial (\nabla ρ)} + \nabla^{(2)} \cdot \frac{\partial f}{\partial (\nabla^{(2)} ρ)} + \dots + (- 1)^{N} \nabla^{(N)} \cdot \frac{\partial f}{\partial (\nabla^{(N)} ρ)} \\ & = \frac{\partial f}{\partial ρ} + \sum_{i = 1}^{N} (- 1)^{i} \nabla^{(i)} \cdot \frac{\partial f}{\partial (\nabla^{(i)} ρ)} . \end{aligned}$

In the last two equations, the $n^{i}$ components of the tensor $\frac{\partial f}{\partial (\nabla^{(i)} ρ)}$ are partial derivatives of $f$ with respect to partial derivatives of $ρ$, ${[\frac{\partial f}{\partial (\nabla^{(i)} ρ)}]}_{α_{1} α_{2} \dots α_{i}} = \frac{\partial f}{\partial ρ_{α_{1} α_{2} \dots α_{i}}}$ where $ρ_{α_{1} α_{2} \dots α_{i}} \equiv \frac{\partial^{i} ρ}{\partial r_{α_{1}} \partial r_{α_{2}} \dots \partial r_{α_{i}}}$ , and the tensor scalar product is, $\nabla^{(i)} \cdot \frac{\partial f}{\partial (\nabla^{(i)} ρ)} = \sum_{α_{1}, α_{2}, \dots, α_{i} = 1}^{n} \frac{\partial^{i}}{\partial r_{α_{1}} \partial r_{α_{2}} \dots \partial r_{α_{i}}} \frac{\partial f}{\partial ρ_{α_{1} α_{2} \dots α_{i}}} .$ ^{[Note 7]}
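A one-dimensional case ($n = 1$, $N = 2$) makes the alternating signs concrete. The integrand below is an assumption chosen for illustration: $f = \frac{1}{2} (ρ'')^{2}$, with variations assumed to vanish together with their first derivatives at the boundary.

```latex
\frac{\partial f}{\partial \rho} = 0, \qquad
\frac{\partial f}{\partial \rho'} = 0, \qquad
\frac{\partial f}{\partial \rho''} = \rho''
\quad \Longrightarrow \quad
\frac{\delta F}{\delta \rho}
  = (-1)^{2} \frac{d^{2}}{dx^{2}} \frac{\partial f}{\partial \rho''}
  = \rho'''' .
```

Each integration by parts contributes one factor of $- 1$, so a term involving derivatives of order $i$ enters with the sign $(- 1)^{i}$.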

Examples

Thomas–Fermi kinetic energy functional

The Thomas–Fermi model of 1927 used a kinetic energy functional for a noninteracting uniform electron gas in a first attempt at a density-functional theory of electronic structure: $T_{T F} [ρ] = C_{F} \int ρ^{5 / 3} (𝐫) d 𝐫 .$ Since the integrand of $T_{T F} [ρ]$ does not involve derivatives of $ρ (𝐫)$, the functional derivative of $T_{T F} [ρ]$ is,^[13] $\frac{δ T_{T F}}{δ ρ (𝐫)} = C_{F} \frac{\partial ρ^{5 / 3} (𝐫)}{\partial ρ (𝐫)} = \frac{5}{3} C_{F} ρ^{2 / 3} (𝐫) .$
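On a grid, the functional derivative corresponds to an ordinary partial derivative divided by the cell size. The sketch below illustrates this for the Thomas–Fermi functional in a deliberately simplified one-dimensional setting (the grid, the density $ρ (x) = e^{- x}$ and the function names are assumptions for illustration; the value used for $C_{F}$ is the standard atomic-units constant, which the text does not state).

```python
import math

# Standard Thomas-Fermi constant in atomic units (not stated in the text).
C_F = 0.3 * (3.0 * math.pi ** 2) ** (2.0 / 3.0)

# Illustrative 1D grid and density (assumptions; the physical functional is 3D).
n, length = 100, 5.0
h = length / n
xs = [(i + 0.5) * h for i in range(n)]
rho = [math.exp(-x) for x in xs]

def T_TF(values):
    """Discretised functional: C_F * sum_i rho_i^(5/3) * h."""
    return C_F * h * sum(v ** (5.0 / 3.0) for v in values)

def functional_derivative(j, delta=1e-4):
    """Partial derivative with respect to rho_j, divided by the cell size h."""
    up = list(rho)
    up[j] += delta
    down = list(rho)
    down[j] -= delta
    return (T_TF(up) - T_TF(down)) / (2.0 * delta * h)

j = 10
numeric = functional_derivative(j)
exact = (5.0 / 3.0) * C_F * rho[j] ** (2.0 / 3.0)
print(numeric, exact)
```

Dividing by `h` is what turns the summation index into the continuous variable of integration, in line with the analogy between $δF / δρ(x)$ and $\partial F / \partial ρ_{i}$.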

Coulomb potential energy functional

The electron-nucleus potential energy is $V [ρ] = \int \frac{ρ (r)}{| r |} d r .$

Applying the definition of functional derivative, $\begin{aligned} \int \frac{δ V}{δ ρ (r)} ϕ (r) d r & = \left[ \frac{d}{d ε} \int \frac{ρ (r) + ε ϕ (r)}{| r |} d r \right]_{ε = 0} \\ & = \int \frac{ϕ (r)}{| r |} d r . \end{aligned}$ So, $\frac{δ V}{δ ρ (r)} = \frac{1}{| r |} .$

The classical part of the electron-electron interaction (often called the Hartree energy) is $J [ρ] = \frac{1}{2} \iint \frac{ρ (𝐫) ρ (𝐫^{'})}{| 𝐫 - 𝐫^{'} |} d 𝐫 d 𝐫^{'} .$ From the definition of the functional derivative, $\begin{aligned} \int \frac{δ J}{δ ρ (r)} ϕ (r) d r & = \left[ \frac{d}{d ε} J [ρ + ε ϕ] \right]_{ε = 0} \\ & = \left[ \frac{d}{d ε} \left( \frac{1}{2} \iint \frac{[ρ (r) + ε ϕ (r)] [ρ (r^{'}) + ε ϕ (r^{'})]}{| r - r^{'} |} d r d r^{'} \right) \right]_{ε = 0} \\ & = \frac{1}{2} \iint \frac{ρ (r^{'}) ϕ (r)}{| r - r^{'} |} d r d r^{'} + \frac{1}{2} \iint \frac{ρ (r) ϕ (r^{'})}{| r - r^{'} |} d r d r^{'} \end{aligned}$ The first and second terms on the right hand side of the last equation are equal, since $r$ and $r'$ in the second term can be interchanged without changing the value of the integral. Therefore, $\int \frac{δ J}{δ ρ (r)} ϕ (r) d r = \int \left( \int \frac{ρ (r^{'})}{| r - r^{'} |} d r^{'} \right) ϕ (r) d r$ and the functional derivative of the electron-electron Coulomb potential energy functional $J [ρ]$ is,^[14] $\frac{δ J}{δ ρ (r)} = \int \frac{ρ (r^{'})}{| r - r^{'} |} d r^{'} .$

The second functional derivative is $\frac{δ^{2} J [ρ]}{δ ρ (𝐫^{'}) δ ρ (𝐫)} = \frac{\partial}{\partial ρ (𝐫^{'})} (\frac{ρ (𝐫^{'})}{| 𝐫 - 𝐫^{'} |}) = \frac{1}{| 𝐫 - 𝐫^{'} |} .$
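Both the first and the second functional derivative of a Hartree-type functional can be checked on a grid. The following sketch makes several simplifying assumptions that are not in the text: it works in one dimension, replaces the singular kernel $1 / | 𝐫 - 𝐫^{'} |$ by a soft-Coulomb kernel $1 / \sqrt{(x - x')^{2} + a^{2}}$, and uses a Gaussian density. Because the functional is quadratic in $ρ$, the finite differences are exact up to rounding.

```python
import math

# Illustrative setup (assumptions): 1D grid, Gaussian density, and a
# soft-Coulomb kernel standing in for the singular 1/|r - r'|.
n, length, a = 40, 8.0, 1.0
h = length / n
xs = [-length / 2 + (i + 0.5) * h for i in range(n)]
rho = [math.exp(-x * x) for x in xs]
K = [[1.0 / math.sqrt((x - y) ** 2 + a * a) for y in xs] for x in xs]

def J(values):
    """Discretised J = (1/2) * sum_ij rho_i K_ij rho_j * h^2."""
    return 0.5 * h * h * sum(
        values[i] * K[i][j] * values[j] for i in range(n) for j in range(n)
    )

def shifted(k, dk, l=None, dl=0.0):
    """Copy of rho with rho_k increased by dk (and rho_l by dl, if given)."""
    out = list(rho)
    out[k] += dk
    if l is not None:
        out[l] += dl
    return out

def first_derivative(k, d=1e-2):
    """Central difference of J in rho_k, divided by the cell size h."""
    return (J(shifted(k, d)) - J(shifted(k, -d))) / (2.0 * d * h)

def second_derivative(k, l, d=1e-2):
    """Four-point mixed difference of J in rho_k and rho_l, divided by h^2."""
    return (
        J(shifted(k, d, l, d)) - J(shifted(k, d, l, -d))
        - J(shifted(k, -d, l, d)) + J(shifted(k, -d, l, -d))
    ) / (4.0 * d * d * h * h)

k, l = 15, 22
potential = h * sum(K[k][j] * rho[j] for j in range(n))  # integral of rho * kernel
print(first_derivative(k), potential)
print(second_derivative(k, l), K[k][l])
```

The first printed pair compares the numerical derivative with the discretised integral $\int ρ (x') K (x, x') d x'$; the second pair shows that the mixed second derivative reproduces the kernel itself.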

von Weizsäcker kinetic energy functional

In 1935 von Weizsäcker proposed to add a gradient correction to the Thomas–Fermi kinetic energy functional to make it better suit a molecular electron cloud: $T_{W} [ρ] = \frac{1}{8} \int \frac{\nabla ρ (𝐫) \cdot \nabla ρ (𝐫)}{ρ (𝐫)} d 𝐫 = \int t_{W} (𝐫) d 𝐫,$ where $t_{W} \equiv \frac{1}{8} \frac{\nabla ρ \cdot \nabla ρ}{ρ}$ and $ρ = ρ (𝐫)$. Using a previously derived formula for the functional derivative, $\begin{aligned} \frac{δ T_{W}}{δ ρ} & = \frac{\partial t_{W}}{\partial ρ} - \nabla \cdot \frac{\partial t_{W}}{\partial \nabla ρ} \\ & = - \frac{1}{8} \frac{\nabla ρ \cdot \nabla ρ}{ρ^{2}} - \left( \frac{1}{4} \frac{\nabla^{2} ρ}{ρ} - \frac{1}{4} \frac{\nabla ρ \cdot \nabla ρ}{ρ^{2}} \right) \end{aligned}$ where $\nabla^{2} = \nabla \cdot \nabla$, and the result is,^[15] $\frac{δ T_{W}}{δ ρ} = \frac{1}{8} \frac{\nabla ρ \cdot \nabla ρ}{ρ^{2}} - \frac{1}{4} \frac{\nabla^{2} ρ}{ρ} .$
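The result can be sanity-checked numerically in one dimension. The sketch below is an illustration under assumed choices not found in the text: a Gaussian density $ρ (x) = e^{- x^{2}}$, a test function $ϕ (x) = e^{- 2 x^{2}}$ that is negligible at the ends of the interval $[- 6, 6]$, and analytically supplied derivatives. It compares a central difference of $T_{W} [ρ + ε ϕ]$ with the integral of the derived functional derivative against $ϕ$.

```python
import math

def integrate(g, a=-6.0, b=6.0, n=4000):
    """Composite midpoint rule for the integral of g over [a, b]."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

# Illustrative 1D density and test function (assumptions, not from the text).
rho = lambda x: math.exp(-x * x)
drho = lambda x: -2.0 * x * math.exp(-x * x)
d2rho = lambda x: (4.0 * x * x - 2.0) * math.exp(-x * x)
phi = lambda x: math.exp(-2.0 * x * x)
dphi = lambda x: -4.0 * x * math.exp(-2.0 * x * x)

def T_W(eps):
    """T_W[rho + eps*phi] = (1/8) * integral of (rho')^2 / rho."""
    return integrate(
        lambda x: 0.125 * (drho(x) + eps * dphi(x)) ** 2
        / (rho(x) + eps * phi(x))
    )

def dTW_drho(x):
    """Derived result: (1/8) (rho'/rho)^2 - (1/4) rho''/rho."""
    return 0.125 * (drho(x) / rho(x)) ** 2 - 0.25 * d2rho(x) / rho(x)

eps = 1e-4
numeric = (T_W(eps) - T_W(-eps)) / (2.0 * eps)
formula = integrate(lambda x: dTW_drho(x) * phi(x))
print(numeric, formula)
```

For this Gaussian density the functional derivative simplifies to $\frac{1}{2} - \frac{1}{2} x^{2}$, which gives a quick way to cross-check `dTW_drho` by hand.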

Entropy

The entropy of a discrete random variable is a functional of the probability mass function.

$H [p (x)] = - \sum_{x} p (x) \log p (x)$ Thus, $\begin{aligned} \sum_{x} \frac{δ H}{δ p (x)} ϕ (x) & = \left[ \frac{d}{d ε} H [p (x) + ε ϕ (x)] \right]_{ε = 0} \\ & = \left[ - \frac{d}{d ε} \sum_{x} [p (x) + ε ϕ (x)] \log [p (x) + ε ϕ (x)] \right]_{ε = 0} \\ & = - \sum_{x} [1 + \log p (x)] ϕ (x) . \end{aligned}$ Thus, $\frac{δ H}{δ p (x)} = - 1 - \log p (x) .$
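In the discrete case the functional derivative is an ordinary partial derivative, so the result is easy to verify. The sketch below uses an arbitrary example distribution (an assumption for illustration) and, like the derivation above, varies each $p (x)$ independently without imposing the normalization constraint.

```python
import math

def entropy(p):
    """H[p] = -sum_x p(x) log p(x), natural logarithm."""
    return -sum(q * math.log(q) for q in p)

def partial(p, k, delta=1e-6):
    """Central-difference partial derivative of H with respect to p[k]."""
    up = list(p)
    up[k] += delta
    down = list(p)
    down[k] -= delta
    return (entropy(up) - entropy(down)) / (2.0 * delta)

p = [0.1, 0.2, 0.3, 0.4]  # example probability mass function
numeric = [partial(p, k) for k in range(len(p))]
exact = [-1.0 - math.log(q) for q in p]
print(numeric)
print(exact)
```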

Exponential

Let $F [φ (x)] = e^{\int φ (x) g (x) d x} .$

Using the delta function as a test function, $\begin{aligned} \frac{δ F [φ (x)]}{δ φ (y)} & = \lim_{ε \to 0} \frac{F [φ (x) + ε δ (x - y)] - F [φ (x)]}{ε} \\ & = \lim_{ε \to 0} \frac{e^{\int (φ (x) + ε δ (x - y)) g (x) d x} - e^{\int φ (x) g (x) d x}}{ε} \\ & = e^{\int φ (x) g (x) d x} \lim_{ε \to 0} \frac{e^{ε \int δ (x - y) g (x) d x} - 1}{ε} \\ & = e^{\int φ (x) g (x) d x} \lim_{ε \to 0} \frac{e^{ε g (y)} - 1}{ε} \\ & = e^{\int φ (x) g (x) d x} g (y) . \end{aligned}$

Thus, $\frac{δ F [φ (x)]}{δ φ (y)} = g (y) F [φ (x)] .$
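A discretised version of this computation replaces the delta function by a spike of height $1 / h$ in a single grid cell of width $h$. The sketch below is an illustration only; the grid and the choices $g (x) = \cos x$ and $φ (x) = x^{2}$ are assumptions made for the example.

```python
import math

# Illustrative grid and functions (assumptions, not from the text).
n, a, b = 200, 0.0, 1.0
h = (b - a) / n
xs = [a + (i + 0.5) * h for i in range(n)]
g = [math.cos(x) for x in xs]
phi = [x * x for x in xs]

def F(values):
    """Discretised F[phi] = exp( sum_i phi_i g_i h )."""
    return math.exp(h * sum(p * gv for p, gv in zip(values, g)))

def functional_derivative(j, eps=1e-6):
    """Perturb phi by eps times a grid delta (height 1/h in cell j)."""
    up = list(phi)
    up[j] += eps / h
    down = list(phi)
    down[j] -= eps / h
    return (F(up) - F(down)) / (2.0 * eps)

j = 50
numeric = functional_derivative(j)
exact = g[j] * F(phi)
print(numeric, exact)
```

On the grid the spike is a perfectly ordinary perturbation, which is why the delta-function argument gives the right answer here even though it is only formal in the continuum.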

This is particularly useful in calculating the correlation functions from the partition function in quantum field theory.

Functional derivative of a function

A function can be written in the form of an integral like a functional. For example, $ρ (r) = F [ρ] = \int ρ (r^{'}) δ (r - r^{'}) d r^{'} .$ Since the integrand does not depend on derivatives of $ρ$, the functional derivative of $ρ (r)$ is, $\frac{δ ρ (r)}{δ ρ (r^{'})} \equiv \frac{δ F}{δ ρ (r^{'})} = \frac{\partial}{\partial ρ (r^{'})} [ρ (r^{'}) δ (r - r^{'})] = δ (r - r^{'}) .$

Functional derivative of iterated function

The functional derivative of the iterated function $f (f (x))$ is given by: $\frac{δ f (f (x))}{δ f (y)} = f^{'} (f (x)) δ (x - y) + δ (f (x) - y)$ and $\frac{δ f (f (f (x)))}{δ f (y)} = f^{'} (f (f (x))) \left( f^{'} (f (x)) δ (x - y) + δ (f (x) - y) \right) + δ (f (f (x)) - y)$

In general: $\frac{δ f^{N} (x)}{δ f (y)} = f^{'} (f^{N - 1} (x)) \frac{δ f^{N - 1} (x)}{δ f (y)} + δ (f^{N - 1} (x) - y)$

Putting in $N = 0$ gives: $\frac{δ f^{- 1} (x)}{δ f (y)} = - \frac{δ (f^{- 1} (x) - y)}{f^{'} (f^{- 1} (x))}$
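Integrating the general recurrence against a smooth test function $ϕ (y)$ turns every delta function into an evaluation of $ϕ$, which gives something that can be checked numerically. In the sketch below, the choices $f = \sin$, $ϕ = \cos$, the point $x = 0.7$ and the helper names are assumptions for illustration; the recurrence becomes $D_{N} = f' (f^{N - 1} (x)) D_{N - 1} + ϕ (f^{N - 1} (x))$ with $D_{0} = 0$, and is compared with a finite difference of the $N$-fold iterate of $f + ε ϕ$.

```python
import math

# Illustrative choices (assumptions): f = sin, test function phi = cos.
f, df = math.sin, math.cos
phi = math.cos

def iterate(func, x, n):
    """Apply func to x a total of n times."""
    for _ in range(n):
        x = func(x)
    return x

def directional_derivative(x, n):
    """Recurrence from the text, integrated against phi(y):
    D_N = f'(f^{N-1}(x)) * D_{N-1} + phi(f^{N-1}(x)),  D_0 = 0."""
    d = 0.0
    y = x  # y holds f^{k-1}(x) at the start of step k
    for _ in range(n):
        d = df(y) * d + phi(y)
        y = f(y)
    return d

def finite_difference(x, n, eps=1e-6):
    """Central difference of the n-fold iterate of f + eps*phi in eps."""
    plus = lambda t: f(t) + eps * phi(t)
    minus = lambda t: f(t) - eps * phi(t)
    return (iterate(plus, x, n) - iterate(minus, x, n)) / (2.0 * eps)

x = 0.7
for n_iter in (1, 2, 3):
    print(n_iter, directional_derivative(x, n_iter), finite_difference(x, n_iter))
```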

Using the delta function as a test function

In physics, it is common to use the Dirac delta function $δ (x - y)$ in place of a generic test function $ϕ (x)$ to obtain the functional derivative at the point $y$ (this is the value of the whole functional derivative at one point, just as a partial derivative is one component of the gradient):^[16] $\frac{δ F [ρ (x)]}{δ ρ (y)} = \lim_{ε \to 0} \frac{F [ρ (x) + ε δ (x - y)] - F [ρ (x)]}{ε} .$

This works in cases when $F [ρ (x) + ε f (x)]$ formally can be expanded as a series (or at least up to first order) in $ε$ . The formula is however not mathematically rigorous, since $F [ρ (x) + ε δ (x - y)]$ is usually not even defined.
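The lack of rigor can be made visible by replacing the delta function with a narrow Gaussian of width $σ$. The sketch below uses assumed choices for illustration ($F [ρ] = \int_{0}^{2} ρ^{2} d x$, $ρ = \sin$, $y = 1$, and the helper names are hypothetical). The term linear in $ε$ approaches $2 ρ (y)$ as $σ$ shrinks, while the coefficient of $ε^{2}$, which equals $\int δ_{σ}^{2} d x = 1 / (2 σ \sqrt{π})$, grows without bound, so the limit $ε \to 0$ has to be taken before $σ \to 0$.

```python
import math

def integrate(g, a=0.0, b=2.0, n=4000):
    """Composite midpoint rule for the integral of g over [a, b]."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def nascent_delta(x, y, sigma):
    """Normalised Gaussian of width sigma centred at y."""
    return math.exp(-0.5 * ((x - y) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))

# Hypothetical example functional: F[rho] = integral of rho(x)^2 over [0, 2].
rho = math.sin
F = lambda func: integrate(lambda x: func(x) ** 2)

def derivative_at(y, sigma, eps=1e-3):
    """Central difference of F[rho + eps * delta_sigma(. - y)] in eps."""
    plus = lambda x: rho(x) + eps * nascent_delta(x, y, sigma)
    minus = lambda x: rho(x) - eps * nascent_delta(x, y, sigma)
    return (F(plus) - F(minus)) / (2.0 * eps)

y = 1.0
exact = 2.0 * rho(y)  # delta F / delta rho(y) = 2 rho(y)
wide = derivative_at(y, sigma=0.05)
narrow = derivative_at(y, sigma=0.02)

# Coefficient of eps^2 in F[rho + eps*delta_sigma]: integral of delta_sigma^2.
self_term = integrate(lambda x: nascent_delta(x, y, 0.05) ** 2)
print(exact, wide, narrow, self_term)
```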

The definition given in a previous section is based on a relationship that holds for all test functions $ϕ (x)$ , so one might think that it should hold also when $ϕ (x)$ is chosen to be a specific function such as the delta function. However, the latter is not a valid test function (it is not even a proper function).

In the definition, the functional derivative describes how the functional $F [ρ (x)]$ changes as a result of a small change in the entire function $ρ (x)$ . The particular form of the change in $ρ (x)$ is not specified, but it should stretch over the whole interval on which $x$ is defined. Employing the particular form of the perturbation given by the delta function means that $ρ (x)$ is varied only at the point $y$ . Except for this point, there is no variation in $ρ (x)$ .

Notes

↑ According to Giaquinta & Hildebrandt (1996), p. 18, this notation is customary in physical literature.
↑ Called first variation in (Giaquinta & Hildebrandt 1996, p. 3), variation or first variation in (Courant & Hilbert 1953, p. 186), variation or differential in (Gelfand & Fomin 2000, p. 11, § 3.2) and differential in (Parr & Yang 1989, p. 246).
↑ cf. homotopy.
↑ Here the notation $\frac{δ F}{δ ρ} (x) \equiv \frac{δ F}{δ ρ (x)}$ is introduced.
↑ For a three-dimensional Cartesian coordinate system, $\frac{\partial f}{\partial \nabla ρ} = \frac{\partial f}{\partial ρ_{x}} \hat{i} + \frac{\partial f}{\partial ρ_{y}} \hat{j} + \frac{\partial f}{\partial ρ_{z}} \hat{k},$ where $ρ_{x} = \frac{\partial ρ}{\partial x}, ρ_{y} = \frac{\partial ρ}{\partial y}, ρ_{z} = \frac{\partial ρ}{\partial z}$ and $\hat{i}$ , $\hat{j}$ , $\hat{k}$ are unit vectors along the x, y, z axes.
↑ For example, for the case of three dimensions ($n = 3$) and second order derivatives ($i = 2$), the tensor $\nabla^{(2)}$ has components, ${[\nabla^{(2)}]}_{α β} = \frac{\partial^{2}}{\partial r_{α} \partial r_{β}}$ where $α$ and $β$ can be $1, 2, 3$ .
↑ For example, for the case $n = 3$ and $i = 2$, the tensor scalar product is, $\nabla^{(2)} \cdot \frac{\partial f}{\partial (\nabla^{(2)} ρ)} = \sum_{α, β = 1}^{3} \frac{\partial^{2}}{\partial r_{α} \partial r_{β}} \frac{\partial f}{\partial ρ_{α β}},$ where $ρ_{α β} \equiv \frac{\partial^{2} ρ}{\partial r_{α} \partial r_{β}}$ .


Footnotes

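↑ ^a ^b Giaquinta & Hildebrandt (1996), p. 18.
↑ Gelfand & Fomin (2000), p. 11.
↑ Giaquinta & Hildebrandt (1996), p. 180.
↑ Giaquinta & Hildebrandt (1996), p. 3.
↑ Terek, Ivo (2019-06-12). "Introductory Variational Calculus on Manifolds" (PDF). https://web.williams.edu/Mathematics/it3/texts/var_noether.pdf. Retrieved 2025-11-01.
↑ Parr & Yang (1989), p. 246, Eq. A.2.
↑ Greiner & Reinhardt (1996), pp. 36, 37.
↑ Parr & Yang (1989), p. 246.
↑ Parr & Yang (1989), p. 247, Eq. A.3.
↑ Parr & Yang (1989), p. 247, Eq. A.4.
↑ Greiner & Reinhardt (1996), p. 38, Eq. 6.
↑ Template:Harvp.
↑ Parr & Yang (1989), p. 247, Eq. A.6.
↑ Parr & Yang (1989), p. 248, Eq. A.11.
↑ Parr & Yang (1989), p. 247, Eq. A.9.
↑ Template:Harvp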





@@ Line 6: / Line 6: @@
 For example, consider the functional
 <math display="block"> J[f] = \int_a^b L( \, x, f(x), f'{(x)} \, ) \, dx \, , </math>
-where {{math|''f'' &prime;(''x'') &equiv; ''df''/''dx''}}. If {{math|''f''}} is varied by adding to it a function {{math|''δf''}}, and the resulting integrand {{math|''L''(''x'', ''f'' +''δf'', ''f'' &prime;+''δf'' &prime;)}} is expanded in powers of {{math|''δf''}}, then the change in the value of {{math|''J''}} to first order in {{math|''δf''}} can be expressed as follows:<ref name="GiaquintaHildebrandtP18" /><ref Group = 'Note'>According to {{Harvp|Giaquinta|Hildebrandt|1996|p=18}}, this notation is customary in [[Physics|physical]] literature.</ref>
+where {{math|''f'' &prime;(''x'') &equiv; ''df''/''dx''}}. If {{math|''f''}} is varied by adding to it a function {{math|''δf''}}, and the resulting integrand {{math|''L''(''x'', ''f'' +''δf'', ''f'' &prime;+''δf'' &prime;)}} is expanded in powers of {{math|''δf''}}, then the change in the value of {{math|''J''}} to first order in {{math|''δf''}} can be expressed as follows:<ref name="GiaquintaHildebrandtP18" /><ref group="Note">According to {{Harvp|Giaquinta|Hildebrandt|1996|p=18}}, this notation is customary in [[Physics|physical]] literature.</ref>
 <math display="block">\begin{align}
 \delta J &= \int_a^b \left( \frac{\partial L}{\partial f} \delta f(x) + \frac{\partial L}{\partial f'} \frac{d}{dx} \delta f(x) \right) \, dx \, \\[1ex]
@@ Line 15: / Line 15: @@
 ==Definition==
-In this section, the functional differential (or variation or first variation)<Ref Group = 'Note'> Called ''first variation'' in {{harv|Giaquinta|Hildebrandt|1996|p=3}}, ''variation'' or ''first variation'' in {{harv|Courant|Hilbert|1953|p=186}}, ''variation'' or ''differential'' in {{harv|Gelfand|Fomin|2000|loc= p. 11, § 3.2}} and ''differential'' in {{harv|Parr|Yang|1989|p=246}}.</ref> is defined. Then the functional derivative is defined in terms of the functional differential.
+In this section, the functional differential (or variation or first variation)<ref group="Note">Called ''first variation'' in {{harv|Giaquinta|Hildebrandt|1996|p=3}}, ''variation'' or ''first variation'' in {{harv|Courant|Hilbert|1953|p=186}}, ''variation'' or ''differential'' in {{harv|Gelfand|Fomin|2000|loc= p. 11, § 3.2}} and ''differential'' in {{harv|Parr|Yang|1989|p=246}}.</ref> is defined. Then the functional derivative is defined in terms of the functional differential.
 ===Functional differential===
@@ Line 37: / Line 37: @@
 </math>
 Note that this notion of the functional differential can even be defined without a norm.
+In a more general case the [[function space]] <math>
+B
+</math> appearing as the domain of <math>F</math> is not a vector space,
+and therefore variations of the form <math>\rho + \varepsilon \phi</math> do not make sense.
+In this case we consider a variation <math>\alpha_{?} : (-\varepsilon_0, \varepsilon_0) \to B</math> of <math>\rho</math> to be a <math>C^1</math>-family of functions such that <math>\alpha_0 = \rho</math>.<ref group="Note">cf. [[homotopy]].</ref>
+Denoting the space of all such variations as <math>\mathcal V_\rho</math>, the functional differential <math>\delta F[\rho] : \mathcal V_\rho \to \mathbb R</math> is the functional
+<math display="block">
+\begin{align}
+\delta F[\rho;\alpha] = \delta F[\rho][\alpha] = \lim_{ \epsilon \to 0 } \frac{F[\alpha_{\epsilon}] - F[\rho]}{\epsilon} = F[\alpha_{?}]'(0)
+\end{align}
+</math>
+where <math>
+F[\alpha_?](\epsilon) = F[\alpha_\epsilon]
+</math>. The above then becomes the special case <math>
+\alpha_\epsilon = \rho + \epsilon \eta
+</math>.<ref>{{Cite web |last=Terek |first=Ivo |date=2019-06-12 |title=Introductory Variational Calculus on Manifolds |url=https://web.williams.edu/Mathematics/it3/texts/var_noether.pdf |url-status=live |access-date=2025-11-01 |format=PDF}}</ref>
 ===Functional derivative===
@@ Line 49: / Line 69: @@
 If this is the case and, moreover, <math>\delta F[\rho,\phi]</math> can be written as the integral of <math>\phi</math> times another function (denoted {{math|''δF''/''δρ''}})
 <math display="block">\delta F [\rho, \phi] = \int_\Omega \frac {\delta F} {\delta \rho}(x) \ \phi(x) \ dx</math>
-then this function {{math|''δF''/''δρ''}} is called the '''functional derivative''' of {{math|''F''}} at {{math|''ρ''}}.<ref name=ParrYangP246A.2>{{harvp|Parr|Yang|1989|loc= p. 246, Eq. A.2}}.</ref><ref name=GreinerReinhardtP36.2>{{harvp|Greiner|Reinhardt|1996|p=36,37}}.</ref> If <math>F</math> is restricted to only certain functions <math>\rho</math> (for example, if there are some boundary conditions imposed) then <math>\phi</math> is restricted to  functions such that <math>\rho+\varepsilon\phi</math> continues to satisfy these conditions.
+then this function {{math|''δF''/''δρ''}} is called the '''functional derivative''' of {{math|''F''}} at {{math|''ρ''}}.<ref name="ParrYangP246A.2">{{harvp|Parr|Yang|1989|loc= p. 246, Eq. A.2}}.</ref><ref name="GreinerReinhardtP36.2">{{harvp|Greiner|Reinhardt|1996|p=36,37}}.</ref> If <math>F</math> is restricted to only certain functions <math>\rho</math> (for example, if there are some boundary conditions imposed) then <math>\phi</math> is restricted to functions such that <math>\rho+\varepsilon\phi</math> continues to satisfy these conditions.
 Heuristically, <math>\phi</math> is the change in <math>\rho</math>, so we 'formally' have <math>\phi = \delta\rho</math>, and then this is similar in form to the [[total differential]] of a function <math>F(\rho_1,\rho_2,\dots,\rho_n)</math>,
 <math display="block"> dF = \sum_{i=1} ^n \frac {\partial F} {\partial \rho_i} \ d\rho_i ,</math>
 where <math>\rho_1,\rho_2,\dots,\rho_n</math> are independent variables.
-Comparing the last two equations, the functional derivative <math>\delta F/\delta\rho(x)</math> has a role similar to that of the partial derivative <math>\partial F/\partial\rho_i</math>, where the variable of integration <math>x</math> is like a continuous version of the summation index <math>i</math>.<ref name=ParrYangP246>{{harvp|Parr|Yang|1989|p=246}}.</ref> One thinks of {{math|''δF''/''δρ''}} as the gradient of {{math|''F''}} at the point {{math|''ρ''}}, so the value {{math|''δF''/''δρ(x)''}} measures how much the functional {{math|''F''}} will change if the function {{math|''ρ''}} is changed at the point {{math|''x''}}. Hence the formula
+Comparing the last two equations, the functional derivative <math>\delta F/\delta\rho(x)</math> has a role similar to that of the partial derivative <math>\partial F/\partial\rho_i</math>, where the variable of integration <math>x</math> is like a continuous version of the summation index <math>i</math>.<ref name="ParrYangP246">{{harvp|Parr|Yang|1989|p=246}}.</ref> One thinks of {{math|''δF''/''δρ''}} as the gradient of {{math|''F''}} at the point {{math|''ρ''}}, so the value {{math|''δF''/''δρ(x)''}} measures how much the functional {{math|''F''}} will change if the function {{math|''ρ''}} is changed at the point {{math|''x''}}. Hence the formula
 <math display="block">\int \frac{\delta F}{\delta\rho}(x) \phi(x) \; dx</math>
 is regarded as the directional derivative at point <math>\rho</math> in the direction of <math>\phi</math>. This is analogous to vector calculus, where the inner product of a vector <math>v</math> with the gradient gives the directional derivative in the direction of <math>v</math>.
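The directional-derivative reading can be checked numerically. The following is a minimal sketch, not part of the cited sources: it assumes a hypothetical test functional <math>F[\rho] = \int_0^1 \rho(x)^3 \, dx</math>, whose functional derivative is <math>3\rho(x)^2</math>, and compares the pairing <math>\int (\delta F/\delta\rho)(x)\,\phi(x)\,dx</math> with a central finite difference of <math>F[\rho + \varepsilon\phi]</math> in <math>\varepsilon</math>. The grid, <math>\rho</math>, and <math>\phi</math> are arbitrary illustrative choices.

```python
import numpy as np

# Uniform grid on [0, 1] (illustrative choice)
x = np.linspace(0.0, 1.0, 2001)
dx = x[1] - x[0]

def integrate(values):
    # Trapezoidal rule on the uniform grid
    return dx * (values.sum() - 0.5 * (values[0] + values[-1]))

def F(rho):
    # Hypothetical test functional F[rho] = integral of rho(x)^3
    return integrate(rho**3)

rho = 1.0 + 0.5 * np.sin(2 * np.pi * x)   # point at which F is differentiated
phi = np.exp(-50 * (x - 0.3)**2)          # direction of the variation

# Analytic functional derivative of the test functional: 3 rho(x)^2
dF_drho = 3 * rho**2
pairing = integrate(dF_drho * phi)

# Central finite difference of eps -> F[rho + eps * phi] at eps = 0
eps = 1e-6
finite_diff = (F(rho + eps * phi) - F(rho - eps * phi)) / (2 * eps)

print(pairing, finite_diff)
```

Both numbers use the same quadrature rule, so they should agree up to the finite-difference and rounding error rather than the grid error.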
@@ Line 64: / Line 84: @@
 is introduced.
 </ref>
-* Linearity:<ref name=ParrYangP247A.3>{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.3}}.</ref> <math display="block">\frac{\delta(\lambda F + \mu G)[\rho ]}{\delta \rho(x)} = \lambda \frac{\delta F[\rho]}{\delta \rho(x)} + \mu \frac{\delta G[\rho]}{\delta \rho(x)},</math> where {{math|''λ'', ''μ''}} are constants.
+* Linearity:<ref name="ParrYangP247A.3">{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.3}}.</ref> <math display="block">\frac{\delta(\lambda F + \mu G)[\rho ]}{\delta \rho(x)} = \lambda \frac{\delta F[\rho]}{\delta \rho(x)} + \mu \frac{\delta G[\rho]}{\delta \rho(x)},</math> where {{math|''λ'', ''μ''}} are constants.
-* Product rule:<ref name=ParrYangP247A.4>{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.4}}.</ref> <math display="block">\frac{\delta(FG)[\rho]}{\delta \rho(x)} = \frac{\delta F[\rho]}{\delta \rho(x)} G[\rho] + F[\rho] \frac{\delta G[\rho]}{\delta \rho(x)} \, , </math>
+* Product rule:<ref name="ParrYangP247A.4">{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.4}}.</ref> <math display="block">\frac{\delta(FG)[\rho]}{\delta \rho(x)} = \frac{\delta F[\rho]}{\delta \rho(x)} G[\rho] + F[\rho] \frac{\delta G[\rho]}{\delta \rho(x)} \, , </math>
 * Chain rules:
 **If {{math|''F''}} is a functional and {{math|''G''}} another functional, then<ref>{{harvp|Greiner|Reinhardt|1996|loc=p. 38, Eq. 6}}.</ref> <math display="block">\frac{\delta F[G[\rho]] }{\delta\rho(y)} = \int dx \frac{\delta F[G]}{\delta G(x)}_{G = G[\rho]}\cdot\frac {\delta G[\rho](x)} {\delta\rho(y)} \ . </math>
@@ Line 109: / Line 129: @@
 In the last two equations, the {{math|''n<sup>i</sup>''}} components of the tensor <math> \frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)} </math> are partial derivatives of {{math|''f''}} with respect to partial derivatives of ''ρ'',
-<math display="block"> \left [ \frac {\partial f} {\partial \left (\nabla^{(i)}\rho \right ) } \right ]_{\alpha_1 \alpha_2 \cdots \alpha_i} = \frac {\partial f} {\partial \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} }  </math>
+<math display="block"> \left [ \frac {\partial f} {\partial \left (\nabla^{(i)}\rho \right ) } \right ]_{\alpha_1 \alpha_2 \cdots \alpha_i} = \frac {\partial f} {\partial \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} } </math>
-where <math>  \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} \equiv \frac {\partial^{\,i}\rho} {\partial r_{\alpha_1} \, \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } </math>, and the tensor scalar product is,
+where <math> \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} \equiv \frac {\partial^{\,i}\rho} {\partial r_{\alpha_1} \, \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } </math>, and the tensor scalar product is,
 <math display="block"> \nabla^{(i)} \cdot \frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)} = \sum_{\alpha_1, \alpha_2, \cdots, \alpha_i = 1}^n \ \frac {\partial^{\, i} } {\partial r_{\alpha_1} \, \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } \ \frac {\partial f} {\partial \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} } \ . </math> <ref group="Note">For example, for the case {{math|1=''n'' = 3}} and {{math|1=''i'' = 2}}, the tensor scalar product is,
 <math display="block"> \nabla^{(2)} \cdot \frac{\partial f}{\partial\left(\nabla^{(2)}\rho\right)} = \sum_{\alpha, \beta = 1}^3 \ \frac {\partial^{\, 2} } {\partial r_{\alpha} \, \partial r_{\beta} } \, \frac {\partial f} {\partial \rho_{\alpha \beta} } , </math>where <math>\rho_{\alpha \beta} \equiv \frac {\partial^{\, 2}\rho} {\partial r_{\alpha} \, \partial r_{\beta} }</math>.</ref>
@@ Line 119: / Line 139: @@
 The [[Thomas–Fermi model]] of 1927 used a kinetic energy functional for a noninteracting uniform [[free electron model|electron gas]] in a first attempt at a [[density-functional theory]] of electronic structure:
 <math display="block">T_\mathrm{TF}[\rho] = C_\mathrm{F} \int \rho^{5/3}(\mathbf{r}) \, d\mathbf{r} \, .</math>
-Since the integrand of {{math|''T''<sub>TF</sub>[''ρ'']}} does not involve derivatives of {{math|''ρ''('''''r''''')}}, the functional derivative of {{math|''T''<sub>TF</sub>[''ρ'']}} is,<ref name=ParrYangP247A.6>{{harvp|Parr|Yang|1989|loc=p. 247, Eq. A.6}}.</ref>
+Since the integrand of {{math|''T''<sub>TF</sub>[''ρ'']}} does not involve derivatives of {{math|''ρ''('''''r''''')}}, the functional derivative of {{math|''T''<sub>TF</sub>[''ρ'']}} is,<ref name="ParrYangP247A.6">{{harvp|Parr|Yang|1989|loc=p. 247, Eq. A.6}}.</ref>
 <math display="block">\frac{\delta T_{\mathrm{TF}}}{\delta \rho (\boldsymbol{r}) }
 = C_\mathrm{F} \frac{\partial \rho^{5/3}(\mathbf{r})}{\partial \rho(\mathbf{r})}
@@ Line 148: / Line 168: @@
 The first and second terms on the right-hand side of the last equation are equal, since {{math|'''''r'''''}} and {{math|'''''r&prime;'''''}} in the second term can be interchanged without changing the value of the integral. Therefore,
 <math display="block"> \int \frac{\delta J}{\delta\rho(\boldsymbol{r})} \phi(\boldsymbol{r})d\boldsymbol{r} = \int \left ( \int \frac {\rho(\boldsymbol{r}') }{| \boldsymbol{r}-\boldsymbol{r}' |} d\boldsymbol{r}' \right ) \phi(\boldsymbol{r}) d\boldsymbol{r} </math>
-and the functional derivative of the electron-electron Coulomb potential energy functional {{math|''J''}}[''ρ''] is,<ref name=ParrYangP248A.11>{{harvp|Parr|Yang|1989|loc=p. 248, Eq. A.11}}.</ref>
+and the functional derivative of the electron-electron Coulomb potential energy functional {{math|''J''}}[''ρ''] is,<ref name="ParrYangP248A.11">{{harvp|Parr|Yang|1989|loc=p. 248, Eq. A.11}}.</ref>
 <math display="block"> \frac{\delta J}{\delta\rho(\boldsymbol{r})} = \int \frac {\rho(\boldsymbol{r}') }{| \boldsymbol{r}-\boldsymbol{r}' |} d\boldsymbol{r}' \, . </math>
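This result can be illustrated on a grid. The sketch below is only a numerical illustration under stated assumptions: it takes <math>J[\rho] = \tfrac{1}{2}\iint \rho(\boldsymbol{r})\rho(\boldsymbol{r}')/|\boldsymbol{r}-\boldsymbol{r}'| \, d\boldsymbol{r}\, d\boldsymbol{r}'</math>, replaces three-dimensional space by a one-dimensional grid, and replaces the Coulomb kernel by a softened kernel <math>1/\sqrt{(x-x')^2 + a^2}</math> so that the diagonal is finite. The softening length <code>a</code>, the grid, and the functions <code>rho</code> and <code>phi</code> are hypothetical choices.

```python
import numpy as np

# One-dimensional stand-in for space, with uniform weight w per grid point
x = np.linspace(-5.0, 5.0, 401)
w = x[1] - x[0]

# Softened, symmetric kernel (assumption: replaces 1/|r - r'| to avoid the singularity)
a = 0.5
K = 1.0 / np.sqrt((x[:, None] - x[None, :])**2 + a**2)

def J(rho):
    # Discretized J[rho] = 1/2 * double integral of rho(x) K(x, x') rho(x')
    return 0.5 * w * w * (rho @ K @ rho)

rho = np.exp(-x**2)
phi = np.cos(x) * np.exp(-0.5 * (x - 1.0)**2)

# Claimed functional derivative: the potential, integral of K(x, x') rho(x') dx'
potential = w * (K @ rho)
pairing = w * (potential @ phi)

# Central finite difference of eps -> J[rho + eps * phi]
eps = 1e-4
finite_diff = (J(rho + eps * phi) - J(rho - eps * phi)) / (2 * eps)

print(pairing, finite_diff)
```

Because the discretized functional is quadratic in <code>rho</code> and the kernel is symmetric, the central difference reproduces the pairing up to rounding, which mirrors the interchange of the two integration variables used in the derivation.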
@@ Line 165: / Line 185: @@
 & = -\frac{1}{8}\frac{\nabla\rho \cdot \nabla\rho}{\rho^2} - \left ( \frac {1}{4} \frac {\nabla^2\rho} {\rho} - \frac {1}{4} \frac {\nabla\rho \cdot \nabla\rho} {\rho^2} \right ) \qquad \text{where} \ \ \nabla^2 = \nabla \cdot \nabla \ ,
 \end{align}</math>
-and the result is,<ref name=ParrYangP247A.9>{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.9}}.</ref>
+and the result is,<ref name="ParrYangP247A.9">{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.9}}.</ref>
 <math display="block"> \frac{\delta T_\mathrm{W}}{\delta \rho} = \ \ \, \frac{1}{8}\frac{\nabla\rho \cdot \nabla\rho}{\rho^2} - \frac{1}{4}\frac{\nabla^2\rho}{\rho} \ . </math>
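A one-dimensional numerical check of this formula is sketched below. It assumes the one-dimensional analogue <math>T_\mathrm{W}[\rho] = \tfrac{1}{8}\int \rho'(x)^2/\rho(x)\,dx</math> and the illustrative density <math>\rho(x) = e^{-x^2}</math>, for which <math>\rho'/\rho = -2x</math> and <math>\rho''/\rho = 4x^2 - 2</math>, so the formula above evaluates to <math>(1 - x^2)/2</math>. The grid, the direction <code>phi</code>, and the step <code>eps</code> are hypothetical choices; <code>phi</code> decays quickly so that boundary terms are negligible.

```python
import numpy as np

x = np.linspace(-4.0, 4.0, 1601)
h = x[1] - x[0]

def integrate(values):
    # Trapezoidal rule on the uniform grid
    return h * (values.sum() - 0.5 * (values[0] + values[-1]))

def T_W(rho):
    # 1D analogue of the von Weizsaecker functional: 1/8 * integral of rho'^2 / rho
    drho = np.gradient(rho, h)
    return integrate(drho**2 / rho) / 8.0

rho = np.exp(-x**2)
phi = np.exp(-4.0 * (x - 0.5)**2)   # rapidly decaying direction of variation

# Formula from the text, 1/8 (rho'/rho)^2 - 1/4 rho''/rho, in closed form for this rho
dT_drho = (1.0 / 8.0) * (-2.0 * x)**2 - (1.0 / 4.0) * (4.0 * x**2 - 2.0)
pairing = integrate(dT_drho * phi)

# Central finite difference of eps -> T_W[rho + eps * phi]
eps = 1e-5
finite_diff = (T_W(rho + eps * phi) - T_W(rho - eps * phi)) / (2 * eps)

print(pairing, finite_diff)
```

The two values differ only by the discretization error of the grid derivative, which shrinks as the grid is refined.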

Contents

Definition

Functional differential

Functional derivative

Properties

Determining functional derivatives

Formula

Examples

Thomas–Fermi kinetic energy functional

Coulomb potential energy functional

von Weizsäcker kinetic energy functional

Entropy

Exponential

Functional derivative of a function

Functional derivative of iterated function

Using the delta function as a test function

Notes

Footnotes

References

External links
