Directional derivative: Difference between revisions

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Definition: Add alternative common definition of directional derivative.
 
 
Line 5: Line 5:
In [[multivariable calculus]], the '''directional derivative''' measures the rate at which a function changes in a particular direction at a given point.{{cn|date=November 2023}}
In [[multivariable calculus]], the '''directional derivative''' measures the rate at which a function changes in a particular direction at a given point.{{cn|date=November 2023}}


The directional derivative of a multivariable [[differentiable function|differentiable (scalar) function]] along a given [[vector (mathematics)|vector]] '''v''' at a given point '''x''' intuitively represents the instantaneous rate of change of the function, moving through '''x''' with a direction specified by '''v'''.   
The directional derivative of a multivariable [[differentiable function|differentiable scalar function]] along a given [[vector (mathematics)|vector]] '''v''' at a given point '''x''' represents the instantaneous rate of change of the function in the direction '''v''' through '''x'''.   


The directional derivative of a [[Scalar field|scalar function]] ''f'' with respect to a vector '''v''' at a point (e.g., position) '''x''' may be denoted by any of the following:
Many mathematical texts assume that the directional vector is [[Unit vector|normalized]] (a unit vector), meaning that its magnitude is equivalent to one. This is by convention and not required for proper calculation. In order to adjust a formula for the directional derivative to work for any vector, one must divide the expression by the magnitude of the vector. Normalized vectors are denoted with a [[circumflex]] (hat) symbol: <math>\mathbf{\widehat{}}</math>. 
 
The directional derivative of a [[Scalar field|scalar function]] ''f'' with respect to a vector '''v''' (denoted as <math>\mathbf{\hat{v}}</math> when [[Unit vector|normalized]]) at a point (e.g., position) ('''x''',f('''x''')) may be denoted by any of the following:
<math display="block">
<math display="block">
\begin{aligned}
\begin{aligned}
Line 15: Line 17:
&=Df(\mathbf{x})(\mathbf{v})\\
&=Df(\mathbf{x})(\mathbf{v})\\
&=\partial_\mathbf{v}f(\mathbf{x})\\
&=\partial_\mathbf{v}f(\mathbf{x})\\
&=\mathbf{v}\cdot{\nabla f(\mathbf{x})}\\
&=\frac{\partial f(\mathbf{x})}{\partial \mathbf{v}}\\
&=\mathbf{v}\cdot \frac{\partial f(\mathbf{x})}{\partial\mathbf{x}}.\\
&=\mathbf{\hat{v}}\cdot{\nabla f(\mathbf{x})}\\
&=\mathbf{\hat{v}} \cdot \frac{\partial f(\mathbf{x})}{\partial\mathbf{x}}.\\
\end{aligned}
\end{aligned}
</math>
</math>
Line 31: Line 34:
<math display="block">\mathbf{v} = (v_1, \ldots, v_n)</math>
<math display="block">\mathbf{v} = (v_1, \ldots, v_n)</math>
is the [[function (mathematics)|function]] <math>\nabla_{\mathbf{v}}{f}</math> defined by the [[limit (mathematics)|limit]]<ref>{{cite book |author1=R. Wrede |author2=M.R. Spiegel | title=Advanced Calculus|edition=3rd| publisher=Schaum's Outline Series| year=2010 | isbn=978-0-07-162366-7}}</ref>
is the [[function (mathematics)|function]] <math>\nabla_{\mathbf{v}}{f}</math> defined by the [[limit (mathematics)|limit]]<ref>{{cite book |author1=R. Wrede |author2=M.R. Spiegel | title=Advanced Calculus|edition=3rd| publisher=Schaum's Outline Series| year=2010 | isbn=978-0-07-162366-7}}</ref>
<math display="block">\nabla_{\mathbf{v}}{f}(\mathbf{x}) = \lim_{h \to 0}{\frac{f(\mathbf{x} + h\mathbf{v}) - f(\mathbf{x})}{h}} = \left.\frac{\mathrm{d}}{\mathrm{d}t}f(\mathbf{x}+t\mathbf{v})\right|_{t=0}.</math>
<math display="block">\nabla_{\mathbf{v}}{f}(\mathbf{x}) = \lim_{h \to 0}{\frac{f(\mathbf{x} + h\mathbf{v}) - f(\mathbf{x})}{h||\mathbf{v}||}} = \left.\frac{1}{||\mathbf{v}||} \frac{\mathrm{d}}{\mathrm{d}t}f(\mathbf{x}+t\mathbf{v})\right|_{t=0}.</math>


This definition is valid in a broad range of contexts, for example where the [[Euclidean norm|norm]] of a vector (and hence a unit vector) is undefined.<ref>The applicability extends to functions over spaces without a [[metric (mathematics)|metric]] and to [[differentiable manifold]]s, such as in [[general relativity]].</ref>
This definition is valid in a broad range of contexts, for example, where the [[Euclidean norm|norm]] of a vector (and hence a unit vector) is defined.<ref>The applicability extends to functions over spaces without a [[metric (mathematics)|metric]] and to [[differentiable manifold]]s, such as in [[general relativity]].</ref>


=== For differentiable functions ===
=== For differentiable functions ===


If the function ''f'' is [[Differentiable function#Differentiability in higher dimensions|differentiable]] at '''x''', then the directional derivative exists along any unit vector '''v''' at x, and one has
If the function ''f'' is [[Differentiable function#Differentiability in higher dimensions|differentiable]] at '''x''', then the directional derivative exists along any vector '''v''' at '''x''', and one has
 
<math display="block">\nabla_{\mathbf{v}}{f}(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \frac{\mathbf{v}}{||\mathbf{v}||}</math>
 
where the <math>\nabla</math> on the right denotes the ''[[gradient]]'' and <math>\cdot</math> is the [[dot product]].<ref>If the dot product is undefined, the [[gradient]] is also undefined; however, for differentiable ''f'', the directional derivative is still defined, and a similar relation exists with the exterior derivative.</ref>


<math display="block">\nabla_{\mathbf{v}}{f}(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \mathbf{v}</math>
It can be derived by using the property that all directional derivatives at a point make up a single tangent plane which can be defined using partial derivatives. This can be used to find a formula for the gradient vector and an alternative formula for the directional derivative, the latter of which can be rewritten as shown above for convenience.


where the <math>\nabla</math> on the right denotes the ''[[gradient]]'', <math>\cdot</math> is the [[dot product]] and '''v''' is a unit vector.<ref>If the dot product is undefined, the [[gradient]] is also undefined; however, for differentiable ''f'', the directional derivative is still defined, and a similar relation exists with the exterior derivative.</ref> This follows from defining a path <math>h(t) = x + tv</math> and using the definition of the derivative as a limit which can be calculated along this path to get:
It also follows from defining a path <math>h(t) = x + tv</math> and using the definition of the derivative as a limit which can be calculated along this path to get:
<math display="block">\begin{align}
<math display="block">\begin{align}
0
0
&=\lim_{t \to 0}\frac {f(x+tv)-f(x)-tDf(x)(v)} t \\
&=\lim_{t \to 0}\frac {f(x+t\hat{v})-f(x)-t\nabla f(x)\cdot \hat{v}} t \\
&=\lim_{t \to 0}\frac {f(x+tv)-f(x)} t - Df(x)(v) \\
&=\lim_{t \to 0}\frac {f(x+t\hat{v})-f(x)} t - \nabla f(x)\cdot \hat{v} \\
&=\nabla_v f(x)-Df(x)(v).
&=\nabla_v f(x)-\nabla f(x)\cdot \hat{v}.\\
&\nabla f(x)\cdot \hat{v}=\nabla_v f(x)
\end{align}</math>
\end{align}</math>
Intuitively, the directional derivative of ''f'' at a point '''x''' represents the [[derivative|rate of change]] of ''f'', in the direction of '''v'''.


=== Using only direction of vector ===
=== Using only direction of vector ===
Line 56: Line 62:


This definition gives the rate of increase of {{math|''f''}} per unit of distance moved in the direction given by {{math|'''v'''}}. In this case, one has
This definition gives the rate of increase of {{math|''f''}} per unit of distance moved in the direction given by {{math|'''v'''}}. In this case, one has
<math display="block">\nabla_{\mathbf{v}}{f}(\mathbf{x}) = \lim_{h \to 0}{\frac{f(\mathbf{x} + h\mathbf{v}) - f(\mathbf{x})}{h|\mathbf{v}|}},</math>
<math display="block">\nabla_{\mathbf{v}}{f}(\mathbf{x}) = \lim_{h \to 0}{\frac{f(\mathbf{x} + h\mathbf{v}) - f(\mathbf{x})}{h||\mathbf{v}||}},</math>
or in case ''f'' is differentiable at '''x''',
or in case ''f'' is differentiable at '''x''',
<math display="block">\nabla_{\mathbf{v}}{f}(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \frac{\mathbf{v}}{|\mathbf{v}|} .</math>
<math display="block">\nabla_{\mathbf{v}}{f}(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \frac{\mathbf{v}}{||\mathbf{v}||} .</math>


=== Restriction to a unit vector ===
=== Restriction to a unit vector ===


In the context of a function on a [[Euclidean space]], some texts restrict the vector '''v''' to being a [[unit vector]].  With this restriction, both the above definitions are equivalent.<ref>{{Cite book| title=Calculus : Single and multivariable.|last1=Hughes Hallett|first1=Deborah|author1-link=Deborah Hughes Hallett|last2=McCallum|first2=William G.| author2-link=William G. McCallum|last3=Gleason|first3=Andrew M.|author3-link=Andrew M. Gleason| date=2012-01-01| publisher=John wiley|isbn=9780470888612|pages=780|oclc=828768012}}</ref>
In the context of a function on a [[Euclidean space]], some texts restrict the vector '''v''' to being a [[unit vector]] for conventionBoth of the above equations remain true, though redundant, when a vector is normalized.<ref>{{Cite book| title=Calculus : Single and multivariable.|last1=Hughes Hallett|first1=Deborah|author1-link=Deborah Hughes Hallett|last2=McCallum|first2=William G.| author2-link=William G. McCallum|last3=Gleason|first3=Andrew M.|author3-link=Andrew M. Gleason| date=2012-01-01| publisher=John wiley|isbn=9780470888612|pages=780|oclc=828768012}}</ref>


== Properties ==
== Properties ==

Latest revision as of 20:33, 8 November 2025

Template:Short description Template:Refimprove section Script error: No such module "sidebar".

In multivariable calculus, the directional derivative measures the rate at which a function changes in a particular direction at a given point.Script error: No such module "Unsubst".

The directional derivative of a multivariable differentiable scalar function along a given vector v at a given point x represents the instantaneous rate of change of the function in the direction v through x.

Many mathematical texts assume that the directional vector is normalized (a unit vector), meaning that its magnitude is equivalent to one. This is by convention and not required for proper calculation. In order to adjust a formula for the directional derivative to work for any vector, one must divide the expression by the magnitude of the vector. Normalized vectors are denoted with a circumflex (hat) symbol: ^.

The directional derivative of a scalar function f with respect to a vector v (denoted as v^ when normalized) at a point (e.g., position) (x,f(x)) may be denoted by any of the following: 𝐯f(𝐱)=f'𝐯(𝐱)=D𝐯f(𝐱)=Df(𝐱)(𝐯)=𝐯f(𝐱)=f(𝐱)𝐯=v^f(𝐱)=v^f(𝐱)𝐱.

It therefore generalizes the notion of a partial derivative, in which the rate of change is taken along one of the curvilinear coordinate curves, all other coordinates being constant. The directional derivative is a special case of the Gateaux derivative.

Definition

File:Directional derivative contour plot.svg
A contour plot of f(x,y)=x2+y2, showing the gradient vector in black, and the unit vector 𝐮 scaled by the directional derivative in the direction of 𝐮 in orange. The gradient vector is longer because the gradient points in the direction of greatest rate of increase of a function.

The directional derivative of a scalar function f(𝐱)=f(x1,x2,,xn) along a vector 𝐯=(v1,,vn) is the function 𝐯f defined by the limit[1] 𝐯f(𝐱)=limh0f(𝐱+h𝐯)f(𝐱)h||𝐯||=1||𝐯||ddtf(𝐱+t𝐯)|t=0.

This definition is valid in a broad range of contexts, for example, where the norm of a vector (and hence a unit vector) is defined.[2]

For differentiable functions

If the function f is differentiable at x, then the directional derivative exists along any vector v at x, and one has

𝐯f(𝐱)=f(𝐱)𝐯||𝐯||

where the on the right denotes the gradient and is the dot product.[3]

It can be derived by using the property that all directional derivatives at a point make up a single tangent plane which can be defined using partial derivatives. This can be used to find a formula for the gradient vector and an alternative formula for the directional derivative, the latter of which can be rewritten as shown above for convenience.

It also follows from defining a path h(t)=x+tv and using the definition of the derivative as a limit which can be calculated along this path to get: 0=limt0f(x+tv^)f(x)tf(x)v^t=limt0f(x+tv^)f(x)tf(x)v^=vf(x)f(x)v^.f(x)v^=vf(x)

Using only direction of vector

File:Geometrical interpretation of a directional derivative.svg
The angle α between the tangent A and the horizontal will be maximum if the cutting plane contains the direction of the gradient A.

In a Euclidean space, some authors[4] define the directional derivative to be with respect to an arbitrary nonzero vector v after normalization, thus being independent of its magnitude and depending only on its direction.[5]

This definition gives the rate of increase of fScript error: No such module "Check for unknown parameters". per unit of distance moved in the direction given by vScript error: No such module "Check for unknown parameters".. In this case, one has 𝐯f(𝐱)=limh0f(𝐱+h𝐯)f(𝐱)h||𝐯||, or in case f is differentiable at x, 𝐯f(𝐱)=f(𝐱)𝐯||𝐯||.

Restriction to a unit vector

In the context of a function on a Euclidean space, some texts restrict the vector v to being a unit vector for convention. Both of the above equations remain true, though redundant, when a vector is normalized.[6]

Properties

Many of the familiar properties of the ordinary derivative hold for the directional derivative. These include, for any functions f and g defined in a neighborhood of, and differentiable at, p:

  1. sum rule: 𝐯(f+g)=𝐯f+𝐯g.
  2. constant factor rule: For any constant c, 𝐯(cf)=c𝐯f.
  3. product rule (or Leibniz's rule): 𝐯(fg)=g𝐯f+f𝐯g.
  4. chain rule: If g is differentiable at p and h is differentiable at g(p), then 𝐯(hg)(𝐩)=h(g(𝐩))𝐯g(𝐩).

In differential geometry

Script error: No such module "Labelled list hatnote".

Let MScript error: No such module "Check for unknown parameters". be a differentiable manifold and pScript error: No such module "Check for unknown parameters". a point of MScript error: No such module "Check for unknown parameters".. Suppose that fScript error: No such module "Check for unknown parameters". is a function defined in a neighborhood of pScript error: No such module "Check for unknown parameters"., and differentiable at pScript error: No such module "Check for unknown parameters".. If vScript error: No such module "Check for unknown parameters". is a tangent vector to MScript error: No such module "Check for unknown parameters". at pScript error: No such module "Check for unknown parameters"., then the directional derivative of fScript error: No such module "Check for unknown parameters". along vScript error: No such module "Check for unknown parameters"., denoted variously as df(v)Script error: No such module "Check for unknown parameters". (see Exterior derivative), 𝐯f(𝐩) (see Covariant derivative), L𝐯f(𝐩) (see Lie derivative), or 𝐯𝐩(f) (see Template:Section link), can be defined as follows. Let γ : [−1, 1] → MScript error: No such module "Check for unknown parameters". be a differentiable curve with γ(0) = pScript error: No such module "Check for unknown parameters". and γ′(0) = vScript error: No such module "Check for unknown parameters".. Then the directional derivative is defined by 𝐯f(𝐩)=ddτfγ(τ)|τ=0. This definition can be proven independent of the choice of γScript error: No such module "Check for unknown parameters"., provided γScript error: No such module "Check for unknown parameters". is selected in the prescribed manner so that γ(0) = pScript error: No such module "Check for unknown parameters". and γ′(0) = vScript error: No such module "Check for unknown parameters"..

The Lie derivative

The Lie derivative of a vector field Wμ(x) along a vector field Vμ(x) is given by the difference of two directional derivatives (with vanishing torsion): VWμ=(V)Wμ(W)Vμ. In particular, for a scalar field ϕ(x), the Lie derivative reduces to the standard directional derivative: Vϕ=(V)ϕ.

The Riemann tensor

Directional derivatives are often used in introductory derivations of the Riemann curvature tensor. Consider a curved rectangle with an infinitesimal vector δ along one edge and δ along the other. We translate a covector S along δ then δ and then subtract the translation along δ and then δ. Instead of building the directional derivative using partial derivatives, we use the covariant derivative. The translation operator for δ is thus 1+νδνDν=1+δD, and for δ, 1+μδ'μDμ=1+δD. The difference between the two paths is then (1+δD)(1+δD)Sρ(1+δD)(1+δD)Sρ=μ,νδ'μδν[Dμ,Dν]Sρ. It can be argued[7] that the noncommutativity of the covariant derivatives measures the curvature of the manifold: [Dμ,Dν]Sρ=±σRσρμνSσ, where R is the Riemann curvature tensor and the sign depends on the sign convention of the author.

In group theory

Translations

In the Poincaré algebra, we can define an infinitesimal translation operator P as 𝐏=i. (the i ensures that P is a self-adjoint operator) For a finite displacement λ, the unitary Hilbert space representation for translations is[8] U(λ)=exp(iλ𝐏). By using the above definition of the infinitesimal translation operator, we see that the finite translation operator is an exponentiated directional derivative: U(λ)=exp(λ). This is a translation operator in the sense that it acts on multivariable functions f(x) as U(λ)f(𝐱)=exp(λ)f(𝐱)=f(𝐱+λ).

Template:Math proof

Rotations

The rotation operator also contains a directional derivative. The rotation operator for an angle θ, i.e. by an amount θ = |θ| about an axis parallel to θ^=θ/θ is U(R(θ))=exp(iθ𝐋). Here L is the vector operator that generates SO(3): 𝐋=(000001010)𝐢+(001000100)𝐣+(010100000)𝐤. It may be shown geometrically that an infinitesimal right-handed rotation changes the position vector x by 𝐱𝐱δθ×𝐱. So we would expect under infinitesimal rotation: U(R(δθ))f(𝐱)=f(𝐱δθ×𝐱)=f(𝐱)(δθ×𝐱)f. It follows that U(R(δθ))=1(δθ×𝐱). Following the same exponentiation procedure as above, we arrive at the rotation operator in the position basis, which is an exponentiated directional derivative:[9] U(R(θ))=exp((θ×𝐱)).

Normal derivative

A normal derivative is a directional derivative taken in the direction normal (that is, orthogonal) to some surface in space, or more generally along a normal vector field orthogonal to some hypersurface. See for example Neumann boundary condition. If the normal direction is denoted by 𝐧, then the normal derivative of a function f is sometimes denoted as f𝐧. In other notations, f𝐧=f(𝐱)𝐧=𝐧f(𝐱)=f𝐱𝐧=Df(𝐱)[𝐧].

In the continuum mechanics of solids

Several important results in continuum mechanics require the derivatives of vectors with respect to vectors and of tensors with respect to vectors and tensors.[10] The directional directive provides a systematic way of finding these derivatives.

Template:Excerpt

See also


Notes

<templatestyles src="Reflist/styles.css" />

  1. Script error: No such module "citation/CS1".
  2. The applicability extends to functions over spaces without a metric and to differentiable manifolds, such as in general relativity.
  3. If the dot product is undefined, the gradient is also undefined; however, for differentiable f, the directional derivative is still defined, and a similar relation exists with the exterior derivative.
  4. Thomas, George B. Jr.; and Finney, Ross L. (1979) Calculus and Analytic Geometry, Addison-Wesley Publ. Co., fifth edition, p. 593.
  5. This typically assumes a Euclidean space – for example, a function of several variables typically has no definition of the magnitude of a vector, and hence of a unit vector.
  6. Script error: No such module "citation/CS1".
  7. Script error: No such module "citation/CS1".
  8. Script error: No such module "citation/CS1".
  9. Script error: No such module "citation/CS1".
  10. J. E. Marsden and T. J. R. Hughes, 2000, Mathematical Foundations of Elasticity, Dover.

Script error: No such module "Check for unknown parameters".

References

  • Script error: No such module "citation/CS1".
  • Script error: No such module "citation/CS1".
  • Script error: No such module "Citation/CS1".

External links

Template:Commons category inline

Template:Calculus topics