找回密码
 快速注册
搜索
查看: 75|回复: 2

二重偏导数连续⇒偏导数可以互相交换

[复制链接]

3147

主题

8381

回帖

6万

积分

$\style{scale:11;fill:#eff}꩜$

积分
65357
QQ

显示全部楼层

hbghlyj 发表于 2022-6-23 00:47 |阅读模式
en.wikipedia.org/wiki/Symmetry_of_second_derivatives#Theorem_of_Schwarz

In mathematical analysis, Schwarz's theorem (or Clairaut's theorem on equality of mixed partials)[9] named after Alexis Clairaut and Hermann Schwarz, states that for a function ${\displaystyle f\colon \Omega \to \mathbb {R} }$ defined on a set $\Omega \subset \mathbb{R}^n$, if ${\displaystyle \mathbf {p} \in \mathbb {R} ^{n}}$ is a point such that some neighborhood of $\mathbf {p} $ is contained in $\Omega $ and $f$ has continuous second partial derivatives on that neighborhood of $\mathbf {p} $, then for all $i$ and $j$ in ${\displaystyle \{1,2\ldots ,\,n\}.}$

    ${\displaystyle {\frac {\partial ^{2}}{\partial x_{i}\,\partial x_{j}}}f(\mathbf {p} )={\frac {\partial ^{2}}{\partial x_{j}\,\partial x_{i}}}f(\mathbf {p} ).}$

The partial derivatives of this function commute at that point.

One easy way to establish this theorem (in the case where $n=2$, $i=1$, and $j=2$, which readily entails the result in general) is by applying Green's theorem to the gradient of $f.$

An elementary proof for functions on open subsets of the plane is as follows (by a simple reduction, the general case for the theorem of Schwarz easily reduces to the planar case).[10] Let ${\displaystyle f(x.y)}$ be a differentiable function on an open rectangle Ω containing a point $(a,b)$ and suppose that $df$ is continuous with continuous ${\displaystyle \partial _{x}\partial _{y}f}$ and ${\displaystyle \partial _{y}\partial _{x}f}$ over Ω. Define

    ${\displaystyle {\begin{aligned}u\left(h,\,k\right)&=f\left(a+h,\,b+k\right)-f\left(a+h,\,b\right),\\v\left(h,\,k\right)&=f\left(a+h,\,b+k\right)-f\left(a,\,b+k\right),\\w\left(h,\,k\right)&=f\left(a+h,\,b+k\right)-f\left(a+h,\,b\right)-f\left(a,\,b+k\right)+f\left(a,\,b\right).\end{aligned}}}$

These functions are defined for ${\displaystyle \left|h\right|,\,\left|k\right|<\varepsilon }$, where $\varepsilon >0$ and ${\displaystyle \left[a-\varepsilon ,\,a+\varepsilon \right]\times \left[b-\varepsilon ,\,b+\varepsilon \right]}$ is contained in Ω.

By the mean value theorem, for fixed h and k non-zero, ${\displaystyle \theta ,\,\theta ^{\prime },\,\,\phi ,\,\,\phi ^{\prime }}$ can be found in the open interval $ (0,1)$ with

    ${\displaystyle {\begin{aligned}w\left(h,\,k\right)&=u\left(h,\,k\right)-u\left(0,\,k\right)=h\,\partial _{x}u\left(\theta h,\,k\right)\\&=h\,\left[\partial _{x}f\left(a+\theta h,\,b+k\right)-\partial _{x}f\left(a+\theta h,\,b\right)\right]\\&=hk\,\partial _{y}\partial _{x}f\left(a+\theta h,\,b+\theta ^{\prime }k\right)\\w\left(h,\,k\right)&=v\left(h,\,k\right)-v\left(h,\,0\right)=k\,\partial _{y}v\left(h,\,\phi k\right)\\&=k\left[\partial _{y}f\left(a+h,\,b+\phi k\right)-\partial _{y}f\left(a,\,b+\phi k\right)\right]\\&=hk\,\partial _{x}\partial _{y}f\left(a+\phi ^{\prime }h,\,b+\phi k\right).\end{aligned}}}$

Since ${\displaystyle h,\,k\neq 0}$, the first equality below can be divided by $hk$:

    ${\displaystyle {\begin{aligned}hk\,\partial _{y}\partial _{x}f\left(a+\theta h,\,b+\theta ^{\prime }k\right)&=hk\,\partial _{x}\partial _{y}f\left(a+\phi ^{\prime }h,\,b+\phi k\right),\\\partial _{y}\partial _{x}f\left(a+\theta h,\,b+\theta ^{\prime }k\right)&=\partial _{x}\partial _{y}f\left(a+\phi ^{\prime }h,\,b+\phi k\right).\end{aligned}}}$

Letting ${\displaystyle h,\,k}$ tend to zero in the last equality, the continuity assumptions on ${\displaystyle \partial _{y}\partial _{x}f}$ and ${\displaystyle \partial _{x}\partial _{y}f}$ now imply that

    ${\displaystyle {\frac {\partial ^{2}}{\partial x\partial y}}f\left(a,\,b\right)={\frac {\partial ^{2}}{\partial y\partial x}}f\left(a,\,b\right).}$

This account is a straightforward classical method found in many text books, for example in Burkill, Apostol and Rudin.[10][11][12]

Although the derivation above is elementary, the approach can also be viewed from a more conceptual perspective so that the result becomes more apparent.[13][14][15][16][17] Indeed the difference operators ${\displaystyle \Delta _{x}^{t},\,\,\Delta _{y}^{t}}$ commute and ${\displaystyle \Delta _{x}^{t}f,\,\,\Delta _{y}^{t}f}$ tend to ${\displaystyle \partial _{x}f,\,\,\partial _{y}f}$ as $t$ tends to 0, with a similar statement for second order operators.[a] Here, for $z$ a vector in the plane and $u$ a directional vector, the difference operator is defined by

    ${\displaystyle \Delta _{u}^{t}f(z)={f(z+tu)-f(z) \over t}.}$

By the fundamental theorem of calculus for $C^{1}$ functions $f$ on an open interval $I$ with ${\displaystyle (a,b)\subset I}$

    ${\displaystyle \int _{a}^{b}f^{\prime }(x)\,dx=f(b)-f(a).}$

Hence

    ${\displaystyle |f(b)-f(a)|\leq (b-a)\,\sup _{c\in (a,b)}|f^{\prime }(c)|}$.

This is a generalized version of the mean value theorem. Recall that the elementary discussion on maxima or minima for real-valued functions implies that if $f$ is continuous on $[a,b]$ and differentiable on $(a,b)$, then there is a point $c$ in $(a,b)$ such that

    ${\displaystyle {f(b)-f(a) \over b-a}=f^{\prime }(c).}$

For vector-valued functions with $V$ a finite-dimensional normed space, there is no analogue of the equality above, indeed it fails. But since ${\displaystyle \inf f^{\prime }\leq f^{\prime }(c)\leq \sup f^{\prime }}$, the inequality above is a useful substitute. Moreover, using the pairing of the dual of $V$ with its dual norm, yields the following inequality:

    ${\displaystyle \|f(b)-f(a)\|\leq (b-a)\,\sup _{c\in (a,b)}\|f^{\prime }(c)\|}$.

These versions of the mean valued theorem are discussed in Rudin, Hörmander and elsewhere.[19][20]

For $f$ a $C^{2}$ function on an open set in the plane, define ${\displaystyle D_{1}=\partial _{x}}$ and ${\displaystyle D_{2}=\partial _{y}}$. Furthermore for ${\displaystyle t\neq 0}$ set

    ${\displaystyle \Delta _{1}^{t}f(x,y)=[f(x+t,y)-f(x,y)]/t,\,\,\,\,\,\,\Delta _{2}^{t}f(x,y)=[f(x,y+t)-f(x,y)]/t}$.

Then for $(x_{0},y_{0})$ in the open set, the generalized mean value theorem can be applied twice:

    $\displaystyle \left|\Delta _{1}^{t}\Delta _{2}^{t}f(x_{0},y_{0})-D_{1}D_{2}f(x_{0},y_{0})\right|$
    $\displaystyle\leq \sup _{0\leq s\leq 1}\left|\Delta _{1}^{t}D_{2}f(x_{0},y_{0}+ts)-D_{1}D_{2}f(x_{0},y_{0})\right|$
    $\displaystyle\leq \sup _{0\leq r,s\leq 1}\left|D_{1}D_{2}f(x_{0}+tr,y_{0}+ts)-D_{1}D_{2}f(x_{0},y_{0})\right|.$

Thus ${\displaystyle \Delta _{1}^{t}\Delta _{2}^{t}f(x_{0},y_{0})}$ tends to ${\displaystyle D_{1}D_{2}f(x_{0},y_{0})}$ as $t$ tends to 0. The same argument shows that ${\displaystyle \Delta _{2}^{t}\Delta _{1}^{t}f(x_{0},y_{0})}$ tends to ${\displaystyle D_{2}D_{1}f(x_{0},y_{0})}$. Hence, since the difference operators commute, so do the partial differential operators $D_{1}$ and $D_{2}$, as claimed.[21][22][23][24][25]

Remark. By two applications of the classical mean value theorem,

    ${\displaystyle \Delta _{1}^{t}\Delta _{2}^{t}f(x_{0},y_{0})=D_{1}D_{2}f(x_{0}+t\theta ,y_{0}+t\theta ^{\prime })}$

for some $\theta $ and $\theta^\prime$ in $(0,1)$. Thus the first elementary proof can be reinterpreted using difference operators. Conversely, instead of using the generalized mean value theorem in the second proof, the classical mean valued theorem could be used.

3147

主题

8381

回帖

6万

积分

$\style{scale:11;fill:#eff}꩜$

积分
65357
QQ

显示全部楼层

 楼主| hbghlyj 发表于 2022-6-23 00:49

Requirement of continuity

本帖最后由 hbghlyj 于 2022-12-29 18:49 编辑 The symmetry may be broken if the function fails to have differentiable partial derivatives, which is possible if Clairaut's theorem is not satisfied (the second partial derivatives are not continuous).
The function $f(x, y)$, as shown in equation \eqref{1}, does not have symmetric second derivatives at its origin.

An example of non-symmetry is the function (due to Peano)[36][37]

\begin{equation}\label1
f(x,\,y)={\begin{cases}{\frac {xy\left(x^{2}-y^{2}\right)}{x^{2}+y^{2}}}&{\mbox{ for }}(x,\,y)\neq (0,\,0),\\0&{\mbox{ for }}(x,\,y)=(0,\,0).\end{cases}}\tag1\end{equation} x^2-y^2 x^2+y^2.gif

This can be visualized by the polar form ${\displaystyle f(r\cos(\theta ),r\sin(\theta ))={\frac {r^{2}\sin(4\theta )}{4}}}$; it is everywhere continuous, but its derivatives at (0, 0) cannot be computed algebraically. Rather, the limit of difference quotients shows that ${\displaystyle f_{x}(0,0)=f_{y}(0,0)=0}$, so the graph ${\displaystyle z=f(x,y)}$ has a horizontal tangent plane at (0, 0), and the partial derivatives ${\displaystyle f_{x},f_{y}}$ exist and are everywhere continuous. However, the second partial derivatives are not continuous at (0, 0), and the symmetry fails. In fact, along the x-axis the y-derivative is ${\displaystyle f_{y}(x,0)=x}$, and so:

    ${\displaystyle f_{yx}(0,0)=\lim _{\varepsilon \to 0}{\frac {f_{y}(\varepsilon ,0)-f_{y}(0,0)}{\varepsilon }}=1.}$

In contrast, along the y-axis the x-derivative ${\displaystyle f_{x}(0,y)=-y}$, and so ${\displaystyle f_{xy}(0,0)=-1}$. That is, ${\displaystyle f_{yx}\neq f_{xy}}$ at (0, 0), although the mixed partial derivatives do exist, and at every other point the symmetry does hold.

The above function, written in a cylindrical coordinate system, can be expressed as

    ${\displaystyle f(r,\,\theta )={\frac {r^{2}\sin {4\theta }}{4}},}$

showing that the function oscillates four times when traveling once around an arbitrarily small loop containing the origin. Intuitively, therefore, the local behavior of the function at (0, 0) cannot be described as a quadratic form, and the Hessian matrix thus fails to be symmetric.

In general, the interchange of limiting operations need not commute. Given two variables near (0, 0) and two limiting processes on

    ${\displaystyle f(h,\,k)-f(h,\,0)-f(0,\,k)+f(0,\,0)}$

corresponding to making $h → 0$ first, and to making $k → 0$ first. It can matter, looking at the first-order terms, which is applied first. This leads to the construction of pathological examples in which second derivatives are non-symmetric. This kind of example belongs to the theory of real analysis where the pointwise value of functions matters. When viewed as a distribution the second partial derivative's values can be changed at an arbitrary set of points as long as this has Lebesgue measure 0. Since in the example the Hessian is symmetric everywhere except (0, 0), there is no contradiction with the fact that the Hessian, viewed as a Schwartz distribution, is symmetric.


c3d.libretexts.org/CalcPlot3D/index.html

3147

主题

8381

回帖

6万

积分

$\style{scale:11;fill:#eff}꩜$

积分
65357
QQ

显示全部楼层

 楼主| hbghlyj 发表于 2022-11-3 05:21

Calculus.subwiki.org

本帖最后由 hbghlyj 于 2022-12-29 18:50 编辑 Clairaut's theorem on equality of mixed partials
Failure of Clairaut's theorem where only one of the mixed partials is defined
Failure of Clairaut's theorem where both mixed partials are defined but not equal
搬维基好用的代码
  1. document.querySelectorAll('.mwe-math-fallback-image-inline').forEach(a=>{if(a.alt.slice(0,15)=="{\\displaystyle "){a.replaceWith('$'+a.alt.slice(15,-1)+'$')}else{a.replaceWith('$'+a.alt+'$')}})
  2. document.querySelectorAll('.mwe-math-fallback-image-display').forEach(a=>{if(a.alt.slice(0,15)=="{\\displaystyle "){a.replaceWith('$$'+a.alt.slice(15,-1)+'$$')}else{a.replaceWith('$$'+a.alt+'$$')}})
  3. document.querySelectorAll('.mwe-math-mathml-inline.mwe-math-mathml-a11y').forEach(a=>a.remove())
复制代码

手机版|悠闲数学娱乐论坛(第3版)

GMT+8, 2025-3-4 21:43

Powered by Discuz!

× 快速回复 返回顶部 返回列表