多元函数微分学
第十七章 多元函数微分学
§1. 可微性与偏导数
定义 (1)
设函数 \(z = f(x,y)\) 在某邻域 \(U(P_0)\) 内有定义。对于 \(P(x,y) = (x_0 + \Delta x,y_0 + \Delta y)\in U(P_0)\) , 若 \(f\) 在 \(P_0\) 的全增量 \(\Delta z\) 可表示为:
\[
\Delta z = f \left(x _ {0} + \Delta x, y _ {0} + \Delta y\right) - f \left(x _ {0}, y _ {0}\right) = A \Delta x + B \Delta y + o (\rho), \tag {1}
\]
其中 \(A, B\) 是仅与点 \(P_0\) 有关的常数, \(\rho = \sqrt{\Delta x^2 + \Delta y^2}\) ,则称 \(f\) 在 \(P_0\) 可微。并称 (1) 式中关于 \(\Delta x, \Delta y\) 的线性表达式 \(A \Delta x + B \Delta y\) 为 \(f\) 在 \(P_0\) 的全微分,记作
\[
\mathrm{d} z | _ {P _ {0}} = \mathrm{d} f (x _ {0}, y _ {0}) = A \Delta x + B \Delta y. \tag {2}
\]
由 (1), (2), 当 \(|\Delta x|, |\Delta y|\) 充分小时,全微分 \(\mathrm{d}z\) 可作为全增量 \(\Delta z\) 的近似值,于是:
\[
f (x, y) \approx f \left(x _ {0}, y _ {0}\right) + A \left(x - x _ {0}\right) + B \left(y - y _ {0}\right). \tag {3}
\]
在使用上,有时也把 (1) 式写成:
\[
\Delta z = A \Delta x + B \Delta y + \alpha \Delta x + \beta \Delta y, \tag {4}
\]
这里 \(\lim_{(\Delta x, \Delta y) \to (0, 0)} \alpha = \lim_{(\Delta x, \Delta y) \to (0, 0)} \beta = 0\) .
定义 (1)
设函数 \(z = f(x,y)\) 在某邻域 \(U(P_0)\) 内有定义。对于 \(P(x,y) = (x_0 + \Delta x,y_0 + \Delta y)\in U(P_0)\) , 若 \(f\) 在 \(P_0\) 的全增量 \(\Delta z\) 可表示为:
\[
\Delta z = f \left(x _ {0} + \Delta x, y _ {0} + \Delta y\right) - f \left(x _ {0}, y _ {0}\right) = A \Delta x + B \Delta y + o (\rho), \tag {1}
\]
其中 \(A, B\) 是仅与点 \(P_0\) 有关的常数, \(\rho = \sqrt{\Delta x^2 + \Delta y^2}\) ,则称 \(f\) 在 \(P_0\) 可微。并称 (1) 式中关于 \(\Delta x, \Delta y\) 的线性表达式 \(A \Delta x + B \Delta y\) 为 \(f\) 在 \(P_0\) 的全微分,记作
\[
\mathrm{d} z | _ {P _ {0}} = \mathrm{d} f (x _ {0}, y _ {0}) = A \Delta x + B \Delta y. \tag {2}
\]
由 (1), (2), 当 \(|\Delta x|, |\Delta y|\) 充分小时,全微分 \(\mathrm{d}z\) 可作为全增量 \(\Delta z\) 的近似值,于是:
\[
f (x, y) \approx f \left(x _ {0}, y _ {0}\right) + A \left(x - x _ {0}\right) + B \left(y - y _ {0}\right). \tag {3}
\]
在使用上,有时也把 (1) 式写成:
\[
\Delta z = A \Delta x + B \Delta y + \alpha \Delta x + \beta \Delta y, \tag {4}
\]
这里 \(\lim_{(\Delta x, \Delta y) \to (0, 0)} \alpha = \lim_{(\Delta x, \Delta y) \to (0, 0)} \beta = 0\) .
为什么 \(o(\rho)\) 可以写成 \(\alpha\Delta x + \beta\Delta y\) ,且 \(\alpha, \beta \to 0?\)
根据小 o 定义:
\[
\frac {o (\rho)}{\rho} \rightarrow 0
\]
也就是说:
\[
o (\rho) = \varepsilon (\Delta x, \Delta y) \cdot \rho , \quad \varepsilon \rightarrow 0
\]
于是可以构造:
\[
o (\rho) = \varepsilon \cdot \rho = \varepsilon \cdot \frac {\Delta x ^ {2} + \Delta y ^ {2}}{\rho}
\]
拆开:
\[
= \varepsilon \cdot \frac {\Delta x}{\rho} \Delta x + \varepsilon \cdot \frac {\Delta y}{\rho} \Delta y
\]
令
\[
\alpha = \varepsilon \cdot \frac {\Delta x}{\rho}, \quad \beta = \varepsilon \cdot \frac {\Delta y}{\rho}.
\]
于是
\[
o (\rho) = \alpha \Delta x + \beta \Delta y.
\]
因为
所以
\[
\alpha \to 0, \quad \beta \to 0.
\]
例 (1)
考察 \(f(x,y)=xy\) 在任一点 \((x_{0},y_{0})\) 的可微性.
解 f 在点 \((x_{0}, y_{0})\) 处的全增量为
\[
\begin{array}{l} \Delta f \left(x _ {0}, y _ {0}\right) = \left(x _ {0} + \Delta x\right) \left(y _ {0} + \Delta y\right) - x _ {0} y _ {0} \\ = y _ {0} \Delta x + x _ {0} \Delta y + \Delta x \Delta y. \\ \end{array}
\]
由于 \(\frac{|\Delta x\Delta y|}{\rho} = \rho \frac{|\Delta x|}{\rho} \frac{|\Delta y|}{\rho} \leq \rho \rightarrow 0\) when \(\rho \rightarrow 0\) ,
因此 \(\Delta x\Delta y = o(\rho)\) . 从而 f 在 \((x_{0}, y_{0})\) 可微,且
\[
\mathrm{d} f = y _ {0} \Delta x + x _ {0} \Delta y
\]
偏导数
由一元函数微分学:若 \(f(x)\) 在 \(x_0\) 可微,则 \(f(x_0 + \Delta x) - f(x_0) = A\Delta x + o(\Delta x)\) ,其中 \(A = f'(x_0)\) .
当二元函数 \(f(x,y)\) 在点 \((x_{0},y_{0})\) 可微时,(1) 式中的常数 A, B 应取怎样的值?
为此在
\[
\Delta z = A \Delta x + B \Delta y + \alpha \Delta x + \beta \Delta y, \tag {4}
\]
中先令 \(\Delta y = 0 (\Delta x \neq 0)\) ,这时得到 f 关于 x 的偏增量为 \(\Delta_{x}z = A\Delta x + \alpha\Delta x\) 或 \(\frac{\Delta_{x}z}{\Delta x} = A + \alpha\) 。于是
\[
A = \lim _ {\Delta x \rightarrow 0} \frac {\Delta_ {x} z}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \frac {f (x _ {0} + \Delta x , y _ {0}) - f (x _ {0} , y _ {0})}{\Delta x}. \tag {5}
\]
容易看出,(5) 式右边的极限正是关于 \(x\) 的一元函数 \(f(x, y_0)\) 在 \(x = x_0\) 处的导数.
类似地,在 (4) 式中令 \(\Delta x = 0 (\Delta y \neq 0)\) , 又可得到
\[
B = \lim _ {\Delta y \rightarrow 0} \frac {\Delta_ {y} z}{\Delta y} = \lim _ {\Delta y \rightarrow 0} \frac {f (x _ {0} , y _ {0} + \Delta y) - f (x _ {0} , y _ {0})}{\Delta y}, \tag {6}
\]
它是关于 y 的一元函数 \(f(x_{0}, y)\) 在 \(y = y_{0}\) 处的导数.
二元函数当固定其中一个自变量时,它对另一个自变量的导数称为该函数的偏导数,
定义 (2)
设函数 \(z = f(x,y),(x,y)\in D\) ,且 \(f(x,y_0)\) 在 \(x_0\) 的某邻域内有定义.则当极限
\[
\lim _ {\Delta x \rightarrow 0} \frac {\Delta_ {x} z}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \frac {f (x _ {0} + \Delta x , y _ {0}) - f (x _ {0} , y _ {0})}{\Delta x} \tag {7}
\]
存在时,称此极限为 \(f\) 在点 \((x_0, y_0)\) 关于 \(x\) 的偏导数,记作
\[
f _ {x} \left(x _ {0}, y _ {0}\right), \quad \text {或} \quad z _ {x} \left(x _ {0}, y _ {0}\right), \frac {\partial f}{\partial x} \Big | _ {(x _ {0}, y _ {0})}, \frac {\partial z}{\partial x} \Big | _ {(x _ {0}, y _ {0})}.
\]
类似地可定义 f 在点 \((x_{0}, y_{0})\) 关于 y 的偏导数:
\[
\lim _ {\Delta y \rightarrow 0} \frac {\Delta_ {y} z}{\Delta y} = \lim _ {\Delta y \rightarrow 0} \frac {f (x _ {0} , y _ {0} + \Delta y) - f (x _ {0} , y _ {0})}{\Delta y}, \tag {$7^{\prime$}}
\]
记作 \(f_{y}(x_{0},y_{0})\) ,或 \(z_{y}(x_{0},y_{0}),\frac{\partial f}{\partial y}\Big|_{(x_0,y_0)},\frac{\partial z}{\partial y}\Big|_{(x_0,y_0)}.\)
定义 (2)
设函数 \(z = f(x,y),(x,y)\in D\) ,且 \(f(x,y_0)\) 在 \(x_0\) 的某邻域内有定义。则当极限
\[
\lim _ {\Delta x \rightarrow 0} \frac {\Delta_ {x} z}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \frac {f (x _ {0} + \Delta x , y _ {0}) - f (x _ {0} , y _ {0})}{\Delta x} \tag {7}
\]
存在时,称此极限为 \(f\) 在点 \((x_0, y_0)\) 关于 \(x\) 的偏导数,记作
\[
f _ {x} \left(x _ {0}, y _ {0}\right), \quad \text {或} \quad z _ {x} \left(x _ {0}, y _ {0}\right), \frac {\partial f}{\partial x} \Big | _ {(x _ {0}, y _ {0})}, \frac {\partial z}{\partial x} \Big | _ {(x _ {0}, y _ {0})}.
\]
类似地可定义 f 在点 \((x_{0}, y_{0})\) 关于 y 的偏导数:
\[
\lim _ {\Delta y \rightarrow 0} \frac {\Delta_ {y} z}{\Delta y} = \lim _ {\Delta y \rightarrow 0} \frac {f (x _ {0} , y _ {0} + \Delta y) - f (x _ {0} , y _ {0})}{\Delta y}, \tag {$7^{\prime$}}
\]
记作 \(f_{y}(x_{0},y_{0})\) ,或 \(z_{y}(x_{0},y_{0}),\left.\frac{\partial f}{\partial y}\right|_{(x_{0},y_{0})},\left.\frac{\partial z}{\partial y}\right|_{(x_{0},y_{0})}.\)
注 1 这里 \(\frac{\partial}{\partial x}\) , \(\frac{\partial}{\partial y}\) 是专用于偏导数的符号,与一元函数的导数符号 \(\frac{d}{dx}\) 相仿,但又有区别.
注 2 在上述定义中,f 在点 \((x_{0}, y_{0})\) 存在对 x (或 y) 的偏导数,此时 f 至少在
\[
\{(x, y) \mid y = y _ {0}, | x - x _ {0} | < \delta \}
\]
(或 \(\{(x,y)\mid x=x_{0},\mid y-y_{0}\mid<\delta\}\))
上必须有定义.显然,在定义域的内点处总能满足这种要求,而在界点处则往往无法考虑偏导数.
若函数 \(z = f(x, y)\) 在区域 D 上每一点 \((x, y)\) 都存在对 x(或对 y)的偏导数,则得到 \(z = f(x, y)\) 在 D 上对 x(或对 y)的偏导函数(也简称偏导数),记作 \(f_x(x, y)\) 或 \(\frac{\partial f(x, y)}{\partial x} (f_y(x, y))\) 或 \(\left. \frac{\partial f(x, y)}{\partial y}\right)\) ,也可简单地写作 \(f_x, z_x\) ,或 \(\frac{\partial f}{\partial x}, \frac{\partial z}{\partial x} (f_y, z_y\) ,或 \(\left. \frac{\partial f}{\partial y}, \frac{\partial z}{\partial y}\right)\) .
偏导数的几何意义: \(z = f(x, y)\) 的几何图象通常是三维空间中的曲面,设 \(P_0(x_0, y_0, z_0)\) 为此曲面上一点,其中 \(z_0 = f(x_0, y_0)\) 。过点 \(P_0\) 作平面 \(y = y_0\) ,它与曲面相交得一曲线: \(C: y = y_0, z = f(x, y)\) 。

偏导数 \(f_{x}(x_0,y_0)\) 的几何意义为:在平面 \(y = y_0\) 上,曲线 \(C\) 在点 \(P_0\) 处的切线与 \(x\) 轴正向所成倾角 \(\alpha\) 的正切,即 \(f_{x}(x_0,y_0) = \tan \alpha\) .
可同样讨论偏导数 \(f_{y}(x_{0}, y_{0})\) 的几何意义.
由偏导数的定义还知道,多元函数 \(f\) 对某一个自变量求偏导数,是先把别的自变量看作常数,变成一元函数的求导。因此第五章中有关求导数的基本法则,对多元函数求偏导数仍然适用.
偏导数的几何意义: \(z = f(x, y)\) 的几何图象通常是三维空间中的曲面,设 \(P_0(x_0, y_0, z_0)\) 为此曲面上一点,其中 \(z_0 = f(x_0, y_0)\) 。过点 \(P_0\) 作平面 \(y = y_0\) ,它与曲面相交得一曲线: \(C: y = y_0, z = f(x, y)\) 。

偏导数 \(f_{x}(x_0,y_0)\) 的几何意义为:在平面 \(y = y_0\) 上,曲线 \(C\) 在点 \(P_0\) 处的切线与 \(x\) 轴正向所成倾角 \(\alpha\) 的正切,即 \(f_{x}(x_0,y_0) = \tan \alpha\) .
可同样讨论偏导数 \(f_{y}(x_{0}, y_{0})\) 的几何意义.
由偏导数的定义还知道,多元函数 \(f\) 对某一个自变量求偏导数,是先把别的自变量看作常数,变成一元函数的求导。因此第五章中有关求导数的基本法则,对多元函数求偏导数仍然适用.
偏导数的几何意义: \(z = f(x, y)\) 的几何图象通常是三维空间中的曲面,设 \(P_0(x_0, y_0, z_0)\) 为此曲面上一点,其中 \(z_0 = f(x_0, y_0)\) 。过点 \(P_0\) 作平面 \(y = y_0\) ,它与曲面相交得一曲线: \(C: y = y_0, z = f(x, y)\) 。

偏导数 \(f_{x}(x_0,y_0)\) 的几何意义为:在平面 \(y = y_0\) 上,曲线 \(C\) 在点 \(P_0\) 处的切线与 \(x\) 轴正向所成倾角 \(\alpha\) 的正切,即 \(f_{x}(x_0,y_0) = \tan \alpha\) .
可同样讨论偏导数 \(f_{y}(x_{0}, y_{0})\) 的几何意义.
由偏导数的定义还知道,多元函数 \(f\) 对某一个自变量求偏导数,是先把别的自变量看作常数,变成一元函数的求导。因此第五章中有关求导数的基本法则,对多元函数求偏导数仍然适用.
例 (2)
求函数 \(f(x,y)=x^{3}+2x^{2}y-y^{3}\) 在点 (1,3) 处关于 x 和关于 y 的偏导数.
解 先求 \(x\) 的偏导数。令 \(y = 3\) , 得 \(f(x, 3) = x^3 + 6x^2 - 27\) , 求它在 \(x = 1\) 的导数,则得
\[
f _ {x} (1, 3) = \left. \frac {\mathrm{d} f (x , 3)}{\mathrm{d} x} \right| _ {x = 1} = \left(3 x ^ {2} + 1 2 x\right) \Big | _ {x = 1} = 1 5.
\]
再求 \(f\) 在 \((1,3)\) 处关于 \(y\) 的偏导数。为此令 \(x = 1\) , 得 \(f(1,y) = 1 + 2y - y^3\) , 求它在 \(y = 3\) 处的导数,又得
\[
f _ {y} (1, 3) = \left. \frac {\mathrm{d} f (1 , y)}{\mathrm{d} y} \right| _ {y = 3} = 2 - \left. 3 y ^ {2} \right| _ {y = 3} = - 2 5
\]
通常也可先分别求出关于 \(x\) 和 \(y\) 的偏导函数:
\[
f _ {x} (x, y) = 3 x ^ {2} + 4 x y
\]
\[
f _ {y} (x, y) = 2 x ^ {2} - 3 y ^ {2}.
\]
然后以 \((x,y)=(1,3)\) 代入,也能得到同样结果。
例 (3)
求函数 \(z = x^{y}(x > 0)\) 的偏导数.
解 把 \(z = x^{y}\) 依次看成幂函数和指数函数,分别求得
\[
\frac {\partial z}{\partial x} = y \cdot x ^ {y - 1}, \quad \frac {\partial z}{\partial y} = x ^ {y} \ln x
\]
例 (4)
求三元函数 \(u = \sin (x + y^2 -\mathrm{e}^z)\) 的偏导数.
解 把 \(y\) 和 \(z\) 看作常数,得到
\[
\frac {\partial u}{\partial x} = \cos \left(x + y ^ {2} - \mathrm{e} ^ {z}\right);
\]
把 z, x 看作常数,得到
\[
\frac {\partial u}{\partial y} = 2 y \cos \left(x + y ^ {2} - \mathrm{e} ^ {z}\right)
\]
把 x, y 看作常数,得到
\[
\frac {\partial u}{\partial z} = - \mathrm{e} ^ {z} \cos \left(x + y ^ {2} - \mathrm{e} ^ {z}\right)
\]
可微性条件
由可微定义易知:若 f 在点 \(P_{0}(x_{0},y_{0})\) 可微,则 f 在 \(P_{0}\) 必连续。这表明:
“连续是可微的一个必要条件.”
此外,由
\[
A = \lim _ {\Delta x \to 0} \frac {f (x _ {0} + \Delta x , y _ {0}) - f (x _ {0} , y _ {0})}{\Delta x},
\]
\[
B = \lim _ {\Delta y \to 0} \frac {f (x _ {0} , y _ {0} + \Delta y) - f (x _ {0} , y _ {0})}{\Delta y},
\]
又可得到可微的另一必要条件:
定理 (17.1 可微性的必要条件)
若二元函数 \(f\) 在其定义域内一点 \((x_0, y_0)\) 处可微,则 \(f\) 在该点关于每个自变量的偏导数都存在。此时,微分表达式 (1) 式中的
\[
A = f _ {x} \left(x _ {0}, y _ {0}\right), \quad B = f _ {y} \left(x _ {0}, y _ {0}\right).
\]
于是,函数 \(f\) 在点 \((x_0, y_0)\) 的全微分 (2) 可惟一地表示为
\[
\mathrm{d} f (x _ {0}, y _ {0}) = f _ {x} (x _ {0}, y _ {0}) \Delta x + f _ {y} (x _ {0}, y _ {0}) \Delta y.
\]
与一元函数一样,若约定自变量的增量等于自变量的微分,即
\[
\Delta x = d x, \Delta y = d y
\]
则全微分又可写为 \(\mathrm{d}f(x_{0},y_{0})=f_{x}(x_{0},y_{0})\mathrm{d}x+f_{y}(x_{0},y_{0})\mathrm{d}y.\)
若函数 \(f\) 在区域 \(D\) 的每一点 \((x,y)\) 都可微,则称函数 \(f\) 在区域 \(D\) 上可微,且 \(f\) 在 \(D\) 上的全微分为
\[
\mathrm{d} f (x, y) = f _ {x} (x, y) \mathrm{d} x + f _ {y} (x, y) \mathrm{d} y. \tag {8}
\]
定理 17.1 的应用:对于函数 \(f(x,y)=\sqrt{x^{2}+y^{2}}\) ,由于 \(f(x,0)=|x|\) , \(f(0,y)=|y|\) ,它们分别在 x=0 与 y=0 都不可导,即 \(f_{x}(0,0)\) 与 \(f_{y}(0,0)\) 都不存在,故 \(f(x,y)\) 在点 \((0,0)\) 不可微.
例 (5)
考察函数 \(f(x,y) = \left\{ \begin{array}{cc} \frac{xy}{\sqrt{x^2 + y^2}}, & x^2 + y^2 \neq 0, \\ 0, & x^2 + y^2 = 0 \end{array} \right.\) 在原点的可微性.
解 按偏导数的定义先求出
\[
f _ {x} (0, 0) = \lim _ {\Delta x \rightarrow 0} \frac {f (\Delta x , 0) - f (0 , 0)}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \frac {0 - 0}{\Delta x} = 0;
\]
同理可得 \(f_{y}(0,0)=0\) .
若 \(f\) 在原点可微,则
\[
\begin{array}{l} f (0 + \Delta x, 0 + \Delta y) - f (0, 0) - [ f _ {x} (0, 0) \Delta x + f _ {y} (0, 0) \Delta y ] \\ = \frac {\Delta x \Delta y}{\sqrt {\Delta x ^ {2} + \Delta y ^ {2}}} \\ \end{array}
\]
应是 \(\rho = \sqrt{\Delta x^2 + \Delta y^2}\) 的高阶无穷小量。然而极限 \(\lim_{\rho \to 0} \frac{\Delta x \Delta y}{\Delta x^2 + \Delta y^2}\) 却不存在 (第十六章 §2 例 3), 故此 \(f(x, y)\) 在原点不可微.
以前知道,一元函数可微与存在导数是等价的.
而这个例子说明:对于多元函数,偏导数即使都存在,该函数也不一定可微.
还需要添加哪些条件,才能保证函数可微呢?
定理 (定理 17.2 (可微的充分条件))
若函数 \(z = f(x,y)\) 在点 \(P_0(x_0,y_0)\) 的某邻域内存在偏导数 \(f_{x}\) 与 \(f_{y}\) , 且它们在点 \(P_0\) 连续,则 \(f\) 在点 \(P_0\) 可微.
证 第一步 把全增量 \(\Delta z\) 写作
\[
\begin{array}{l} \Delta z = f \left(x _ {0} + \Delta x, y _ {0} + \Delta y\right) - f \left(x _ {0}, y _ {0}\right) \\ = [ f (x _ {0} + \Delta x, y _ {0} + \Delta y) - f (x _ {0}, y _ {0} + \Delta y) ] \\ + \left[ f (x _ {0}, y _ {0} + \Delta y) - f (x _ {0}, y _ {0}) \right]. \\ \end{array}
\]
在第一个方括号里的是函数 \(f(x, y_{0} + \Delta y)\) 关于 x 的增量;在第二个括号里的是函数 \(f(x_{0}, y)\) 关于 y 的增量.
第二步 对它们分别应用一元函数的拉格朗日中值定理,则 \(\exists\theta_{1},\theta_{2}\in(0,1)\) , 使得
\[
\Delta z = f _ {x} \left(x _ {0} + \theta_ {1} \Delta x, y _ {0} + \Delta y\right) \Delta x + f _ {y} \left(x _ {0}, y _ {0} + \theta_ {2} \Delta y\right) \Delta y \tag {9}
\]
第三步 由于 \(f_{x}, f_{y}\) 在点 \((x_{0}, y_{0})\) 连续,因此有
\[
f _ {x} \left(x _ {0} + \theta_ {1} \Delta x, y _ {0} + \Delta y\right) = f _ {x} \left(x _ {0}, y _ {0}\right) + \alpha , \tag {10}
\]
\[
f _ {y} \left(x _ {0}, y _ {0} + \theta_ {2} \Delta y\right) = f _ {y} \left(x _ {0}, y _ {0}\right) + \beta , \tag {11}
\]
其中当 \((\Delta x, \Delta y) \to (0, 0)\) 时,\(\alpha \to 0, \beta \to 0\) .
第四步 将 (10), (11) 代入 (9) 式,
\[
\Delta z = f _ {x} \left(x _ {0} + \theta_ {1} \Delta x, y _ {0} + \Delta y\right) \Delta x + f _ {y} \left(x _ {0}, y _ {0} + \theta_ {2} \Delta y\right) \Delta y. \tag {9}
\]
得到
\[
\Delta z = f _ {x} (x _ {0}, y _ {0}) \Delta x + f _ {y} (x _ {0}, y _ {0}) \Delta y + \alpha \Delta x + \beta \Delta y.
\]
由可微定义的等价式 (4), 便知 f 在点 \((x_{0}, y_{0})\) 可微.

定理 17.2 的应用 容易验证例 2 中的函数
\[
f (x, y) = x ^ {3} + 2 x ^ {2} y - y ^ {3}
\]
满足定理 17.2 的条件,故在点 \((1,3)\) 可微 (且在 \(R^{2}\) 上处处可微);
例 3 中的函数 \(z = x^{y}\) 在 \(\{(x, y) \mid x > 0, -\infty < y < +\infty\}\) 上满足定理 17.2 的条件,也在其定义域上可微;
例 4 中的函数 \(u = \sin (x + y^2 - e^z)\) 在 \(\mathbb{R}^3\) 上同样可微.
注意 偏导数连续并不是可微的必要条件,例如
\[
f (x, y) = \left\{ \begin{array}{c l} (x ^ {2} + y ^ {2}) \sin \frac {1}{\sqrt {x ^ {2} + y ^ {2}}}, & x ^ {2} + y ^ {2} \neq 0 \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
证明 由于 \(\lim_{(x,y)\to (0,0)}(x^2 +y^2)\sin \frac{1}{\sqrt{x^2 + y^2}} = 0 = f(0,0)\) ,所以 \(f(x,y)\) 在点 \((0,0)\) 连续.
当 \(x^{2} + y^{2} = 0\) 时
\[
\lim _ {\Delta x \rightarrow 0} \frac {f (0 + \Delta x , 0) - f (0 , 0)}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \Delta x \sin \frac {1}{| \Delta x |} = 0 = f _ {x} (0, 0)
\]
当 \(x^{2} + y^{2} \neq 0\) 时
\[
f _ {x} (x, y) = 2 x \sin \frac {1}{\sqrt {x ^ {2} + y ^ {2}}} - \frac {x}{\sqrt {x ^ {2} + y ^ {2}}} \cos \frac {1}{\sqrt {x ^ {2} + y ^ {2}}}
\]
而 \(\lim_{(x,y)\to(0,0)}2x\sin\frac{1}{\sqrt{x^{2}+y^{2}}}=0,\quad\lim_{(x,y)\to(0,0)}\frac{x}{\sqrt{x^{2}+y^{2}}}\cos\frac{1}{\sqrt{x^{2}+y^{2}}}\) 不存在 (可考察 y=0 情况).
因此, \(\lim_{(x,y)\to(0,0)}f_{x}(x,y)\) 不存在,从而 \(f_{x}(x,y)\) 在点 \((0,0)\) 不连续.
同理可证 \(f_{y}(x,y)\) 在点 \((0,0)\) 不连续。然而
\[
\begin{array}{l} \lim _ {(\Delta x, \Delta y) \rightarrow (0, 0)} \frac {\Delta f - f _ {x} (0 , 0) \Delta x - f _ {y} (0 , 0) \Delta y}{\sqrt {\Delta x ^ {2} + \Delta y ^ {2}}} \\ = \lim _ {(\Delta x, \Delta y) \rightarrow (0, 0)} \frac {\Delta x ^ {2} + \Delta y ^ {2}}{\sqrt {\Delta x ^ {2} + \Delta y ^ {2}}} \sin \frac {1}{\sqrt {\Delta x + \Delta y}} = 0 \\ \end{array}
\]
所以 f 在点 \((0,0)\) 可微.
若 \(f(x,y)\) 的偏导数 \(f_{x}\) 与 \(f_{y}\) 在点 \((x_{0},y_{0})\) 都连续,则称 f 在点 \((x_{0},y_{0})\) 连续可微.
在定理 17.2 证明过程中出现的 (9) 式,实际上是二元函数的一个中值公式,将它重新写成定理如下:
定理 (17.3)
设函数 f 在点 \((x_{0}, y_{0})\) 的某邻域内存在偏导数,若 \((x, y)\) 属于该邻域,则存在
\[
\xi = x _ {0} + \theta_ {1} (x - x _ {0}), \quad \eta = y _ {0} + \theta_ {2} (y - y _ {0}), \quad 0 < \theta_ {1}, \theta_ {2} < 1
\]
使得
\[
f (x, y) - f \left(x _ {0}, y _ {0}\right) = f _ {x} (\xi , y) \left(x - x _ {0}\right) + f _ {y} \left(x _ {0}, \eta\right) \left(y - y _ {0}\right) \tag {12}
\]
例:某年小考题
四、(18 分)证明题
1、(9 分)设 \(u(x,y) = \left\{ \begin{array}{cc}xy\sin \frac{1}{\sqrt{x^2 + y^2}} & x^2 +y^2\neq 0\\ 0 & x^2 +y^2 = 0 \end{array} \right.\)
(1) 求 \(u_{x}(0,0)\) , \(u_{y}(0,0)\) ;
解答:某年小考题
四。证明.
\[
\lim _ {x \to 0} u _ {x} (0, 0) = \lim _ {x \to 0} \frac {u (x , 0) - u (0 , 0)}{x} = \lim _ {x \to 0} \frac {0}{x} = 0
\]
\[
u _ {y} (0, 0) = 0.
\]
(2) 变 \((x,y)\neq(0,0)\) 时.
\[
u _ {x} (x, y) = y \sin \frac {1}{\sqrt {x ^ {2} + y ^ {2}}} - x ^ {2} y \cos \frac {1}{\sqrt {x ^ {2} + y ^ {2}}} \frac {1}{(x ^ {2} + y ^ {2}) ^ {\frac {3}{2}}}
\]
\[
u _ {y} (x, y) = x \sin \frac {1}{\sqrt {x ^ {2} + y ^ {2}}} - \frac {x y ^ {2}}{(x ^ {2} + y ^ {2}) ^ {3 / 2}} \cos \frac {1}{\sqrt {x ^ {2} + y ^ {2}}}
\]
令 \(x = r\cos \theta\) \(y = r\sin \theta\)
\[
\lim _ {(x, y) \to (0, 0)} u _ {x} (x, y) = \lim _ {y \to 0 ^ {+}} [ y \sin \theta \sin \frac {1}{r} - \cos^ {2} \theta \sin \theta a b \frac {1}{r} ]
\]
不存在,
同理 \(\lim_{(x,y)\to(0,0)}u_{y}(x,y)\) 不存在.
一反 \(U_{x}(x,y)\) . \(U_{y}(x,y)\) 在 \((0,0)\) 处不连续.
解答:某年小考题
\[
\begin{array}{r l}{\lim _ {\sqrt {x ^ {2} + y ^ {2}} \rightarrow 0 +} \frac {u (x , y) - u (0 , 0) - u _ {x} (0 , 0) \Delta x - u _ {y} (0 , 0) \Delta y}{\sqrt {x ^ {2} + y ^ {2}}}}\\&{= \lim _ {r \rightarrow 0 +} \frac {r ^ {2} \cos \theta \sin \theta \sin \frac {1}{r}}{r} = 0.}\end{array}
\]
故 \(u(x,y)\) 在 \((0,0)\) 处可微.
作业
p.119
1 单、9
可微性的几何意义及应用
一元函数 \(y = f(x)\) 可微,在几何上反映为曲线存在不平行于 \(y\) 轴的切线。对于二元函数而言,可微性则反映为曲面与其切平面之间的类似关系。为此需要先给出切平面的定义,这可以从切线定义中获得启发.
在第五章 §1 中,我们曾把平面曲线 \(S\) 在其上某一点 \(P(x_0, y_0)\) 的切线 \(PT\) 定义为:过点 \(P\) 的割线 \(PQ\) 当 \(Q\) 沿 \(S\) 趋近 \(P\) 时的极限位置 (如果存在的话).
这时 \(PQ\) 与 \(PT\) 的夹角 \(\varphi\) 也将随 \(Q \to P\) 而趋于零 (参见图 17-2). 用 \(h\) 和 \(d\) 分别表示点 \(Q\) 到直线 \(PT\) 的距离和点 \(Q\) 到点 \(P\) 的距离,由于
\[
\sin \varphi = \frac {h}{d}
\]
因此当 \(Q\) 沿 \(S\) 趋于 \(P\) 时,\(\varphi \rightarrow 0\) 等同于 \(\frac{h}{d} \rightarrow 0\) . 仿照这个想法,我们引进曲面 \(S\) 在点 \(P\) 的切平面的定义 (参见图 17-3).

定义 (3)
设曲面 \(S\) 上一点 \(P, \Pi\) 为通过点 \(P\) 的一个平面,\(S\) 上的动点 \(Q\) 到定点 \(P\) 和到平面 \(\Pi\) 距离分别记为 \(d\) 和 \(h\) . 当 \(Q\) 在 \(S\) 上以任意方式趋近于 \(P\) 时,恒有
\[
\frac {h}{d} \rightarrow 0,
\]
则称 \(\Pi\) 为曲面 S 在点 P 的切平面,称 P 为切点.

定理 (17.4)
曲面 \(z = f(x,y)\) 在点 \(P(x_0,y_0,f(x_0,y_0))\) 存在不平行于 \(z\) 轴的切平面的充要条件是:函数 \(f\) 在点 \(P_0(x_0,y_0)\) 可微.
证(充分性)若函数 f 在 \(P_{0}\) 可微,由定义知道
\[
z - z _ {0} = f _ {x} (x _ {0}, y _ {0}) (x - x _ {0}) + f _ {y} (x _ {0}, y _ {0}) (y - y _ {0}) + o (\rho),
\]
其中 \(z_{0}=f\left(x_{0},y_{0}\right),\rho=\sqrt{\left(x-x_{0}\right)^{2}+\left(y-y_{0}\right)^{2}}.\)
现在讨论过点 \(P(x_{0}, y_{0}, f(x_{0}, y_{0}))\) 的平面 \(\Pi\) :
\[
Z - z _ {0} = f _ {x} \left(x _ {0}, y _ {0}\right) \left(X - x _ {0}\right) + f _ {y} \left(x _ {0}, y _ {0}\right) \left(Y - y _ {0}\right),
\]
其中 X, Y, Z 是平面上点的流动坐标.
下面证明它就是曲面 \(z = f(x, y)\) 在点 P 的切平面.
由于 \(S\) 上动点 \(Q(x,y,z)\) 到 \(\Pi\) 的距离为
\[
h = \frac {\left| z - z _ {0} - f _ {x} \left(x _ {0} , y _ {0}\right) \left(x - x _ {0}\right) - f _ {y} \left(x _ {0} , y _ {0}\right) \left(y - y _ {0}\right) \right|}{\sqrt {1 + f _ {x} ^ {2} \left(x _ {0} , y _ {0}\right) + f _ {y} ^ {2} \left(x _ {0} , y _ {0}\right)}}
\]
\[
= \frac {| o (\rho) |}{\sqrt {1 + f _ {x} ^ {2} (x _ {0} , y _ {0}) + f _ {y} ^ {2} (x _ {0} , y _ {0})}},
\]
P 到 Q 的距离为
\[
d = \sqrt {(x - x _ {0}) ^ {2} + (y - y _ {0}) ^ {2} + (z - z _ {0}) ^ {2}} \geq \rho ,
\]
\[
\frac {h}{d} < \frac {h}{\rho} = \frac {| o (\rho) |}{\rho} \cdot \frac {1}{\sqrt {1 + f _ {x} ^ {2} (x _ {0} , y _ {0}) + f _ {y} ^ {2} (x _ {0} , y _ {0})}} \rightarrow 0
\]
因此,由 \(\frac{h}{d} \geq 0\) , 及 \(\rho \rightarrow 0\) , 根据定义 3, 平面 II 即为曲面 \(z = f(x, y)\) 在点 \(P\) 的切平面.
*(必要性) 若曲面 \(z = f(x, y)\) 在点 \(P\) 存在不平行于 \(z\) 轴的切平面
\[
Z - z _ {0} = A (X - x _ {0}) + B (Y - y _ {0}).
\]
第一步 设 \(Q(x,y,z)\) 是曲面上任意一点,由 Q 到这个平面的距离为
\[
h = \frac {| z - z _ {0} - A (x - x _ {0}) - B (y - y _ {0}) |}{\sqrt {1 + A ^ {2} + B ^ {2}}}.
\]
令 \(\Delta x = x - x_0, \Delta y = y - y_0, \Delta z = z - z_0\) ,则 \(\rho = \sqrt{\Delta x^2 + \Delta y^2}, d = \sqrt{\Delta x^2 + \Delta y^2 + \Delta z^2}\) . 由切平面的定义知道,当 \(Q \to P\) 时,有 \(\frac{h}{d} \to 0\) .
因此对于充分接近的 P 与 Q,有
\[
{\frac {h}{d}} = {\frac {| \Delta z - A \Delta x - B \Delta y |}{d {\sqrt {1 + A ^ {2} + B ^ {2}}}}} < {\frac {1}{2 {\sqrt {1 + A ^ {2} + B ^ {2}}}}},
\]
由此得
\[
\left| \Delta z - A \Delta x - B \Delta y \right| < \frac {d}{2} = \frac {1}{2} \sqrt {\rho^ {2} + \Delta z ^ {2}}.
\]
第二步 分析:要证明 f 在点 \(P_{0}(x_{0}, y_{0})\) 可微,事实上就是需证
\[
\Delta z - (A \Delta x + B \Delta y) = o (\rho)
\]
\[
\text { ratio } \stackrel {{\text { def }}} {{=}} \frac {| \Delta z - (A \Delta x + B \Delta y) |}{\rho}
\]
由于 \(= \frac{|\Delta z - A\Delta x - B\Delta y|}{d\sqrt{1 + A^2 + B^2}}\times \left(\frac{d}{\rho}\right)\times \sqrt{1 + A^2 + B^2}\)
\[
= \left(\frac {h}{d}\right) \times \left(\frac {d}{\rho}\right) \times \sqrt {1 + A ^ {2} + B ^ {2}},
\]
因此,若能证得当 \(\rho\) 充分小时, \(d/\rho\) 为一有界量,则有 \(\lim_{\rho\to0}\) ratio = 0.
第三步 先证 \(\frac{|\Delta z|}{\rho}\) 是有界量:由 \(|a| - |b| \leq |a - b|\) 可推得 \(|\Delta z| - |A||\Delta x| - |B||\Delta y|\)
\[
< \frac {1}{2} \sqrt {\rho^ {2} + \Delta z ^ {2}} \leq \frac {1}{2} (\rho + | \Delta z |),
\]
故有
\[
\begin{array}{l} \frac {1}{2} | \Delta z | < | A | | \Delta x | + | B | | \Delta y | + \frac {1}{2} \rho , \\ \frac {| \Delta z |}{\rho} < 2 \left(| A | \frac {| \Delta x |}{\rho} + | B | \frac {| \Delta y |}{\rho}\right) + 1 \\ \leq 2 (| A | + | B |) + 1. \\ \end{array}
\]
第四步 再证 \(\frac{d}{\rho}\) 是有界量:由上式进一步可得
\[
\begin{array}{l} \frac {d}{\rho} = \frac {\sqrt {\rho^ {2} + \Delta z ^ {2}}}{\rho} = \sqrt {1 + \left(\frac {\Delta z}{\rho}\right) ^ {2}} \\ \leq 1 + \frac {\Delta z}{\rho} < 2 (| A | + | B | + 1). \\ \end{array}
\]
根据第二步的分析,这就证得在点 \(P_{0}(x_{0}, y_{0})\) 可微.
定理 17.4 说明:函数在点 \(P_{0}(x_{0},y_{0})\) 可微,则曲面 \(z=f(x,y)\) 在点 \(P(x_{0},y_{0},z_{0})\) 处的切平面方程为
\[
z - z _ {0} = f _ {x} \left(x _ {0}, y _ {0}\right) \left(x - x _ {0}\right) + f _ {y} \left(x _ {0}, y _ {0}\right) \left(y - y _ {0}\right). \tag {13}
\]
过切点 \(P\) 与切平面垂直的直线称为曲面在点 \(P\) 的法线。由切平面方程知道,法向量为
\[
\vec {n} = \pm \left(f _ {x} \left(x _ {0}, y _ {0}\right), f _ {y} \left(x _ {0}, y _ {0}\right), - 1\right),
\]
于是过切点 \(P\) 的法线方程为
\[
\frac {x - x _ {0}}{f _ {x} \left(x _ {0} , y _ {0}\right)} = \frac {y - y _ {0}}{f _ {y} \left(x _ {0} , y _ {0}\right)} = \frac {z - z _ {0}}{- 1}. \tag {14}
\]
二元函数全微分的几何意义:当自变量 \((x, y)\) 由 \((x_0, y_0)\) 变为 \((x_0 + \Delta x, y_0 + \Delta y)\) 时,函数 \(z = f(x, y)\) 的增量 \(\Delta z\) 是 \(z\) 轴方向上的一段 \(NQ\) ; 而在点 \((x_0, y_0)\) 的全微分
\[
\mathrm{d} z = f _ {x} \left(x _ {0}, y _ {0}\right) \Delta x + f _ {y} \left(x _ {0}, y _ {0}\right) \Delta y
\]
则是切平面 \(PM_{1}MM_{2}\) 上相应的那一段增量 NM.
于是,\(\Delta z\) 与 \(\mathrm{d}z\) 之差是 \(MQ\) 那一段,它的长度将随着 \(\rho \rightarrow 0\) 而趋于零,而且是较 \(\rho\) 高阶的无穷小量.

例 (6)
试求抛物面 \(z = ax^{2} + by^{2}\) 在点 \(P(x_0,y_0,z_0)\) 处的切平面方程与法线方程
解 \(f_{x}(x_{0},y_{0})=2ax_{0},f_{y}(x_{0},y_{0})=2by_{0}\) ,点 P 处的切平面方程为
\[
z - z _ {0} = 2 a x _ {0} (x - x _ {0}) + 2 b y _ {0} (y - y _ {0}).
\]
又因 \(z_0 = ax_0^2 + by_0^2\) ,所以它可化简为
\[
2 a x _ {0} x + 2 b y _ {0} y - z - z _ {0} = 0.
\]
由公式 (14), 在点 \(M\) 处的法线方程为
\[
{\frac {x - x _ {0}}{2 a x _ {0}}} = {\frac {y - y _ {0}}{2 b y _ {0}}} = {\frac {z - z _ {0}}{- 1}}.
\]
用近似公式 (3)
\[
f (x, y) \approx f \left(x _ {0}, y _ {0}\right) + f _ {x} \left(x _ {0}, y _ {0}\right) \Delta x + f _ {y} \left(x _ {0}, y _ {0}\right) \Delta y
\]
进行近似计算的例
例 (7)
求 \(1.08^{3.96}\) 的近似值.
解 设 \(f(x,y)=x^{y}\) ,并令 \(x_{0}=1, y_{0}=4, \Delta x=0.08, \Delta y=-0.04\) 。由公式 (3),有
\[
\begin{array}{l} 1. 0 8 ^ {3. 9 6} = f \left(x _ {0} + \Delta x, y _ {0} + \Delta y\right) \\ \approx f (1, 4) + f _ {x} (1, 4) \Delta x + f _ {y} (1, 4) \Delta y \\ = 1 + 4 \times 0. 0 8 + 1 ^ {4} \times \ln 1 \times (- 0. 0 4) = 1. 3 2. \\ \end{array}
\]
例 (8)
应用公式 \(S = \frac{1}{2} ab \sin C\) 计算某三角形的面积,现测得 a = 12.50, b = 8.30, \(C = 30^{\circ}\) . 若测量 a, b 的误差为 \(\pm 0.01\) , 测量 C 的误差为 \(\pm 0.1^{\circ}\) ,
试求用此公式计算三角形面积时的绝对误差限和相对误差限.
解 依题意,测量 a, b, C 的绝对误差限分别为
\[
| \Delta a | = 0. 0 1, | \Delta b | = 0. 0 1, | \Delta C | = 0. 1 ^ {\circ} = \frac {\pi}{1 8 0 0}.
\]
由于
\[
\begin{array}{l} {| \Delta S |} \approx {| \mathbf {d} S | = \left| \frac {\partial S}{\partial a} \Delta a + \frac {\partial S}{\partial b} \Delta b + \frac {\partial S}{\partial C} \Delta C \right|} \\ \leq \left| \frac {\partial S}{\partial \boldsymbol {a}} \right| | \Delta a | + \left| \frac {\partial S}{\partial b} \right| | \Delta b | + \left| \frac {\partial S}{\partial C} \right| | \Delta C | \\ { = } { \frac { 1 } { 2 } | b \sin C | \cdot | \Delta a | + \frac { 1 } { 2 } | a \sin C | \cdot | \Delta b | + \frac { 1 } { 2 } | a b \cos C | \cdot | \Delta C | , } \\ \end{array}
\]
因此将各数据代入上式,即得 S 的绝对误差限为
\[
| \Delta S | \approx 0. 1 3.
\]
又因
\[
S = \frac {1}{2} a b \sin C = \frac {1}{2} \times 1 2. 5 0 \times 8. 3 0 \times \frac {1}{2} \approx 2 5. 9 4,
\]
所以 S 的相对误差限为
\[
\left| \frac {\Delta S}{S} \right| \approx \frac {0 . 1 3}{25. 9 4} \approx 0. 5
\]
◎13. 计算近似值:
(1) \(1.002 \times 2.003^{2} \times 3.004^{3}\) ; (2) \(\sin 29^{\circ} \times \tan 46^{\circ}\) .
分析 利用近似公式 \(\Delta f \approx \mathrm{d}f\) ,为了使公式有效并易于计算,应适当选取 \(P_0\) ,使 \(P_0\) 处的函数值和偏导数值都易于计算。并要使 \(|\Delta x|\) 、 \(|\Delta y|\) 和 \(|\Delta z|\) 尽量小,一般要比 1 小得多,否则误差较大。
解 (1)选函数 \(f(x,y,z)=xy^{2}z^{3}, P_{0}(x_{0},y_{0},z_{0})=(1,2,3), \Delta x=0.002, \Delta y=0.003, \Delta z=0.004.\) 于是
\[
\begin{array}{l} f _ {x} (1, 2, 3) = y ^ {2} x ^ {3} \mid_ {(1, 2, 3)} = 1 0 8, f _ {y} (1, 2, 3) = 2 x y z ^ {3} \mid_ {(1, 2, 3)} = 1 0 8, f _ {z} (1, 2, 3) = 3 x y ^ {2} z ^ {2} \\ \mid_ {(1, 2, 3)} = 1 0 8 \end{array}
\]
故 \(f(1.002,2.003,3.004)\)
\[
\begin{array}{l} \approx f (1, 2, 3) + f _ {x} (1, 2, 3) \Delta x + f _ {y} (1, 2, 3) \Delta y + f _ {z} (1, 2, 3) \Delta z \\ = 1 0 8 + 1 0 8 \times 0. 0 0 2 + 1 0 8 \times 0. 0 0 3 + 1 0 8 \times 0. 0 0 4 \\ = 1 0 8. 9 7 2 \\ \end{array}
\]
即 \(1.002 \times 2.003^{2} \times 3.004^{3} = 108.972\)
总结
- 已知函数的连续性、偏导数的存在性、可微性和偏导数的连续性之间有如下关系:

试举出能分别满足如下要求的函数 \(f(x,y)\)
2. 可微性定义中,(1) 式与 (4) 式为何是等价的?
\[
\Delta z = f \left(x _ {0} + \Delta x, y _ {0} + \Delta y\right) - f \left(x _ {0}, y _ {0}\right) = A \Delta x + B \Delta y + o (\rho), \tag {1}
\]
\[
\Delta z = A \Delta x + B \Delta y + \alpha \Delta x + \beta \Delta y, \tag {4}
\]
作业
P. 120
10.
13
重要内容回顾
补充例题
例 1 讨论函数 \(f(x,y)=\left\{\begin{aligned}&\frac{1-\mathrm{e}^{x(x^{2}+y^{2})}}{x^{2}+y^{2}},&x^{2}+y^{2}\neq0,\\ &0,&x^{2}+y^{2}=0\end{aligned}\right.\)
在 \((0,0)\) 处的连续性和可微性.
解 (1) 连续性.
当 x=0 时,显然有 \(\lim_{(x,y)\to(0,0)}f(x,y)=0=f(0,0)\) ;
当 \(x \neq 0\) 时,有 \(\lim_{(x,y) \to (0,0)} f(x,y) = \lim_{(x,y) \to (0,0)} \frac{1 - e^{x(x^2 + y^2)}}{x^2 + y^2}\)
\[
= \lim _ {(x, y) \rightarrow (0, 0)} \frac {1 - \mathrm{e} ^ {x \left(x ^ {2} + y ^ {2}\right)}}{x \left(x ^ {2} + y ^ {2}\right)} \cdot x = 0
\]
\[
\mathrm{e} ^ {x} - 1 \sim x (x \rightarrow 0).
\]
(2) 可微性。容易计算出:
\[
f _ {x} (0, 0) = \lim _ {x \to 0} \frac {f (x , 0) - f (0 , 0)}{x} = \lim _ {x \to 0} \frac {1 - \mathrm{e} ^ {- x ^ {3}}}{x ^ {3}} = - 1;
\]
\[
f _ {y} (0, 0) = \lim _ {x \rightarrow 0} \frac {f (0 , y) - f (0 , 0)}{y} = \lim _ {x \rightarrow 0} \frac {0}{y} = 0.
\]
于是
\[
\begin{array}{l} \lim_{\substack{x\to 0\\ y\to 0}}\frac{f(x,y) - f(0,0) - f_{x}(0,0)x - f_{y}(0,0)y}{\sqrt{x^{2} + y^{2}}} \\ = \lim_{\substack{x\to 0\\ y\to 0}}\frac{\frac{1 - \mathrm{e}^{x(x^{2} + y^{2})}}{x^{2} + y^{2}} + x}{\sqrt{x^{2} + y^{2}}} = \lim_{\substack{x\to 0\\ y\to 0}}\frac{1 + x(x^{2} + y^{2}) - \mathrm{e}^{x(x^{2} + y^{2})}}{(x^{2} + y^{2})^{\frac{3}{2}}} \\ \end{array}
\]
\[
= \lim_{\substack{x\to 0\\ y\to 0}}\frac{1 + x(x^{2} + y^{2}) - (1 + x(x^{2} + y^{2}) + \frac{1}{2}x^{2}(x^{2} + y^{2})^{2}\mathrm{e}^{\theta x(x^{2} + y^{2})})}{(x^{2} + y^{2})^{\frac{1}{2}}} \\ \theta \in (0,1),
\]
\[
= \lim_{\substack{x\to 0\\ y\to 0}}\frac{-\frac{1}{2}x^{2}(x^{2} + y^{2})^{2}\mathrm{e}^{\theta x(x^{2} + y^{2})}}{(x^{2} + y^{2})^{\frac{3}{2}}}
\]
\[
= \lim_ {\substack {x \rightarrow 0\\y \rightarrow 0}} - \frac {1}{2} x ^ {2} \left(x ^ {2} + y ^ {2}\right) ^ {\frac {1}{2}} \mathrm{e} ^ {\theta x \left(x ^ {2} + y ^ {2}\right)}
\]
\[
\mathrm{e} ^ {x} = 1 + x + \frac {\mathrm{e} ^ {\theta_ {X}}}{2} x ^ {2}
\]
\[
\lim_{\substack{x\to 0\\ y\to 0}}\frac{1 + x(x^{2} + y^{2}) - \mathrm{e}^{x(x^{2} + y^{2})}}{(x^{2} + y^{2})^{\frac{3}{2}}}
\]
例 2 求函数 \(u = \sqrt{x^2 + y^2 + z^2}\) 在 (1, -1, 1) 处的全微分。解 先求 (1, -1, 1) 处的偏导数:
\[
\begin{array}{l} \left. \frac {\partial u}{\partial x} \right| _ {(1, - 1, 1)} = \left. \frac {x}{\sqrt {x ^ {2} + y ^ {2} + z ^ {2}}} \right| _ {(1, - 1, 1)} = \frac {1}{\sqrt {3}}; \\ \left. \frac {\partial u}{\partial y} \right| _ {(1, - 1, 1)} = \left. \frac {y}{\sqrt {x ^ {2} + y ^ {2} + z ^ {2}}} \right| _ {(1, - 1, 1)} = - \frac {1}{\sqrt {3}}; \\ \left. \frac {\partial u}{\partial z} \right| _ {(1, - 1, 1)} = \frac {z}{\sqrt {x ^ {2} + y ^ {2} + z ^ {2}}} \Bigg | _ {(1, - 1, 1)} = \frac {1}{\sqrt {3}}. \\ \end{array}
\]
例 3 求曲面 \(z = x^{2} + y^{2} - 2xy - x + 3y + 4\) 在 \((2, -3, 18)\) 处的切平面方程和法线方程.
解 因为 \(\left.\frac{\partial z}{\partial x}\right|_{(2,-3,18)}=2x-2y-1\big|_{(2,-3,18)}=9;\)
\[
\left. \frac {\partial z}{\partial y} \right| _ {(2, - 3, 1 8)} = 2 y - 2 x + 3 | _ {(2, - 3, 1 8)} = - 7.
\]
所以,曲面在 (2,-3,18) 处的法向量为 \(n=(9,-7,-1)\) .
由此可得曲面在 (2,-3,18) 处切平面方程和法线方程:
\[
9 (x - 2) - 7 (y + 3) - (z - 1 8) = 0, \quad 9 x - 7 y - z = 2 7;
\]
\[
\frac {x - 2}{9} = \frac {y + 3}{- 7} = \frac {z - 1 8}{- 1}.
\]
例 4 设 \(f_{x}(x,y)\) 存在,\(f_{y}(x,y)\) 在点 \((x_0,y_0)\) 连续,证明 \(f(x,y)\) 在点 \((x_0,y_0)\) 可微.
解 欲证
\[
\lim _ {\Delta x \rightarrow 0 \atop \Delta y \rightarrow 0} \frac {f (x _ {0} + \Delta x , y _ {0} + \Delta y) - f (x _ {0} , y _ {0}) - f _ {x} (x _ {0} , y _ {0}) \Delta x - f _ {y} (x _ {0} , y _ {0}) \Delta y}{\sqrt {\Delta x ^ {2} + \Delta y ^ {2}}} = 0
\]
\[
\begin{array}{l} \because f \left(x _ {0} + \Delta x, y _ {0} + \Delta y\right) - f \left(x _ {0}, y _ {0}\right) - f _ {x} \left(x _ {0}, y _ {0}\right) \Delta x - f _ {y} \left(x _ {0}, y _ {0}\right) \Delta y \\ = [ f (x _ {0} + \Delta x, y _ {0} + \Delta y) - f (x _ {0} + \Delta x, y _ {0}) ] + f (x _ {0} + \Delta x, y _ {0}) \\ - f \left(x _ {0}, y _ {0}\right) - f _ {x} \left(x _ {0}, y _ {0}\right) \Delta x - f _ {y} \left(x _ {0}, y _ {0}\right) \Delta y \\ = f _ {y} \left(x _ {0} + \Delta x, y _ {0} + \theta \Delta y\right) \Delta y - f _ {y} \left(x _ {0}, y _ {0}\right) \Delta y \\ + f \left(x _ {0} + \Delta x, y _ {0}\right) - f \left(x _ {0}, y _ {0}\right) - f _ {x} \left(x _ {0}, y _ {0}\right) \Delta x, \quad 0 < \theta < 1, \\ \end{array}
\]
于是
\[
\lim_{\substack{\Delta x\to 0\\ \Delta y\to 0}}\frac{f(x_{0} + \Delta x,y_{0} + \Delta y) - f(x_{0},y_{0}) - f_{x}(x_{0},y_{0})\Delta x - f_{y}(x_{0},y_{0})\Delta y}{\sqrt{\Delta x^{2} + \Delta y^{2}}}
\]
\[
= \lim_ {\substack {\Delta x \rightarrow 0\\\Delta y \rightarrow 0}} \frac {\Delta y}{\sqrt {\Delta x ^ {2} + \Delta y ^ {2}}} \frac {[ f _ {y} (x _ {0} + \Delta x , y _ {0} + \theta \Delta y) - f _ {y} (x _ {0}, y _ {0}) ]}{\text{极限为} 0} (0 < \theta < 1)
\]
\[
+ \lim_{\substack{\Delta x\to 0\\ \Delta y\to 0}}\frac{\Delta x}{\sqrt{\Delta x^{2} + \Delta y^{2}}} \left[ \frac{f(x_{0} + \Delta x,y_{0}) - f(x_{0},y_{0})}{\Delta x} -f_{x}(x_{0},y_{0}) \right] \text{有界} \text{极限为} f_{x}(x_{0},y_{0})
\]
\[
= 0.
\]
数学分析
2025-2026 (2)
Ch. 17b
沈超敏
计算机科学与技术学院
第十七章 多元函数微分学
§2 复合函数微分法
复合函数的求导法则
设函数
\[
x = \varphi (s, t) \text {与} y = \psi (s, t) \tag {1}
\]
定义在 st 平面的区域 D 上,
函数
\[
z = f (x, y) \tag {2}
\]
定义在 \(xy\) 平面的区域 \(\bar{D}\) 上。若
\[
\{(x, y) \mid x = \varphi (s, t), y = \psi (s, t), (s, t) \in D \} \subset \bar {D},
\]
则可构成复合函数:
\[
z = F (s, t) = f (\varphi (s, t), \psi (s, t)), \quad (s, t) \in D. \tag {3}
\]
其中 (1) 为内函数,(2) 为外函数,\((x, y)\) 为中间变量,\((s, t)\) 为自变量.
下面将讨论复合函数 F 的可微性.
定理 (17.5)
若函数 \(x = \varphi(s, t)\) , \(y = \psi(s, t)\) 在点 \((s, t) \in D\) 可微,\(z = f(x, y)\) 在点 \((x, y) = (\varphi(s, t), \psi(s, t))\) 可微,则复合函数 \(z = f(\varphi(s, t), \psi(s, t))\) 在点 \((s, t)\) 可微,且关于 \(s\) 与 \(t\) 的偏导数分别为
\[
\begin{array}{l} \left. \frac {\partial z}{\partial s} \right| _ {(s, t)} = \left. \frac {\partial z}{\partial x} \right| _ {(x, y)} \cdot \left. \frac {\partial x}{\partial s} \right| _ {(s, t)} + \left. \frac {\partial z}{\partial y} \right| _ {(x, y)} \cdot \left. \frac {\partial y}{\partial s} \right| _ {(s, t)}, \\ \left. \frac {\partial z}{\partial t} \right| _ {(s, t)} = \left. \frac {\partial z}{\partial x} \right| _ {(x, y)} \cdot \left. \frac {\partial x}{\partial t} \right| _ {(s, t)} + \left. \frac {\partial z}{\partial y} \right| _ {(x, y)} \cdot \left. \frac {\partial y}{\partial t} \right| _ {(s, t)}. \\ \end{array}
\]
公式(4)也称为链式法则.
另一视角: Chain rule 看成向量的内积
\[
\begin{array}{rl}&{\frac{d}{dt}\underbrace{f(x(t),y(t))}=\frac{g_{f}}{g_{x}}\cdot\frac{dx}{dt}+\frac{g_{f}}{g_{y}}\cdot\frac{dy}{dt}}\\&{\vec{v}(t)=\begin{bmatrix}x(t)\\y(t)\end{bmatrix}}\\&{\frac{d\vec{v}}{dt}=\begin{bmatrix}dx/dt\\dy/dt\end{bmatrix}}\\&{\quad=\boxed{\nabla f\cdot\vec{v}^{\prime}(t)}\quad}\\&{\quad=\boxed{\nabla f(\vec{v}(t))\cdot\vec{v}^{\prime}(t)}}\end{array}
\]
证 由假设 \(x = \varphi(s, t), y = \psi(s, t)\) 在点 \((s, t)\) 可微,于是
\[
\Delta x = \frac {\partial x}{\partial s} \Delta s + \frac {\partial x}{\partial t} \Delta t + \alpha_ {1} \Delta s + \beta_ {1} \Delta t, \tag {5}
\]
\[
\Delta y = \frac {\partial y}{\partial s} \Delta s + \frac {\partial y}{\partial t} \Delta t + \alpha_ {2} \Delta s + \beta_ {2} \Delta t, \tag {6}
\]
其中 \((\Delta s, \Delta t) \rightarrow (0, 0)\) 时 \((\alpha_{1}, \beta_{1}, \alpha_{2}, \beta_{2}) \rightarrow (0, 0, 0, 0)\) .
又由 \(z = f(x, y)\) 在点 \((x, y)\) 可微,故有
\[
\boxed {\Delta z = \frac {\partial z}{\partial x} \Delta x + \frac {\partial z}{\partial y} \Delta y + \alpha \Delta x + \beta \Delta y,} \tag {7}
\]
其中 \((\Delta x, \Delta y) \rightarrow (0, 0)\) 时, \((\alpha, \beta) \rightarrow (0, 0)\) ,并可补充定义:当 \(\Delta x = \Delta y = 0\) 时, \(\alpha = \beta = 0\) .
现把 (5), (6) 两式代入 (7) 式,得到
\[
\begin{array}{l} \Delta z = \left(\frac {\partial z}{\partial x} + \alpha\right) \Delta x + \left(\frac {\partial z}{\partial y} + \beta\right) \Delta y \\ = \left(\frac {\partial z}{\partial x} + \alpha\right) \cdot \left(\frac {\partial x}{\partial s} \Delta s + \frac {\partial x}{\partial t} \Delta t + \alpha_ {1} \Delta s + \beta_ {1} \Delta t\right) \\ + \left(\frac {\partial z}{\partial y} + \beta\right) \cdot \left(\frac {\partial y}{\partial s} \Delta s + \frac {\partial y}{\partial t} \Delta t + \alpha_ {2} \Delta s + \beta_ {2} \Delta t\right). \\ \end{array}
\]
整理后又得
\[
\Delta z = \left(\frac {\partial z}{\partial x} \cdot \frac {\partial x}{\partial s} + \frac {\partial z}{\partial y} \cdot \frac {\partial y}{\partial s}\right) \Delta s + \left(\frac {\partial z}{\partial x} \cdot \frac {\partial x}{\partial t} + \frac {\partial z}{\partial y} \cdot \frac {\partial y}{\partial t}\right) \Delta t + \bar {\alpha} \Delta s + \bar {\beta} \Delta t, \tag {8}
\]
其中
\[
\bar {\alpha} = \frac {\partial z}{\partial x} \alpha_ {1} + \frac {\partial z}{\partial y} \alpha_ {2} + \frac {\partial x}{\partial s} \alpha + \frac {\partial y}{\partial s} \beta + \alpha \alpha_ {1} + \beta \alpha_ {2}, \tag {9}
\]
\[
\bar {\beta} = \frac {\partial z}{\partial x} \beta_ {1} + \frac {\partial z}{\partial y} \beta_ {2} + \frac {\partial x}{\partial t} \alpha + \frac {\partial y}{\partial t} \beta + \alpha \beta_ {1} + \beta \beta_ {2} \tag {10}
\]
由于 \(\varphi(s, t), \psi(s, t)\) 在点 \((s, t)\) 可微,因此在点 \((s, t)\) 都连续,即当 \((\Delta s, \Delta t) \to (0, 0)\) 时, \((\Delta x, \Delta y) \to (0, 0)\) . 从而 \((\alpha, \beta) \to (0, 0)\) ,以及 \((\alpha_1, \beta_1, \alpha_2, \beta_2) \to (0, 0, 0, 0)\) .
于是在 (9), (10) 两式中,当 \((\Delta s, \Delta t) \to (0, 0)\) 时,有 \((\bar{\alpha}, \bar{\beta}) \to (0, 0)\) .
故由 (8) 式推知复合函数 (3) 可微,并求得 \(z\) 关于 \(s\) 和 \(t\) 的偏导数公式 (4).
注 如果只是求复合函数 \(f(\varphi(s, t), \psi(s, t))\) 关于 \(s\) 或 \(t\) 的偏导数,则上述定理中内函数 \(x = \varphi(s, t), y = \psi(s, t)\) 只须具有关于 \(s\) 或 \(t\) 的偏导数就够了.
因为以 \(\Delta s\) 或 \(\Delta t\) 除以
\[
\Delta z = \frac {\partial z}{\partial x} \Delta x + \frac {\partial z}{\partial y} \Delta y + \alpha \Delta x + \beta \Delta y, \tag {7}
\]
两边,然后让 \(\Delta s \to 0\) 或 \(\Delta t \to 0\) , 也能得到相应的结果.
但是外函数 f 的可微性假设是不能轻易省略的,
否则上述复合求导公式就不一定成立。例如
例 (反例)
\[
f (x, y) = \left\{ \begin{array}{c l} \frac {x ^ {2} y}{x ^ {2} + y ^ {2}}, & x ^ {2} + y ^ {2} \neq 0 \\ 0, & x ^ {2} + y ^ {2} = 0 \end{array} \right.
\]
由 §1 习题 6 已知 \(f_{x}(0,0)=f_{y}(0,0)=0\) ,但 \(f(x,y)\) 在点 \((0,0)\) 不可微.
若以 \(f(x,y)\) 为外函数,x=t, y=t 为内函数,则得到以 t 为自变量的复合函数
\[
z = F (t) = f (t, t) = t / 2, \quad \therefore \frac {\mathrm{d} z}{\mathrm{d} t} \equiv \frac {1}{2}.
\]
若形式地使用法则 (4), 将得出错误结论:
\[
\begin{array}{l} \left. \frac {\mathrm{d} z}{\mathrm{d} t} \right| _ {t = 0} = \left. \frac {\partial z}{\partial x} \right| _ {(0, 0)} \cdot \left. \frac {\mathrm{d} x}{\mathrm{d} t} \right| _ {t = 0} + \left. \frac {\partial z}{\partial y} \right| _ {(0, 0)} \cdot \left. \frac {\mathrm{d} y}{\mathrm{d} t} \right| _ {t = 0} \\ = 0 \times 1 + 0 \times 1 = 0. \\ \end{array}
\]
这说明:在使用链式法则时,必须注意外函数可微这个条件.
一般地,若 \(f(u_{1},\cdots,u_{m})\) 在点 \((u_{1},\cdots,u_{m})\) 可微,函数组
\[
u _ {k} = g _ {k} \left(x _ {1}, \dots , x _ {n}\right) (k = 1, 2, \dots , m)
\]
在点 \((x_{1},\cdots,x_{n})\) 具有对于 \(x_{i}(i=1,2,\cdots,n)\) 的偏导数,则复合函数
\[
f \left(g _ {1} \left(x _ {1}, \dots , x _ {n}\right), g _ {2} \left(x _ {1}, \dots , x _ {n}\right), \dots , g _ {m} \left(x _ {1}, \dots , x _ {n}\right)\right)
\]
关于自变量 \(x_{i}(i=1,2,\cdots,n)\) 的偏导数为
\[
\boxed {\frac {\partial f}{\partial x _ {i}} = \sum_ {k = 1} ^ {m} \frac {\partial f}{\partial u _ {k}} \cdot \frac {\partial u _ {k}}{\partial x _ {i}} (i = 1, 2, \dots , n)}
\]
多元函数的复合求导一般比较复杂,必须要区分哪些是中间变量,哪些是自变量,这样才能正确使用链式法则求出正确的结果.
为了便于记忆,可以按照各变量间的复合关系,画如图那样的树形图。首先从因变量 \(z\) 向中间变量 \(x, y\) 画两个分枝,然后再分别从中间变量 \(x, y\) 向自变量 \(s, t\) 画分枝,并在每个分枝上写上对应的偏导数.

例如,求 \(\frac{\partial z}{\partial s}\) 时,只要把从 \(z\) 到 \(s\) 的每条路径上的各个偏导数相乘,然后再将这些乘积相加即得
\[
{\frac {\partial z}{\partial s}} = {\frac {\partial z}{\partial x}} {\frac {\partial x}{\partial s}} + {\frac {\partial z}{\partial y}} {\frac {\partial y}{\partial s}}
\]
类似地,考察从 z 到 t 的路径,得
\[
{\frac {\partial z}{\partial t}} = {\frac {\partial z}{\partial x}} {\frac {\partial x}{\partial t}} + {\frac {\partial z}{\partial y}} {\frac {\partial y}{\partial t}}
\]

例 (1)
设 \(z = \ln (u^2 + v)\) ,而 \(u = \mathrm{e}^{x + y^2}, v = x^2 + y\) ,试求 \(\frac{\partial z}{\partial x}\) 与 \(\frac{\partial z}{\partial y}\) .
解 所讨论的复合函数以 \((u, v)\) 为中间变量,\((x, y)\) 为自变量,并满足定理 17.5 的条件。因此由
\[
\frac {\partial z}{\partial u} = \frac {2 u}{u ^ {2} + v}, \quad \frac {\partial z}{\partial v} = \frac {1}{u ^ {2} + v},
\]
\[
\frac {\partial u}{\partial x} = \mathrm{e} ^ {x + y ^ {2}}, \quad \frac {\partial u}{\partial y} = 2 y \mathrm{e} ^ {x + y ^ {2}}, \quad \frac {\partial v}{\partial x} = 2 x, \quad \frac {\partial v}{\partial y} = 1,
\]
根据链式法则得到
\[
\begin{array}{l} \frac {\partial z}{\partial x} = \frac {\partial z}{\partial u} \cdot \frac {\partial u}{\partial x} + \frac {\partial z}{\partial v} \cdot \frac {\partial v}{\partial x} \\ = \frac {2 u}{u ^ {2} + v} \cdot \mathrm{e} ^ {x + y ^ {2}} + \frac {1}{u ^ {2} + v} \cdot 2 x \\ = \frac {2}{u ^ {2} + v} \cdot \left(u \mathrm{e} ^ {x + y ^ {2}} + x\right), \\ \frac {\partial z}{\partial y} = \frac {\partial z}{\partial u} \cdot \frac {\partial u}{\partial y} + \frac {\partial z}{\partial v} \cdot \frac {\partial v}{\partial y} \\ = \frac {1}{u ^ {2} + v} \cdot \left(4 u y \mathrm{e} ^ {x + y ^ {2}} + 1\right). \\ \end{array}
\]
例 (2)
设 \(u = u(x,y)\) 可微,在极坐标变换 \(x = r\cos \theta ,\pmb {y} = r\sin \theta\) 之下,证明:
\[
\left(\frac {\partial u}{\partial r}\right) ^ {2} + \frac {1}{r ^ {2}} \left(\frac {\partial u}{\partial \theta}\right) ^ {2} = \left(\frac {\partial u}{\partial x}\right) ^ {2} + \left(\frac {\partial u}{\partial y}\right) ^ {2}.
\]
证 把 u 看作 \(r, \theta\) 的复合函数 \(u = u(r \cos \theta, r \sin \theta)\) ,因此有
\[
\begin{array}{l} \frac {\partial u}{\partial r} = \frac {\partial u}{\partial x} \cdot \frac {\partial x}{\partial r} + \frac {\partial u}{\partial y} \cdot \frac {\partial y}{\partial r} = \frac {\partial u}{\partial x} \cos \theta + \frac {\partial u}{\partial y} \sin \theta , \\ {\frac {\partial u}{\partial \theta}} = {\frac {\partial u}{\partial x}} \cdot {\frac {\partial x}{\partial \theta}} + {\frac {\partial u}{\partial y}} \cdot {\frac {\partial y}{\partial \theta}} = {\frac {\partial u}{\partial x}} (- r \sin \theta) + {\frac {\partial u}{\partial y}} r \cos \theta . \\ \end{array}
\]
于是
\[
\begin{array}{l} \left(\frac {\partial u}{\partial r}\right) ^ {2} + \frac {1}{r ^ {2}} \left(\frac {\partial u}{\partial \theta}\right) ^ {2} = \left(\frac {\partial u}{\partial x} \cos \theta + \frac {\partial u}{\partial y} \sin \theta\right) ^ {2} \\ + \frac {1}{r ^ {2}} \left(- \frac {\partial u}{\partial x} r \sin \theta + \frac {\partial u}{\partial y} r \cos \theta\right) ^ {2} \\ = \left(\frac {\partial u}{\partial x}\right) ^ {2} + \left(\frac {\partial u}{\partial y}\right) ^ {2} \\ \end{array}
\]
例 (3)
设 \(z = uv + \sin t\) ,其中 \(u = \mathrm{e}^t, v = \cos t,\) 求 \(\frac{\mathrm{d}z}{\mathrm{d}t}\)
解 复合后仅是自变量 \(t\) 的一元函数。于是
\[
\begin{array}{l} \frac {\mathrm{d} z}{\mathrm{d} t} = \frac {\partial z}{\partial u} \frac {\mathrm{d} u}{\mathrm{d} t} + \frac {\partial z}{\partial v} \frac {\mathrm{d} v}{\mathrm{d} t} + \frac {\partial z}{\partial t} \frac {\mathrm{d} t}{\mathrm{d} t} \\ = v \mathrm{e} ^ {t} + u (- \sin t) + \cos t \\ = \mathrm{e} ^ {t} (\cos t - \sin t) + \cos t. \\ \end{array}
\]
注 上面第一个等式中,左边的 \(\frac{dz}{dt}\) 是作为一元函数的复合函数对 t 求导数(又称 “全导数”);右边的 \(\frac{\partial z}{\partial t}\) 是外函数(是 u, v, t 的三元函数)对 t 求偏导.

例 (4)
用多元复合微分法计算下列一元函数的导数:
解 (1) 令 \(y = u^v, v = w^x, u = x, w = x\) ,从而有
\[
\begin{array}{l} \frac {\mathrm{d} y}{\mathrm{d} x} = \frac {\partial y}{\partial u} \cdot \frac {\mathrm{d} u}{\mathrm{d} x} + \frac {\partial y}{\partial v} \left[ \frac {\partial v}{\partial w} \cdot \frac {\mathrm{d} w}{\mathrm{d} x} + \frac {\partial v}{\partial x} \right] \\ = v u ^ {v - 1} + u ^ {v} \ln u [ x w ^ {x - 1} + w ^ {x} \ln w ] \\ = x ^ {x} \cdot x ^ {x ^ {x} - 1} + x ^ {x ^ {x}} \ln x [ x \cdot x ^ {x - 1} + x ^ {x} \ln x ] \\ = x ^ {x} \cdot x ^ {x ^ {x}} \left[ \frac {1}{x} + \ln x + (\ln x) ^ {2} \right] \\ \end{array}
\]
\[
y = \frac {(1 + x ^ {2}) \ln x}{\sin x + \cos x}. \tag {2}
\]
解 令 \(y = \frac{vw}{u}, u = \sin x + \cos x, v = 1 + x^2, w = \ln x\) ,则有
\[
\begin{array}{l} \frac {\mathrm{d} y}{\mathrm{d} x} = \frac {\partial y}{\partial u} \cdot \frac {\mathrm{d} u}{\mathrm{d} x} + \frac {\partial y}{\partial v} \cdot \frac {\mathrm{d} v}{\mathrm{d} x} + \frac {\partial y}{\partial w} \cdot \frac {\mathrm{d} w}{\mathrm{d} x} \\ = \frac {- v w}{u ^ {2}} (\cos x - \sin x) + \frac {w}{u} \cdot 2 x + \frac {v}{u} \cdot \frac {1}{x} \\ = \frac {1}{(\sin x + \cos x) ^ {2}} \left[ (\sin x - \cos x) (1 + x ^ {2}) \ln x \right. \\ \left. + (\sin x + \cos x) \left(2 x \ln x + \frac {1 + x ^ {2}}{x}\right) \right]. \\ \end{array}
\]
由此可见,以前用 “对数求导法” 求一元函数导数的问题,如今可用多元复合函数的链式法则来计算.
例 (5*)
设 \(f(x,y)\) 为可微函数,\(f(1,1)=1\) , \(f_{x}(1,1)=a\) , \(f_{y}(1,1)=b\) , \(\varphi(x)=f(x,f(x,f(x,x)))\) , 试求 \(\varphi'(1)\) .
解 令 \(\varphi(x) = f(x, y)\) , \(y = f(x, z)\) , \(z = f(x, u)\) , \(u = x\) , 则有
\[
\begin{array}{l} \varphi^ {\prime} (x) = f _ {x} + f _ {y} \cdot \frac {\mathrm{d} y}{\mathrm{d} x} = f _ {x} + f _ {y} \cdot \left(f _ {x} + f _ {z} \cdot \frac {\mathrm{d} z}{\mathrm{d} x}\right) \\ = f _ {x} + f _ {y} \cdot \left[ f _ {x} + f _ {z} \cdot \left(f _ {x} + f _ {u} \cdot \frac {\mathrm{d} u}{\mathrm{d} x}\right) \right]. \\ \end{array}
\]
由 \(\frac{\mathrm{d}u}{\mathrm{d}x} = 1, f_x(1,1) = a, f_y(1,1) = f_z(1,1) = f_u(1,1) = b,\)
因此 \(\varphi'(1)=a+b[a+b(a+b)]=a+ab+ab^{2}+b^{3}.\)
例 (6)
设 \(u = f(x,y,z),y = \varphi (x,t),t = \psi (x,z)\) 都有一阶连续偏导数,求 \(\frac{\partial u}{\partial x},\frac{\partial u}{\partial z}.\)
解 代入中间变量,得复合函数 \(u = f(x, \varphi(x, \psi(x, z)), z)\) 为 x, z 的函数(见图),得到:
\[
\frac {\partial u}{\partial x} = \frac {\partial f}{\partial x} + \frac {\partial f}{\partial y} \frac {\partial \varphi}{\partial x} + \frac {\partial f}{\partial y} \frac {\partial \varphi}{\partial t} \frac {\partial \psi}{\partial x}
\]
\[
\frac {\partial u}{\partial z} = \frac {\partial f}{\partial y} \frac {\partial \varphi}{\partial t} \frac {\partial \psi}{\partial z}
\]
注意,这里用 \(\frac{\partial u}{\partial x}\) 表示复合函数 \(u = f(x, \varphi(x, \psi(x, z)), z)\) 对 x 的偏导数,而用 \(\frac{\partial f}{\partial x}\) 表示函数 \(u = f(x, y, z)\) 对第一个变量 x 的偏导数,以区别两者的不同。

例 (7)
设在 \(\mathbb{R}^2\) 上的可微函数 \(f\) 满足方程
\[
y f _ {x} (x, y) = x f _ {y} (x, y).
\]
证明:在极坐标系里 f 只是 r 的函数。
证 本题是要证明:经极坐标系变换后,f 满足
\[
\frac {\partial f}{\partial \theta} = 0.
\]
为此设 \(u = f(x, y)\) , \(x = r \cos \theta\) , \(y = r \sin \theta\) ,则得
\[
\begin{array}{l} \frac {\partial u}{\partial \theta} = \frac {\partial u}{\partial x} \frac {\partial x}{\partial \theta} + \frac {\partial u}{\partial y} \frac {\partial y}{\partial \theta} \\ = \frac {\partial u}{\partial x} (- r \sin \theta) + \frac {\partial u}{\partial y} (r \cos \theta) \\ = - y \frac {\partial u}{\partial x} + x \frac {\partial u}{\partial y} \\ = - y \frac {\partial f}{\partial x} + x \frac {\partial f}{\partial y} = 0. \\ \end{array}
\]
从而 f 在 \(R^{2}\) 上的极坐标系里与 \(\theta\) 无关,于是 f 只是 r 的函数。
复合函数的全微分
若以 x, y 为自变量的函数 \(z = f(x, y)\) 可微,其全微分为
\[
\mathrm{d} z = \frac {\partial z}{\partial x} \mathrm{d} x + \frac {\partial z}{\partial y} \mathrm{d} y. \tag {11}
\]
如果 x, y 作为中间变量,又是自变量 s, t 的可微函数
\[
x = \varphi (s, t), \quad y = \psi (s, t),
\]
则由定理 17.5 知道,复合函数 \(f(\varphi(s,t),\psi(s,t))\) 是可微的,其全微分为
\[
\begin{array}{l} \mathrm{d} z = \frac {\partial z}{\partial s} \mathrm{d} s + \frac {\partial z}{\partial t} \mathrm{d} t \\ = \left(\frac {\partial z}{\partial x} \frac {\partial x}{\partial s} + \frac {\partial z}{\partial y} \frac {\partial y}{\partial s}\right) \mathrm{d} s + \left(\frac {\partial z}{\partial x} \frac {\partial x}{\partial t} + \frac {\partial z}{\partial y} \frac {\partial y}{\partial t}\right) \mathrm{d} t \tag {12} \\ = \frac {\partial z}{\partial x} (\frac {\partial x}{\partial s} \mathrm{d} s + \frac {\partial x}{\partial t} \mathrm{d} t) + \frac {\partial z}{\partial y} (\frac {\partial y}{\partial s} \mathrm{d} s + \frac {\partial y}{\partial t} \mathrm{d} t). \\ \end{array}
\]
由于 \(x, y\) 又是 \((s, t)\) 的可微函数,因此同时有
\[
\mathrm{d} x = \frac {\partial x}{\partial s} \mathrm{d} s + \frac {\partial x}{\partial t} \mathrm{d} t, \quad \mathrm{d} y = \frac {\partial y}{\partial s} \mathrm{d} s + \frac {\partial y}{\partial t} \mathrm{d} t \tag {13}
\]
将 (13) 式代入 (12) 式,得到与 (11) 式完全相同的结果,这就是多元函数的一阶(全)微分形式不变性.
\[
\mathrm{d} z = \frac {\partial z}{\partial x} \mathrm{d} x + \frac {\partial z}{\partial y} \mathrm{d} y. \tag {11}
\]
\[
\mathrm{d} x = \frac {\partial x}{\partial s} \mathrm{d} s + \frac {\partial x}{\partial t} \mathrm{d} t, \quad \mathrm{d} y = \frac {\partial y}{\partial s} \mathrm{d} s + \frac {\partial y}{\partial t} \mathrm{d} t \tag {13}
\]
必须指出,在(11)式中
例 (8)
设 \(z = e^{xy}\sin (x + y)\) ,利用微分形式不变性计算 \(\mathrm{dz}\) ,并由此导出 \(\frac{\partial z}{\partial x}\) 与 \(\frac{\partial z}{\partial y}\)
解 令 \(z = e^{u}\sin v, u = xy, v = x + y.\) 由于
\[
\mathrm{d} z = z _ {u} \mathrm{d} u + z _ {v} \mathrm{d} v = e ^ {u} \sin v \mathrm{d} u + e ^ {u} \cos v \mathrm{d} v,
\]
\[
\mathrm{d} u = y \mathrm{d} x + x \mathrm{d} y, \quad \mathrm{d} v = \mathrm{d} x + \mathrm{d} y,
\]
因此
\[
\begin{array}{l} \mathrm{d} z = e ^ {u} \sin v (y \mathrm{d} x + x \mathrm{d} y) + e ^ {u} \cos v (\mathrm{d} x + \mathrm{d} y) \\ = e ^ {x y} [ y \sin (x + y) + \cos (x + y) ] d x + e ^ {x y} [ x \sin (x + y) + \cos (x + y) ] d y, \\ \end{array}
\]
并由此得到
\[
\frac {\partial z}{\partial x} = e ^ {x y} \left[ y \sin (x + y) + \cos (x + y) \right],
\]
\[
\frac {\partial z}{\partial y} = e ^ {x y} \left[ x \sin (x + y) + \cos (x + y) \right],
\]
\[
\mathrm{d} z = e ^ {x y} \left[ y \sin (x + y) + \cos (x + y) \right] \mathrm{d} x + e ^ {x y} \left[ x \sin (x + y) + \cos (x + y) \right] \mathrm{d} y,
\]
复习思考题
- 在一元函数章节里,利用对数求导法曾得到过一个结果:
\[
(x ^ {x}) ^ {\prime} = x ^ {x} (1 + \ln x) = x x ^ {x - 1} + x ^ {x} \ln x.
\]
不难看出等式右边两项恰好是把 \(x^{x}\) 分别看成幂函数与指数函数求导数而得到的。
有人认为这是偶然的巧合,
也有人认为这是必然的结果。
试问哪一种看法是正确的?请说出依据。
复习思考题
- 设由可微的 \(u = u(x, y, t), x = x(s, t), y = y(s, t)\) 得到 \(u = u(x(s, t), y(s, t), t)\) ,它是一个以 \(s, t\) 为自变量的复合函数。考察下面计算复合函数偏导数的一种写法:
\[
{\frac {\partial u}{\partial t}} = {\frac {\partial u}{\partial x}} {\frac {\partial x}{\partial t}} + {\frac {\partial u}{\partial y}} {\frac {\partial y}{\partial t}} + {\frac {\partial u}{\partial t}},
\]
试问这个写法有何不妥?怎样纠正?
复习思考题
- 设由可微的 \(u = u(x, y, t), x = x(s, t), y = y(s, t)\) 得到 \(u = u(x(s, t), y(s, t), t)\) ,它是一个以 \(s, t\) 为自变量的复合函数。考察下面计算复合函数偏导数的一种写法:
\[
{\frac {\partial u}{\partial t}} = {\frac {\partial u}{\partial x}} {\frac {\partial x}{\partial t}} + {\frac {\partial u}{\partial y}} {\frac {\partial y}{\partial t}} + {\frac {\partial u}{\partial t}},
\]
试问这个写法有何不妥?怎样纠正?
\[
\begin{array}{l} \frac {\partial}{\partial t} u (x (s, t), y (s, t), t) \\ = u _ {x} (x (s, t), y (s, t), t) x _ {t} (s, t) + u _ {y} (x (s, t), y (s, t), t) y _ {t} (s, t) + u _ {t} (x (s, t), y (s, t), t). \\ \end{array}
\]
作业
P. 127
1 单、2
数学分析
2025-2026 (2)
Ch. 17C
沈超敏
计算机科学与技术学院
复习

\[
\vec {n} = (- f ^ {\prime} (x), 1)
\]
\[
\vec {t} = (1, f ^ {\prime} (x))
\]

\[
\vec {r}: (x, y) \rightarrow (x, y, f (x, y))
\]
\[
a \text { parametrigation }
\]
\[
\vec {t} _ {1} = \frac {\partial \vec {r}}{\partial x} = (1, 0, \partial_ {x} f)
\]
\[
\vec {t} _ {2} = \frac {\partial \vec {r}}{\partial y} = (0, 1, \partial y f)
\]
\[
\vec {n} = \vec {t} _ {1} \times \vec {t} _ {2} = (- \partial_ {x} f, - \partial_ {y} f, 1)
\]
第十七章 多元函数微分学
§3 方向导数与梯度
方向导数与梯度
定义 (1)
设函数 \(f(x,y,z)\) 在点 \(P_0(x_0,y_0,z_0)\) 的某邻域 \(U(P_0)\subset \mathbb{R}^3\) 内有定义, \(\vec{l}\) 为从点 \(P_{0}\) 出发的射线.任给 \(P(x,y,z)\in \vec{l}\cap U(P_0)\) ,记 \(\rho = |P_0P|\) ,若极限
\[
\lim _ {\rho \to 0 ^ {+}} \frac {\Delta_ {\vec {l}} f}{\rho} = \lim _ {\rho \to 0 ^ {+}} \frac {f (P) - f \left(P _ {0}\right)}{\rho}
\]
存在,则称此极限为函数 \(f\) 在点 \(P_0\) 沿方向 \(\vec{l}\) 的方向导数,记作 \(\left.\frac{\partial f}{\partial\vec{l}}\right|_{P_0}, f_{\vec{l}}(P_0)\) 或 \(f_{\vec{l}}(x_0, y_0, z_0)\) .
不难看出:若 \(f\) 在点 \(P_0\) 存在对 \(x\) 的偏导数,则
1) \(f\) 在点 \(P_0\) 沿 \(x\) 轴正方向的方向导数恰为
\[
f _ {\vec {l}} (P _ {0}) = f _ {x} \left(P _ {0}\right) \quad (\vec {l} = + \overrightarrow {O x})
\]
2)当 \(\vec{l}\) 的方向为 x 轴的负方向时,则有
\[
f _ {\vec {l}} (P _ {0}) = - f _ {x} \left(P _ {0}\right) \quad (\vec {l} = - \overrightarrow {O x});
\]
对于 \(f_{y}\) 与 \(f_{z}\) 也有相应的结论.
定理 (17.6)
若 \(f(x,y,z)\) 在点 \(P_0(x_0,y_0,z_0)\) 可微,则 \(f\) 在点 \(P_0\) 沿任一方向 \(\vec{l}\) 的方向导数都存在,且
\[
f _ {\vec {l}} (P _ {0}) = f _ {x} (P _ {0}) \cos \alpha + f _ {y} (P _ {0}) \cos \beta + f _ {z} (P _ {0}) \cos \gamma , \tag {1}
\]
其中 \(\cos\alpha,\cos\beta,\cos\gamma\) 为 \(\vec{l}\) 的方向余弦.
不难看出:若 \(f\) 在点 \(P_0\) 存在对 \(x\) 的偏导数,则
1) \(f\) 在点 \(P_0\) 沿 \(x\) 轴正方向的方向导数恰为
\[
f _ {\vec {l}} (P _ {0}) = f _ {x} \left(P _ {0}\right) \quad (\vec {l} = + \overrightarrow {O x})
\]
2)当 \(\vec{l}\) 的方向为 x 轴的负方向时,则有
\[
f _ {\vec {l}} (P _ {0}) = - f _ {x} \left(P _ {0}\right) \quad (\vec {l} = - \overrightarrow {O x});
\]
对于 \(f_{y}\) 与 \(f_{z}\) 也有相应的结论.
定理 (17.6)
若 \(f(x,y,z)\) 在点 \(P_0(x_0,y_0,z_0)\) 可微,则 \(f\) 在点 \(P_0\) 沿任一方向 \(\vec{l}\) 的方向导数都存在,且
\[
f _ {\vec {l}} (P _ {0}) = f _ {x} \left(P _ {0}\right) \cos \alpha + f _ {y} \left(P _ {0}\right) \cos \beta + f _ {z} \left(P _ {0}\right) \cos \gamma ,, \tag {1}
\]
其中 \(\cos\alpha,\cos\beta,\cos\gamma\) 为 \(\vec{l}\) 的方向余弦.
证设 \(P(x,y,z)\) 为 \(\vec{l}\) 上任一点,于是有
\[
\left. \begin{array}{l} \Delta x = x - x _ {0} = \rho \cos \alpha \\ \Delta y = y - y _ {0} = \rho \cos \beta \\ \Delta z = z - z _ {0} = \rho \cos \gamma . \end{array} \right\} \tag {2}
\]

由假设 \(f\) 在点 \(P_0\) 可微,则有
\[
f (P) - f \left(P _ {0}\right) = f _ {x} \left(P _ {0}\right) \Delta x + f _ {y} \left(P _ {0}\right) \Delta y + f _ {z} \left(P _ {0}\right) \Delta z + o (\rho).
\]
上式左、右两边皆除以 \(\rho\) ,并根据(2)式可得
\[
\begin{array}{l} (f (P) - f \left(P _ {0}\right)) / \rho = f _ {x} \left(P _ {0}\right) \Delta x / \rho + f _ {y} \left(P _ {0}\right) \Delta y / \rho + f _ {z} \left(P _ {0}\right) \Delta z / \rho + o (\rho) / \rho \\ = f _ {x} \left(P _ {0}\right) \cos \alpha + f _ {y} \left(P _ {0}\right) \cos \beta + f _ {z} \left(P _ {0}\right) \cos \gamma + o (\rho) / \rho . \\ \end{array}
\]
因为 \(\lim_{\rho\to0^{+}}o(\rho)/\rho=0,\) 所以上式左边的极限存在:
\[
\begin{array}{l} f _ {\vec {l}} (P _ {0}) = \lim _ {\rho \rightarrow 0 ^ {+}} (f (P) - f (P _ {0})) / \rho \\ = f _ {x} \left(P _ {0}\right) \cos \alpha + f _ {y} \left(P _ {0}\right) \cos \beta + f _ {z} \left(P _ {0}\right) \cos \gamma . \\ \end{array}
\]
对于二元函数 \(f(x,y)\) 来说,相应于 (1) 的结果为
\[
f _ {\vec {l}} \left(x _ {0}, y _ {0}\right) = f _ {x} \left(x _ {0}, y _ {0}\right) \cos \alpha + f _ {y} \left(x _ {0}, y _ {0}\right) \cos \beta , \tag {2}
\]
其中 \(\alpha,\beta\) 是 \(R^{2}\) 中向量 \(\vec{l}\) 的方向角.
\[
\begin{array}{l} (f (P) - f \left(P _ {0}\right)) / \rho = f _ {x} \left(P _ {0}\right) \Delta x / \rho + f _ {y} \left(P _ {0}\right) \Delta y / \rho + f _ {z} \left(P _ {0}\right) \Delta z / \rho + o (\rho) / \rho \\ = f _ {x} \left(P _ {0}\right) \cos \alpha + f _ {y} \left(P _ {0}\right) \cos \beta + f _ {z} \left(P _ {0}\right) \cos \gamma + o (\rho) / \rho . \\ \end{array}
\]
因为 \(\lim_{\rho\to0^{+}}o(\rho)/\rho=0,\) 所以上式左边的极限存在:
\[
\begin{array}{l} f _ {\vec {l}} (P _ {0}) = \lim _ {\rho \rightarrow 0 ^ {+}} (f (P) - f (P _ {0})) / \rho \\ = f _ {x} \left(P _ {0}\right) \cos \alpha + f _ {y} \left(P _ {0}\right) \cos \beta + f _ {z} \left(P _ {0}\right) \cos \gamma . \\ \end{array}
\]
对于二元函数 \(f(x,y)\) 来说,相应于 (1) 的结果为
\[
f _ {\vec {l}} (x _ {0}, y _ {0}) = f _ {x} \left(x _ {0}, y _ {0}\right) \cos \alpha + f _ {y} \left(x _ {0}, y _ {0}\right) \cos \beta , \tag {2}
\]
其中 \(\alpha,\beta\) 是 \(R^{2}\) 中向量 \(\vec{l}\) 的方向角.
例 (1)
设 \(f(x,y,z) = x + y^{2} + z^{3}\) , 求 \(f\) 在点 \(P_0(1,1,1)\) 处沿着指向点 \(P_{1}(3, - 1,2)\) 的方向导数.
解 易见 f 在点 \(P_{0}\) 可微。故由
\[
f _ {x} \left(P _ {0}\right) = 1, f _ {y} \left(P _ {0}\right) = 2, f _ {z} \left(P _ {0}\right) = 3,
\]
以及 \(\vec{l}=\overrightarrow{P_{0}P_{1}}=(2,-2,1)\) 的方向余弦
\[
\cos \alpha = \frac {\overrightarrow {p p _ {0}} \cdot \overrightarrow {i}}{| \overrightarrow {p p _ {0}} |} = \frac {2}{\sqrt {2 ^ {2} + (- 2) ^ {2} + 1 ^ {2}}} = \frac {2}{3},
\]
\[
\cos \beta = \frac {\overrightarrow {p p _ {0}} \cdot \overrightarrow {j}}{| \overrightarrow {p p _ {0}} |} = \frac {- 2}{3}, \quad \cos \gamma = \frac {\overrightarrow {p p _ {0}} \cdot \overrightarrow {k}}{| \overrightarrow {p p _ {0}} |} = \frac {1}{3},
\]
按公式 (1) 可求得
\[
f _ {\vec {l}} (P _ {0}) = 1 \cdot \frac {2}{3} + 2 \cdot \left(- \frac {2}{3}\right) + 3 \cdot \frac {1}{3} = \frac {1}{3}.
\]
例 (2)
函数
\[
f (x, y) = \left\{ \begin{array}{l l} 1, & \text {当} 0 < y < x ^ {2}, - \infty < x < + \infty \text {时}, \\ 0, & \text {其余部分}. \end{array} \right.
\]
已知它在原点不连续(当然也就不可微)。但在始于原点的任何射线上,都存在包含原点的充分小的一段,在这一段上 \(f\) 的函数值恒为零。
于是由方向导数定义,在原点处沿任何方向 \(\vec{l}\) 都有 \(f_{\vec{l}}(0,0) = 0\)

例题
4、 \(u = x^{2} + y^{2}\) 在点 \(M(1,1)\) 处沿 \(\vec{l} = (1,1)\) 方向的方向导数是 ____。
\(2\sqrt{2}\)
例题
4、 \(u = x^{2} + y^{2}\) 在点 \(M(1,1)\) 处沿 \(\vec{l} = (1,1)\) 方向的方向导数是 ____。
\[
2 \sqrt {2}
\]
说明
(i) 函数在一点可微 \(\Longrightarrow\) 方向导数存在
(ii) 函数在一点连续同样不是方向导数存在的必要条件,当然也非充分条件(对此读者应能举出反例)
定义 (2 梯度)
若 \(f(x,y,z)\) 在 \(P_0(x_0,y_0,z_0)\) 存在对所有自变量的偏导数,则称向量 \((f_x(P_0),f_y(P_0),f_z(P_0))\) 为函数 \(f\) 在点 \(P_0\) 的梯度,记作 \(\operatorname{grad}f(P_0) = (f_x(P_0),f_y(P_0),f_z(P_0)).\) \(\operatorname{grad}f(P_0)\) 的长度(或模)为 \(|\operatorname{grad}f(P_0)| = \sqrt{f_x(P_0)^2 + f_y(P_0)^2 + f_z(P_0)^2}\)
在定理 17.6 的条件下,若记 \(\vec{l}\) 方向上的单位向量为
\[
\vec {l _ {0}} = (\cos \alpha , \cos \beta , \cos \gamma),
\]
则方向导数计算公式(1)又可写成
\[
f _ {\bar {l}} (P _ {0}) = \mathrm{grad} f (P _ {0}) \cdot \vec {l _ {0}} = | \mathrm{grad} f (P _ {0}) | \cos \theta .
\]
这里 \(\theta\) 是梯度向量 \(\operatorname{grad} f(P_0)\) 与 \(\vec{l}_0\) 的夹角.
因此,当 \(\theta = 0\) 时,\(f_{\vec{l}}(P_0)\) 取得最大值 \(|\operatorname{grad} f(P_0)|\) . 这就是说,当 \(f\) 在点 \(P_0\) 可微时,\(f\) 在点 \(P_0\) 的梯度方向是 \(f\) 的值增长最快的方向,且沿这一方向的变化率就是梯度的模;
而当 \(\vec{l}\) 与梯度向量反方向 \((\theta = \pi)\) 时,方向导数取得最小值 \(-|\mathrm{grad}f(P_0)|\)
例 (3)
设 \(f(x,y,z) = xy^{2} + yz^{3}\) , 试求 \(f\) 在点 \(P_0(2, -1, 1)\) 处的梯度及它的模.
解 易得 \(f_{x}(P_{0})=1, f_{y}(P_{0})=-3, f_{z}(P_{0})=-3\) ,所以
\[
\operatorname{grad} f \left(P _ {0}\right) = (1, - 3, - 3),
\]
\[
| \mathrm{grad} f (P _ {0}) | = \sqrt {1 ^ {2} + (- 3) ^ {2} + (- 3) ^ {2}} = \sqrt {1 9}
\]
方向导数
Define \(\phi (\alpha) = f\left(\pmb {x}^{*} + \alpha \pmb {d}\right)\) for \(\alpha \in [0,\alpha_0]\) , then
\[
\phi^ {\prime} (0) = \left\{ \begin{array}{l l} \lim _ {\alpha \to 0 ^ {+}} \frac {\phi (\alpha) - \phi (0)}{\alpha} = \lim _ {\alpha \to 0 ^ {+}} \frac {f (\boldsymbol {x} ^ {*} + \alpha \boldsymbol {d}) - f (\boldsymbol {x} ^ {*})}{\alpha} & \text {Def} \\ \nabla f \left(\boldsymbol {x} ^ {*}\right) \cdot \boldsymbol {d} & \text {Chain rule} \end{array} \right.
\]
梯度下降法

\[
\min _ {w} f = \frac {1}{2} (1. 5 w - 1. 2) ^ {2}
\]
\[
w _ {k + 1} = w _ {k} - \alpha \nabla f (x _ {k}) = w _ {k} - \alpha (1. 5 w _ {k} - 1. 2) \cdot 1. 5
\]
梯度下降法

作业
p. 131
1、3、5
数学分析
2025-2026 (2)
Ch. 17d
沈超敏
计算机科学与技术学院
第十七章 多元函数微分学
§4 泰勒公式与极值问题
高阶偏导数
由于 \(z = f(x,y)\) 的偏导数 \(f_{x}(x,y),f_{y}(x,y)\) 一般仍然是 \(x,y\) 的函数,如果它们关于 \(x\) 与 \(y\) 的偏导数也存在,说明 \(f\) 具有二阶偏导数.
二元函数的二阶偏导数有如下四种形式:
\[
\begin{array}{l} f _ {x x} (x, y) = \frac {\partial^ {2} z}{\partial x ^ {2}} = \frac {\partial}{\partial x} \left(\frac {\partial z}{\partial x}\right), f _ {x y} (x, y) = \frac {\partial^ {2} z}{\partial x \partial y} = \frac {\partial}{\partial y} \left(\frac {\partial z}{\partial x}\right), \\ f _ {y x} (x, y) = \frac {\partial^ {2} z}{\partial y \partial x} = \frac {\partial}{\partial x} \left(\frac {\partial z}{\partial y}\right), f _ {y y} (x, y) = \frac {\partial^ {2} z}{\partial y ^ {2}} = \frac {\partial}{\partial y} \left(\frac {\partial z}{\partial y}\right). \\ \end{array}
\]
其中 \(f_{xy}, f_{yx}\) 这两个既有 x,又有 y 的高阶偏导数称为混合偏导数。
类似地可以定义更高阶的偏导数,例如 \(z = f(x,y)\) 的的三阶偏导数共有八种情况:
\[
\frac {\partial}{\partial x} \left(\frac {\partial z}{\partial x ^ {2}}\right) = \frac {\partial^ {3} z}{\partial x ^ {3}} = f _ {x ^ {3}} (x, y)
\]
\[
\frac {\partial}{\partial y} \left(\frac {\partial z}{\partial x ^ {2}}\right) = \frac {\partial^ {2} z}{\partial x ^ {2} \partial y} = f _ {x ^ {2} y} (x, y)
\]
\[
f _ {x y x} (x, y), \quad f _ {x y ^ {2}} (x, y), \quad f _ {y ^ {3}} (x, y),
\]
\[
f _ {y ^ {2} x} (x, y), \quad f _ {y x y} (x, y), \quad f _ {y x ^ {2}} (x, y).
\]
例 (1)
求函数 \(z = \mathrm{e}^{x + 2y}\) 的所有二阶偏导数和 \(\frac{\partial^3z}{\partial y\partial x^2}\) .
解 由于 \(\frac{\partial z}{\partial x} = \mathrm{e}^{x + 2y}\) , \(\frac{\partial z}{\partial y} = 2\mathrm{e}^{x + 2y}\) , 因此有
\[
\frac {\partial^ {2} z}{\partial x ^ {2}} = \frac {\partial}{\partial x} \left(\mathrm{e} ^ {x + 2 y}\right) = \mathrm{e} ^ {x + 2 y}
\]
\[
\frac {\partial^ {2} z}{\partial y ^ {2}} = \frac {\partial}{\partial y} \left(2 \mathrm{e} ^ {x + 2 y}\right) = 4 e ^ {x + 2 y}
\]
\[
\frac {\partial^ {2} z}{\partial x \partial y} = \frac {\partial}{\partial y} \left(\mathrm{e} ^ {x + 2 y}\right) = 2 \mathrm{e} ^ {x + 2 y};
\]
\[
\frac {\partial^ {2} z}{\partial y \partial x} = \frac {\partial}{\partial x} \left(2 \mathrm{e} ^ {x + 2 y}\right) = 2 \mathrm{e} ^ {x + 2 y};
\]
\[
{\frac {\partial^ {3} z}{\partial y \partial x ^ {2}}} = {\frac {\partial}{\partial x}} \left({\frac {\partial^ {2} z}{\partial y \partial x}}\right) = {\frac {\partial}{\partial x}} \left(2 \mathrm{e} ^ {x + 2 y}\right) = 2 \mathrm{e} ^ {x + 2 y}.
\]
例 (2)
求函数 \(z = \arctan \frac{y}{x}\) 的所有二阶偏导数.
解 因为 \(\frac{\partial z}{\partial x} = \frac{-y}{x^2 + y^2}, \frac{\partial z}{\partial y} = \frac{x}{x^2 + y^2}\) ,所以二阶偏导数为
\[
\frac {\partial^ {2} z}{\partial x ^ {2}} = \frac {\partial}{\partial x} \left(\frac {- y}{x ^ {2} + y ^ {2}}\right) = \frac {2 x y}{(x ^ {2} + y ^ {2}) ^ {2}},
\]
\[
{\frac {\partial^ {2} z}{\partial y ^ {2}}} = {\frac {\partial}{\partial y}} \left({\frac {x}{x ^ {2} + y ^ {2}}}\right) = {\frac {- 2 x y}{(x ^ {2} + y ^ {2}) ^ {2}}}.
\]
\[
\frac {\partial^ {2} z}{\partial x \partial y} = \frac {\partial}{\partial y} \left(\frac {- y}{x ^ {2} + y ^ {2}}\right) = - \frac {x ^ {2} - y ^ {2}}{(x ^ {2} + y ^ {2}) ^ {2}},
\]
\[
\frac {\partial^ {2} z}{\partial y \partial x} = \frac {\partial}{\partial x} \left(\frac {x}{x ^ {2} + y ^ {2}}\right) = - \frac {x ^ {2} - y ^ {2}}{(x ^ {2} + y ^ {2}) ^ {2}},
\]
注意 在上面两个例子中都有
\[
\frac {\partial^ {2} z}{\partial x \partial y} = \frac {\partial^ {2} z}{\partial y \partial x},
\]
即先对 x、后对 y 与先对 y、后对 x 的两个二阶偏导数相等.
但是这个结论并不对任何函数都成立,例如
一个例子:混合偏导可能不相等
构造函数
\[
f (x, y) = \left\{ \begin{array}{l l} x y, & | x | < | y |, \\ 2 x y, & | x | \geq | y |. \end{array} \right.
\]
在原点 \((0,0)\) 处的混合偏导数
先 \(x\) 后 \(y\)
固定 \(y \neq 0\) ,当 \(x \to 0\) 时,
\[
| x | < | y |
\]
所以
\[
f (x, y) = x y.
\]
因此
\[
f _ {x} (0, y) = y.
\]
于是
\[
f _ {x y} (0, 0) = \lim _ {y \to 0} \frac {f _ {x} (0 , y) - f _ {x} (0 , 0)}{y} = 1.
\]
先 \(y\) 后 \(x\)
固定 \(x \neq 0\) ,当 \(y \to 0\) 时,
\[
| x | \geq | y |
\]
所以
\[
f (x, y) = 2 x y.
\]
因此
\[
f _ {y} (x, 0) = 2 x.
\]
于是
\[
f _ {y x} (0, 0) = \lim _ {x \to 0} \frac {f _ {y} (x , 0) - f _ {y} (0 , 0)}{x} = 2.
\]
例
\[
f (x, y) = \left\{ \begin{array}{c l} x y \frac {x ^ {2} - y ^ {2}}{x ^ {2} + y ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
其一阶偏导数为
\[
f _ {x} (x, y) = \left\{ \begin{array}{l l} \frac {y \big (x ^ {4} + 4 x ^ {2} y ^ {2} - y ^ {4} \big)}{(x ^ {2} + y ^ {2}) ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0; \end{array} \right.
\]
\[
f _ {y} (x, y) = \left\{ \begin{array}{l l} \frac {x \big (x ^ {4} - 4 x ^ {2} y ^ {2} - y ^ {4} \big)}{(x ^ {2} + y ^ {2}) ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
\[
f _ {x y} (0, 0) = \lim _ {\Delta y \rightarrow 0} \frac {f _ {x} (0 , \Delta y) - f _ {x} (0 , 0)}{\Delta y} = \lim _ {\Delta y \rightarrow 0} \frac {- \Delta y}{\Delta y} = - 1,
\]
\[
f _ {y x} (0, 0) = \lim _ {\Delta x \rightarrow 0} \frac {f _ {y} (\Delta x , 0) - f _ {y} (0 , 0)}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \frac {\Delta x}{\Delta x} = 1.
\]
例
\[
f (x, y) = \left\{ \begin{array}{c l} x y \frac {x ^ {2} - y ^ {2}}{x ^ {2} + y ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
其一阶偏导数为
\[
f _ {x} (x, y) = \left\{ \begin{array}{l l} \frac {y \big (x ^ {4} + 4 x ^ {2} y ^ {2} - y ^ {4} \big)}{(x ^ {2} + y ^ {2}) ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0; \end{array} \right.
\]
\[
f _ {y} (x, y) = \left\{ \begin{array}{l l} \frac {x \big (x ^ {4} - 4 x ^ {2} y ^ {2} - y ^ {4} \big)}{(x ^ {2} + y ^ {2}) ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
\[
f _ {x y} (0, 0) = \lim _ {\Delta y \rightarrow 0} \frac {f _ {x} (0 , \Delta y) - f _ {x} (0 , 0)}{\Delta y} = \lim _ {\Delta y \rightarrow 0} \frac {- \Delta y}{\Delta y} = - 1,
\]
\[
f _ {y x} (0, 0) = \lim _ {\Delta x \rightarrow 0} \frac {f _ {y} (\Delta x , 0) - f _ {y} (0 , 0)}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \frac {\Delta x}{\Delta x} = 1.
\]
例
\[
f (x, y) = \left\{ \begin{array}{c l} x y \frac {x ^ {2} - y ^ {2}}{x ^ {2} + y ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
其一阶偏导数为
\[
f _ {x} (x, y) = \left\{ \begin{array}{l l} \frac {y \big (x ^ {4} + 4 x ^ {2} y ^ {2} - y ^ {4} \big)}{(x ^ {2} + y ^ {2}) ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0; \end{array} \right.
\]
\[
f _ {y} (x, y) = \left\{ \begin{array}{l l} \frac {x \big (x ^ {4} - 4 x ^ {2} y ^ {2} - y ^ {4} \big)}{(x ^ {2} + y ^ {2}) ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
\[
f _ {x y} (0, 0) = \lim _ {\Delta y \to 0} \frac {f _ {x} (0 , \Delta y) - f _ {x} (0 , 0)}{\Delta y} = \lim _ {\Delta y \to 0} \frac {- \Delta y}{\Delta y} = - 1,
\]
\[
f _ {y x} (0, 0) = \lim _ {\Delta x \rightarrow 0} \frac {f _ {y} (\Delta x , 0) - f _ {y} (0 , 0)}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \frac {\Delta x}{\Delta x} = 1.
\]
例
\[
f (x, y) = \left\{ \begin{array}{c l} x y \frac {x ^ {2} - y ^ {2}}{x ^ {2} + y ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
其一阶偏导数为
\[
f _ {x} (x, y) = \left\{ \begin{array}{l l} \frac {y \big (x ^ {4} + 4 x ^ {2} y ^ {2} - y ^ {4} \big)}{(x ^ {2} + y ^ {2}) ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0; \end{array} \right.
\]
\[
f _ {y} (x, y) = \left\{ \begin{array}{l l} \frac {x \big (x ^ {4} - 4 x ^ {2} y ^ {2} - y ^ {4} \big)}{(x ^ {2} + y ^ {2}) ^ {2}}, & x ^ {2} + y ^ {2} \neq 0, \\ 0, & x ^ {2} + y ^ {2} = 0. \end{array} \right.
\]
\[
f _ {x y} (0, 0) = \lim _ {\Delta y \to 0} \frac {f _ {x} (0 , \Delta y) - f _ {x} (0 , 0)}{\Delta y} = \lim _ {\Delta y \to 0} \frac {- \Delta y}{\Delta y} = - 1,
\]
\[
f _ {y x} (0, 0) = \lim _ {\Delta x \rightarrow 0} \frac {f _ {y} (\Delta x , 0) - f _ {y} (0 , 0)}{\Delta x} = \lim _ {\Delta x \rightarrow 0} \frac {\Delta x}{\Delta x} = 1.
\]
由此看到,这两个混合偏导数与求导顺序有关.
在什么条件下混合偏导数与求导顺序无关呢?
为此,先按定义把 \(f_{xy}(x_0, y_0)\) 与 \(f_{yx}(x_0, y_0)\) 表示成极限形式。由于
\[
f _ {x} (x, y) = \lim _ {\Delta x \to 0} \frac {f (x + \Delta x , y) - f (x , y)}{\Delta x},
\]
因此有
\[
\begin{array}{l} f _ {x y} \left(x _ {0}, y _ {0}\right) = \lim _ {\Delta y \rightarrow 0} \frac {f _ {x} \left(x _ {0} , y _ {0} + \Delta y\right) - f _ {x} \left(x _ {0} , y _ {0}\right)}{\Delta y} \\ = \lim _ {\Delta y \to 0} \frac {1}{\Delta y} \left[ \lim _ {\Delta x \to 0} \frac {f (x _ {0} + \Delta x , y _ {0} + \Delta y) - f (x _ {0} , y _ {0} + \Delta y)}{\Delta x} \right. \\ \left. - \lim _ {\Delta x \rightarrow 0} \frac {f \left(x _ {0} + \Delta x , y _ {0}\right) - f \left(x _ {0} , y _ {0}\right)}{\Delta x} \right] \\ \end{array}
\]
\[
\begin{array}{l} f _ {x y} \left(x _ {0}, y _ {0}\right) = \lim _ {\Delta y \rightarrow 0} \lim _ {\Delta x \rightarrow 0} \frac {1}{\Delta x \Delta y} [ f (x _ {0} + \Delta x, y _ {0} + \Delta y) \tag {1} \\ \left. - f \left(x _ {0}, y _ {0} + \Delta y\right) - f \left(x _ {0} + \Delta x, y _ {0}\right) + f \left(x _ {0}, y _ {0}\right) \right]; \\ \end{array}
\]
类似地有
\[
\begin{array}{l} f _ {y x} \left(x _ {0}, y _ {0}\right) = \lim _ {\Delta x \rightarrow 0} \lim _ {\Delta y \rightarrow 0} \frac {1}{\Delta x \Delta y} [ f (x _ {0} + \Delta x, y _ {0} + \Delta y) \tag {2} \\ \left. - f \left(x _ {0} + \Delta x, y _ {0}\right) - f \left(x _ {0}, y _ {0} + \Delta y\right) + f \left(x _ {0}, y _ {0}\right) \right]. \\ \end{array}
\]
为使 \(f_{xy}(x_0, y_0) = f_{yx}(x_0, y_0)\) 成立,必须使 (1)、(2) 这两个累次极限相等。下述定理给出了使 (1) 与 (2) 相等的一个充分条件。
定理 (17.7)
若 \(f_{xy}(x,y)\) 与 \(f_{yx}(x,y)\) 都在点 \((x_0,y_0)\) 连续,则
\[
f _ {x y} \left(x _ {0}, y _ {0}\right) = f _ {y x} \left(x _ {0}, y _ {0}\right). \tag {3}
\]
证令
\[
F (\Delta x, \Delta y) = f (x _ {0} + \Delta x, y _ {0} + \Delta y) - f (x _ {0} + \Delta x, y _ {0})
\]
\[
- f \left(x _ {0}, y _ {0} + \Delta y\right) + f \left(x _ {0}, y _ {0}\right),
\]
i)
\[
\varphi (x) = f \left(x, y _ {0} + \Delta y\right) - f \left(x, y _ {0}\right).
\]
于是有
\[
F (\Delta x, \Delta y) = \varphi (x _ {0} + \Delta x) - \varphi (x _ {0}). \tag {4}
\]
对 \(\varphi\) 应用微分中值定理,\(\exists\theta_{1}\in(0,1)\) , 使得
\[
\begin{array}{l} \varphi \left(x _ {0} + \Delta x\right) - \varphi \left(x _ {0}\right) = \varphi^ {\prime} \left(x _ {0} + \theta_ {1} \Delta x\right) \Delta x \\ = \left[ f _ {x} \left(x _ {0} + \theta_ {1} \Delta x, y _ {0} + \Delta y\right) - f _ {x} \left(x _ {0} + \theta_ {1} \Delta x, y _ {0}\right) \right] \Delta x \\ \end{array}
\]
又 \(f_{x}(x_{0} + \theta_{1}\Delta x,y)\) 作为 \(y\) 的可导函数,再使用微分中值定理, \(\exists \theta_2\in (0,1)\) ,使上式化为
\[
\varphi \left(x _ {0} + \Delta x\right) - \varphi \left(x _ {0}\right) = f _ {x y} \left(x _ {0} + \theta_ {1} \Delta x, y _ {0} + \theta_ {2} \Delta y\right) \Delta x \Delta y.
\]
由(4)则有
\[
\begin{array}{l} F (\Delta x, \Delta y) = f _ {x y} \left(x _ {0} + \theta_ {1} \Delta x, y _ {0} + \theta_ {2} \Delta y\right) \Delta x \Delta y \\ (0 < \theta_ {1}, \theta_ {2} < 1). \\ \end{array}
\]
ii)为了得到 \(f_{yx}\) ,再令
\[
\psi (y) = f \left(x _ {0} + \Delta x, y\right) - f \left(x _ {0}, y\right)
\]
则有
\[
\begin{array}{l} F (\Delta x, \Delta y) = f (x _ {0} + \Delta x, y _ {0} + \Delta y) - f (x _ {0}, y _ {0}) \\ - f \left(x _ {0}, y _ {0} + \Delta y\right) + f \left(x _ {0}, y _ {0}\right) = \psi \left(y _ {0} + \Delta y\right) - \psi \left(y _ {0}\right). \\ \end{array}
\]
用前面相同的方法,又可得到
\[
\begin{array}{l} F (\Delta x, \Delta y) = f _ {y x} \left(x _ {0} + \theta_ {3} \Delta x, y _ {0} + \theta_ {4} \Delta y\right) \Delta x \Delta y \\ \left(0 < \theta_ {3}, \theta_ {4} < 1\right). \\ \end{array}
\]
iii)当 \(\Delta x, \Delta y\) 不为零时,由 (5),(6) 两式又得
\[
\begin{array}{l} f _ {x y} \left(x _ {0} + \theta_ {1} \Delta x, y _ {0} + \theta_ {2} \Delta y\right) = f _ {y x} \left(x _ {0} + \theta_ {3} \Delta x, y _ {0} + \theta_ {4} \Delta y\right) \\ (0 < \theta_ {1}, \theta_ {2}, \theta_ {3}, \theta_ {4} < 1) \\ \end{array}
\]
由定理假设 \(f_{xy}(x,y)\) 与 \(f_{yx}(x,y)\) 都在点 \((x_{0},y_{0})\) 连续,故当 \(\Delta x \rightarrow 0, \Delta y \rightarrow 0\) 时,(7) 式的两边的极限存在且相等,这就得到所要证明的 (3) 式.
注 1 若二元函数 \(f(x,y)\) 在某一点存在直到 \(n\) 阶的连续混合偏导数,则在这一点的所有 \(m\) \((m\leq n)\) 阶混合偏导数都与求导顺序无关.
注 2 这个定理对 \(n\) 元函数的混合偏导数也成立。例如三元函数 \(f(x,y,z)\) 的如下六个三阶混合偏导数
\[
\begin{array}{l} f _ {x y z} (x, y, z), f _ {x z y} (x, y, z), f _ {y z x} (x, y, z), \\ f _ {y x z} (x, y, z), f _ {z x y} (x, y, z), f _ {z y x} (x, y, z) \\ \end{array}
\]
若在某一点都连续,则它们在这一点都相等。
今后在牵涉求导顺序问题时,除特别指出外,一般都假设相应阶数的混合偏导数连续.
注 1 若二元函数 \(f(x,y)\) 在某一点存在直到 \(n\) 阶的连续混合偏导数,则在这一点的所有 \(m\) \((m\leq n)\) 阶混合偏导数都与求导顺序无关.
注 2 这个定理对 \(n\) 元函数的混合偏导数也成立。例如三元函数 \(f(x,y,z)\) 的如下六个三阶混合偏导数
\[
f _ {x y z} (x, y, z), f _ {x z y} (x, y, z), f _ {y z x} (x, y, z),
\]
\[
f _ {y x z} (x, y, z), f _ {z x y} (x, y, z), f _ {z y x} (x, y, z)
\]
若在某一点都连续,则它们在这一点都相等.
今后在牵涉求导顺序问题时,除特别指出外,一般都假设相应阶数的混合偏导数连续.
复合函数的高阶偏导数
设
\[
z = f (x, y), \quad x = \varphi (s, t), \quad y = \psi (s, t).
\]
若函数 \(f, \varphi, \psi\) 都具有连续的二阶偏导数,则复合函数 \(z = f(\varphi(s, t), \psi(s, t))\) 对于 \(s, t\) 同样存在二阶连续偏导数。具体计算如下:
\[
{\frac {\partial z}{\partial s}} = {\frac {\partial z}{\partial x}} {\frac {\partial x}{\partial s}} + {\frac {\partial z}{\partial y}} {\frac {\partial y}{\partial s}}, \quad {\frac {\partial z}{\partial t}} = {\frac {\partial z}{\partial x}} {\frac {\partial x}{\partial t}} + {\frac {\partial z}{\partial y}} {\frac {\partial y}{\partial t}}
\]
显然 \(\frac{\partial z}{\partial s}\) 与 \(\frac{\partial z}{\partial t}\) 仍是 \(s, t\) 的复合函数,其中 \(\frac{\partial z}{\partial x}, \frac{\partial z}{\partial y}\) 是 \(x, y\) 的函数, \(\frac{\partial x}{\partial s}, \frac{\partial x}{\partial t}, \frac{\partial y}{\partial s}, \frac{\partial y}{\partial t}\) 是 \(s, t\) 的函数。继续求 \(z\) 关于 \(s, t\) 的二阶偏导数:
\[
\begin{array}{l} \frac {\partial^ {2} z}{\partial s ^ {2}} = \frac {\partial}{\partial s} \left(\frac {\partial z}{\partial x}\right) \frac {\partial x}{\partial s} + \frac {\partial z}{\partial x} \cdot \frac {\partial}{\partial s} \left(\frac {\partial x}{\partial s}\right) \\ + \frac {\partial}{\partial s} \left(\frac {\partial z}{\partial y}\right) \frac {\partial y}{\partial s} + \frac {\partial z}{\partial y} \cdot \frac {\partial}{\partial s} \left(\frac {\partial y}{\partial s}\right) \\ = \left(\frac {\partial^ {2} z}{\partial x ^ {2}} \cdot \frac {\partial x}{\partial s} + \frac {\partial^ {2} z}{\partial x \partial y} \cdot \frac {\partial y}{\partial s}\right) \frac {\partial x}{\partial s} + \frac {\partial z}{\partial x} \cdot \frac {\partial^ {2} x}{\partial s ^ {2}} \\ + \left(\frac {\partial^ {2} z}{\partial y \partial x} \cdot \frac {\partial x}{\partial s} + \frac {\partial^ {2} z}{\partial y ^ {2}} \cdot \frac {\partial y}{\partial s}\right) \frac {\partial y}{\partial s} + \frac {\partial z}{\partial y} \cdot \frac {\partial^ {2} y}{\partial s ^ {2}} \\ = \frac {\partial^ {2} z}{\partial x ^ {2}} \left(\frac {\partial x}{\partial s}\right) ^ {2} + 2 \frac {\partial^ {2} z}{\partial x \partial y} \cdot \frac {\partial x}{\partial s} \cdot \frac {\partial y}{\partial s} \\ + \frac {\partial^ {2} z}{\partial y ^ {2}} \left(\frac {\partial y}{\partial s}\right) ^ {2} + \frac {\partial z}{\partial x} \cdot \frac {\partial^ {2} x}{\partial s ^ {2}} + \frac {\partial z}{\partial y} \cdot \frac {\partial^ {2} y}{\partial s ^ {2}}. \\ \end{array}
\]
同理可得
\[
\begin{array}{l} \frac {\partial^ {2} z}{\partial t ^ {2}} = \frac {\partial^ {2} z}{\partial x ^ {2}} \left(\frac {\partial x}{\partial t}\right) ^ {2} + 2 \frac {\partial^ {2} z}{\partial x \partial y} \cdot \frac {\partial x}{\partial t} \cdot \frac {\partial y}{\partial t} \\ + \frac {\partial^ {2} z}{\partial y ^ {2}} \left(\frac {\partial y}{\partial t}\right) ^ {2} + \frac {\partial z}{\partial x} \cdot \frac {\partial^ {2} x}{\partial t ^ {2}} + \frac {\partial z}{\partial y} \cdot \frac {\partial^ {2} y}{\partial t ^ {2}}; \\ \end{array}
\]
\[
\begin{array}{l} \frac {\partial^ {2} z}{\partial s \partial t} = \frac {\partial^ {2} z}{\partial x ^ {2}} \cdot \frac {\partial x}{\partial s} \cdot \frac {\partial x}{\partial t} + \frac {\partial^ {2} z}{\partial x \partial y} \left(\frac {\partial x}{\partial s} \cdot \frac {\partial y}{\partial t} + \frac {\partial x}{\partial t} \cdot \frac {\partial y}{\partial s}\right) \\ + \frac {\partial^ {2} z}{\partial y ^ {2}} \cdot \frac {\partial y}{\partial s} \cdot \frac {\partial y}{\partial t} + \frac {\partial z}{\partial x} \cdot \frac {\partial^ {2} x}{\partial s \partial t} + \frac {\partial z}{\partial y} \cdot \frac {\partial^ {2} y}{\partial s \partial t} \\ \end{array}
\]
\[
\frac {\partial^ {2} z}{\partial t \partial s} = \frac {\partial^ {2} z}{\partial s \partial t}.
\]
例 (3)
设 \(z = f\left(x,\frac{x}{y}\right)\) ,求 \(\frac{\partial^2z}{\partial x^2},\frac{\partial^2z}{\partial x\partial y}.\)
解 这里 z 是以 x, y 为自变量的复合函数,它也可以改写成如下形式:
\[
z = f (u, v), \quad u = x, \quad v = \frac {x}{y}.
\]
由复合函数求导公式,有
\[
{\frac {\partial z}{\partial x}} = {\frac {\partial f}{\partial u}} \cdot {\frac {\partial u}{\partial x}} + {\frac {\partial f}{\partial v}} \cdot {\frac {\partial \pmb {v}}{\partial x}} = {\frac {\partial f}{\partial u}} + {\frac {1}{y}} \cdot {\frac {\partial f}{\partial v}}.
\]
注意,这里 \(\frac{\partial f}{\partial u}, \frac{\partial f}{\partial v}\) 仍是以 \(u, v\) 为中间变量,\(x, y\) 为自变量的复合函数.
所以
\[
\begin{array}{l} \frac {\partial^ {2} z}{\partial x ^ {2}} = \frac {\partial}{\partial x} \left(\frac {\partial f}{\partial u} + \frac {1}{y} \cdot \frac {\partial f}{\partial v}\right) \\ = \frac {\partial^ {2} f}{\partial u ^ {2}} \cdot \frac {\partial u}{\partial x} + \frac {\partial^ {2} f}{\partial u \partial v} \cdot \frac {\partial v}{\partial x} + \frac {1}{y} \cdot \left(\frac {\partial^ {2} f}{\partial v \partial u} \cdot \frac {\partial u}{\partial x} + \frac {\partial^ {2} f}{\partial v ^ {2}} \cdot \frac {\partial v}{\partial x}\right) \\ = \frac {\partial^ {2} f}{\partial u ^ {2}} + \frac {2}{y} \cdot \frac {\partial^ {2} f}{\partial u \partial v} + \frac {1}{y ^ {2}} \cdot \frac {\partial^ {2} f}{\partial v ^ {2}} \\ \end{array}
\]
\[
\begin{array}{l} \frac {\partial^ {2} z}{\partial x \partial y} = \frac {\partial}{\partial y} \left(\frac {\partial f}{\partial u} + \frac {1}{y} \cdot \frac {\partial f}{\partial v}\right) \\ = \frac {\partial^ {2} f}{\partial u ^ {2}} \cdot \frac {\partial u}{\partial y} + \frac {\partial^ {2} f}{\partial u \partial v} \cdot \frac {\partial v}{\partial y} - \frac {1}{y ^ {2}} \cdot \frac {\partial f}{\partial v} \\ + \frac {1}{y} \cdot \left(\frac {\partial^ {2} f}{\partial v \partial u} \cdot \frac {\partial u}{\partial y} + \frac {\partial^ {2} f}{\partial v ^ {2}} \cdot \frac {\partial v}{\partial y}\right) \\ = - \frac {x}{y ^ {2}} \cdot \frac {\partial^ {2} f}{\partial u \partial v} - \frac {x}{y ^ {3}} \cdot \frac {\partial^ {2} f}{\partial v ^ {2}} - \frac {1}{y ^ {2}} \cdot \frac {\partial f}{\partial v}. \\ \end{array}
\]
中值定理和泰勒公式
二元函数的中值公式和泰勒公式,与一元函数的拉格朗日公式和泰勒公式相仿对于 \(n(n > 2)\) 元函数也有相同的公式,只是形式上更复杂一些.
先介绍凸区域。若区域 D 上任意两点的连线都含于 D, 则称 D 为凸区域 (图 17-6).


i.e., 若 \(D\) 为凸区域,则对任意两点 \(P_{1}(x_{1},y_{1}), P_{2}(x_{2},y_{2}) \in D\) , 和一切 \(\lambda (0\leq \lambda \leq 1)\) , 恒有
\[
P \left(x _ {1} + \lambda \left(x _ {2} - x _ {1}\right), y _ {1} + \lambda \left(y _ {2} - y _ {1}\right)\right) \in D.
\]
定理 (17.8 (中值定理))
设 \(f(x,y)\) 在凸区域 \(D\subset \mathbb{R}^2\) 上连续,在 \(D\) 的所有内点都可微,则对 \(D\) 内任意两点 \(P(a,b)\) \(Q(a + h,b + k)\in \mathrm{int}D,\exists \theta (0 < \theta < 1)\) ,使得
\[
f (a + h, b + k) - f (a, b) = f _ {x} (a + \theta h, b + \theta k) h + f _ {y} (a + \theta h, b + \theta k) k. \tag {8}
\]
证令 \(\phi(t) = f(a + th, b + tk)\) , 它是定义在 \([0,1]\) 上的一元连续函数,且在 \((0,1)\) 内可微。根据一元函数中值定理,\(\exists \theta (0 < \theta < 1)\) , 使得
\[
\phi (1) - \phi (0) = \phi^ {\prime} (\theta) \tag {9}
\]
其中
\[
\phi^ {\prime} (\theta) = f _ {x} (a + \theta h, b + \theta k) h + f _ {y} (a + \theta h, b + \theta k) k. \tag {10}
\]
由于 D 为凸区域,因此 \((a + \theta h, b + \theta k) \in D\) , 根据 (9)、(10) 两式即得所要证明的 (8).
注 若 D 为严格凸区域,即 \(\forall P_{1}(x_{1},y_{1}), P_{2}(x_{2},y_{2})\in D, \forall\lambda(0<\lambda<1)\) , 都有
\[
P \left(x _ {1} + \lambda \left(x _ {2} - x _ {1}\right), y _ {1} + \lambda \left(y _ {2} - y _ {1}\right)\right) \in \text { int } D,
\]
则对 D 上连续、int D 内可微的函数 f,只要 \(P, Q \in D\) ,也存在 \(\theta \in (0,1)\) ,使 (8) 式成立.
例如,\(f\) 在闭圆域 \(D\) 上连续,在 \(\operatorname{int} D\) 内可微,则必有 (8) 式成立.
公式(8)也称为二元函数(在凸域上)的中值公式.
注 若 D 为严格凸区域,即 \(\forall P_{1}(x_{1},y_{1}), P_{2}(x_{2},y_{2})\in D, \forall\lambda(0<\lambda<1)\) , 都有
\[
P \left(x _ {1} + \lambda \left(x _ {2} - x _ {1}\right), y _ {1} + \lambda \left(y _ {2} - y _ {1}\right)\right) \in \text { int } D,
\]
则对 \(D\) 上连续、int \(D\) 内可微的函数 \(f\) , 只要 \(P, Q \in D\) , 也存在 \(\theta \in (0,1)\) , 使 (8) 式成立。例如,\(f\) 在闭圆域 \(D\) 上连续,在 int \(D\) 内可微,则必有 (8) 式成立.
公式 (8) 也称为二元函数(在凸域上)的中值公式.
它与定理 17.3 的中值公式 (12)
\[
\begin{array}{l} f \left(x _ {0} + h, y _ {0} + k\right) - f \left(x _ {0}, y _ {0}\right) = f _ {x} \left(x _ {0} + \theta_ {1} h, y _ {0}\right) h + f _ {y} \left(x _ {0}, y _ {0} + \theta_ {2} k\right) k, \\ 0 < \theta_ {1}, \theta_ {2} < 1 \\ \end{array}
\]
相比较,差别在于这里的中值点 \((a + \theta h, b + \theta k)\) 是在连线 \(PQ\) 上,而在定理 17.3 中的 \(\theta_1, \theta_2\) 可以不相等.
推论
若函数 \(f\) 在区域 \(D\) 上存在偏导数,且
\[
f _ {x} = f _ {y} = 0
\]
则 f 在区域 D 上为常量函数.
请读者作为练习自行证明此推论.
它与定理 17.3 的中值公式 (12)
\[
\begin{array}{l} f \left(x _ {0} + h, y _ {0} + k\right) - f \left(x _ {0}, y _ {0}\right) = f _ {x} \left(x _ {0} + \theta_ {1} h, y _ {0}\right) h + f _ {y} \left(x _ {0}, y _ {0} + \theta_ {2} k\right) k, \\ 0 < \theta_ {1}, \theta_ {2} < 1 \\ \end{array}
\]
相比较,差别在于这里的中值点 \((a + \theta h, b + \theta k)\) 是在连线 \(PQ\) 上,而在定理 17.3 中的 \(\theta_1, \theta_2\) 可以不相等.
推论
若函数 \(f\) 在区域 \(D\) 上存在偏导数,且
\[
f _ {x} = f _ {y} = 0
\]
则 f 在区域 D 上为常量函数.
请读者作为练习自行证明此推论.
例 \(^{*}\) 4
对 \(f(x,y) = \frac{1}{\sqrt{x^2 - 2xy + 1}}\) 应用微分中值定理,证明存在某个 \(\theta (0 < \theta < 1)\) ,使得
\[
1 - \sqrt {2} = \sqrt {2} (1 - 3 \theta) \left(1 - 2 \theta + 3 \theta^ {2}\right) ^ {- 3 / 2}.
\]
分析 将上式改写成
\[
{\frac {1 - \sqrt {2}}{\sqrt {2}}} = (1 - 3 \theta) (1 - 2 \theta + 3 \theta^ {2}) ^ {- 3 / 2}
\]
左边恰好是 \(f(1,0) - f(0,1) = \frac{1}{\sqrt{2}} - 1\) ,故应在两点 \(P_{1}(1,0)\) 与 \(P_{2}(0,1)\) 之间应用微分中值定理.
证首先,当 \(x^{2} + y^{2} \leq 1\) 时,有 \(x^{2} - 2xy + 1 > 0\) .
\[
f _ {x} = - \frac {x - y}{(x ^ {2} - 2 x y + 1) ^ {3 / 2}}, f _ {y} = \frac {x}{(x ^ {2} - 2 x y + 1) ^ {3 / 2}}.
\]
易知 \(f_{x}\) 与 \(f_{y}\) 在凸闭域 \(D = \{(x,y)\mid x^{2} + y^{2}\leq 1\}\) 上连续, \(P_{1},P_{2}\in D\) 。由中值定理, \(\exists \theta (0 < \theta < 1)\) ,使得
\[
\begin{array}{l} \frac {1}{\sqrt {2}} - 1 = f (1, 0) - f (0, 1) \\ = f _ {x} (0 + \theta , 1 - \theta) \cdot 1 + f _ {y} (0 + \theta , 1 - \theta) \cdot (- 1) \\ = - \frac {\theta - (1 - \theta)}{\left[ \theta^ {2} - 2 \theta (1 - \theta) + 1 \right] ^ {3 / 2}} \cdot 1 + \frac {\theta}{\left[ \theta^ {2} - 2 \theta (1 - \theta) + 1 \right] ^ {3 / 2}} \cdot (- 1) \\ = (1 - 3 \theta) \left(1 - 2 \theta + 3 \theta^ {2}\right) ^ {- 3 / 2}. \\ \end{array}
\]
定理 (定理 17.9 (泰勒定理))
若 \(f\) 在点 \(P_0(x_0, y_0)\) 的某邻域 \(U(P_0)\) 内有直到 \(n + 1\) 阶的连续偏导数,则对 \(U(P_0)\) 内任一点 \((x_0 + h, y_0 + k), \exists \theta \in (0, 1)\) ,使得
\[
\begin{array}{l} f \left(x _ {0} + h, y _ {0} + k\right) = f \left(x _ {0}, y _ {0}\right) + \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) f \left(x _ {0}, y _ {0}\right) \\ + \frac {1}{2 !} \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) ^ {2} f \left(x _ {0}, y _ {0}\right) + \dots + \frac {1}{n !} \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) ^ {n} f \left(x _ {0}, y _ {0}\right) + R _ {n} \tag {11} \\ \end{array}
\]
其中
\[
\begin{array}{l} R _ {n} = \frac {1}{(n + 1) !} \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) ^ {n + 1} f \left(x _ {0} + \theta h, y _ {0} + \theta k\right), \\ \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) ^ {m} f (x _ {0}, y _ {0}) = \sum_ {i = 0} ^ {m} \mathrm{C} _ {m} ^ {i} \frac {\partial^ {m}}{\partial x ^ {i} \partial y ^ {m - i}} f (x _ {0}, y _ {0}) h ^ {i} k ^ {m - i} \\ (m = 1, 2, \dots , n), \\ \end{array}
\]
证 类似于定理 17.8 (中值定理) 的证明,先引入辅助函数
\[
\phi (t) = f \left(x _ {0} + t h, y _ {0} + t k\right).
\]
由假设,\(\phi(t)\) 在 \([0,1]\) 上满足一元函数泰勒公式的条件,于是有
\[
\begin{array}{l} \phi (1) = \phi (0) + \frac {\phi^ {\prime} (0)}{1 !} + \frac {\phi^ {\prime \prime} (0)}{2 !} + \dots \\ + \frac {\phi^ {(n)} (0)}{n !} + \frac {\phi^ {(n + 1)} (\theta)}{(n + 1) !} \quad (0 < \theta < 1). \\ \end{array}
\]
应用复合求导法则,可求得 \(\phi(t)\) 的各阶导数如下:
\[
\phi^ {(m)} (t) = \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) ^ {m} f \left(x _ {0} + t h, y _ {0} + t k\right) \quad (m = 0, 1, \dots , n + 1), \tag {13}
\]
\[
\phi^ {(m)} (0) = \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) ^ {m} f \left(x _ {0}, y _ {0}\right) (m = 0, 1, \dots , n)
\]
\[
\phi^ {(n + 1)} (\theta) = \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) ^ {n + 1} f (x _ {0} + \theta h, y _ {0} + \theta k). \tag {14}
\]
将 (13), (14) 两式代入 (12) 式,就得到所求之泰勒公式 (11).
注 1 前面的中值公式 (8) 正是泰勒公式 (11) 在 n = 0 时的特殊情形.
注 2 若在 (11) 式中只要求 \(R_{n} = \pmb{o}(\rho^{n})\left(\rho = \sqrt{h^{2} + k^{2}}\right)\) ,则仅需 \(\pmb{f}\) 在 \(U(P_0)\) 内存在 \(\pmb{n}\) 阶的连续偏导数即可,此时的 \(n\) 阶泰勒公式可写作
\[
f \left(x _ {0} + h, y _ {0} + k\right) = \sum_ {p = 0} ^ {n} \frac {1}{p !} \left(h \frac {\partial}{\partial x} + k \frac {\partial}{\partial y}\right) ^ {p} f \left(x _ {0}, y _ {0}\right) + o \left(\rho^ {n}\right). \tag {15}
\]
例 (5)
求 \(f(x,y) = x^{y}\) 在点 (1,4) 的泰勒公式 (到二阶为止), 并用它计算 \(1.08^{3.96}\) .
解 由于 \(x_0 = 1, y_0 = 4, n = 2\) , 因此有
\[
\begin{array}{l} f (x, y) = x ^ {y}, \quad f (1, 4) = 1, \\ f _ {x} (x, y) = y x ^ {y - 1}, \quad f _ {x} (1, 4) = 4, \\ f _ {y} (x, y) = x ^ {y} \ln x, \quad f _ {y} (1, 4) = 0, \\ f _ {x ^ {2}} (x, y) = y (y - 1) x ^ {y - 2}, \quad f _ {x ^ {2}} (1, 4) = 1 2, \\ f _ {x y} (x, y) = x ^ {y - 1} + y x ^ {y - 1} \ln x, \quad f _ {x y} (1, 4) = 1, \\ f _ {y ^ {2}} (x, y) = x ^ {y} (\ln x) ^ {2}, \quad f _ {y ^ {2}} (1, 4) = 0. \\ \end{array}
\]
将它们代入泰勒公式 (15), 即有
\[
x ^ {y} = 1 + 4 (x - 1) + 6 (x - 1) ^ {2} + (x - 1) (y - 4) + o \left(\rho^ {2}\right).
\]
若略去余项,并让 \(x = 1.08, y = 3.96\) ,则有
\[
\begin{array}{l} 1. 0 8 ^ {3. 9 6} \approx 1 + 4 \times 0. 0 8 + 6 \times 0. 0 8 ^ {2} - 0. 0 8 \times 0. 0 4 \\ = 1. 3 5 5 2. \\ \end{array}
\]
与 §1 例 7 的结果 (1.32) 相比较,这是更接近于精确值 (1.356307...) 的近似值.
事实上,§1 中的微分近似相当于现在的一阶泰勒公式.
极值问题
多元函数的极值问题是多元函数微分学的重要应用,这里仍以二元函数为例进行讨论.
定义
设 \(f(x,y)\) 在点 \(P_{0}(x_{0},y_{0})\) 的某邻域 \(U(P_{0})\) 内有定义.
若 \(\forall P(x,y)\in U(P_0)\) ,满足 \(f(P)\leq f(P_0)\) (或 \(f(P)\geq f(P_0))\) ,则称 \(f\) 在点 \(P_0\) 取得极大 (或极小)值,点 \(P_0\) 称为 \(f\) 的极大(或极小)值点.
极大值、极小值统称极值;极大值点、极小值点统称极值点.
注意 这里讨论的极值点只限于定义域的内点.
例 (6)
设 \(f(x,y)=2x^{2}+y^{2}, g(x,y)=\sqrt{1-x^{2}-y^{2}}, h(x,y)=xy.\)
由定义知道,原点 \((0,0)\) 是 f 的极小值点,是 g 的极大值点,但不是 h 的极值点.
这是因为 \(\forall(x,y)\) ,恒有 \(f(x,y)\geq f(0,0)=0;\)
又 \(\forall(x,y)\in\{(x,y)\mid x^{2}+y^{2}\leq1\}\) ,恒有 \(g(x,y)\leq g(0,0)=1;\)
对于 \(h\) , 在原点的任意小邻域内,既含有使 \(h(x,y) > 0\) 的点 (I、III 象限), 又含有使 \(h(x,y) < 0\) 的点 (II、IV 象限), 所以 \(h(0,0) = 0\) 既不是极大值也不是极小值.
\((z = xy \text{ 的图像是一马鞍面, } (0, 0, 0) \text{ 为其鞍点}).\)
注 由定义可见,若 \(f\) 在点 \((x_0, y_0)\) 取极值,则当固定 \(y = y_0\) 时,一元函数 \(f(x, y_0)\) 必定在 \(x = x_0\) 取相同极值;\(f(x_0, y)\) 必定在 \(y = y_0\) 也取相同极值.
于是,得到二元函数取极值的必要条件如下:
定理(定理 17.10(极值的必要条件))
若函数 \(f(x,y)\) 在点 \(P_0(x_0,y_0)\) 存在偏导数,且在 \(P_0\) 取得极值,则必有
\[
f _ {x} \left(x _ {0}, y _ {0}\right) = f _ {y} \left(x _ {0}, y _ {0}\right) = 0. \tag {16}
\]
若函数 f 在点 \(P_{0}\) 满足 (16),则称点 \(P_{0}\) 为 f 的稳定点.
上述定理指出:偏导数存在时,极值点必是稳定点.
但要注意:稳定点并不都是极值点.
在例 6 中之所以只讨论原点,就是因为原点是那三个函数的惟一稳定点;而对于函数 \(h\) ,原点虽为其稳定点,但却不是它的极值点.
与一元函数的情形相同,多元函数在偏导数不存在的点处也可能取得极值.
例如 \(f(x, y) = \sqrt{x^2 + y^2}\) 在原点没有偏导数,但 \(f(0, 0) = 0\) 显然是它的极小值.
为了讨论二元函数 \(f\) 在点 \(P_0(x_0, y_0)\) 取得极值的充分条件,我们假定 \(f\) 具有二阶连续偏导数,并记
\[
H _ {f} \left(P _ {0}\right) = \left[ \begin{array}{l l} f _ {x x} \left(P _ {0}\right) & f _ {x y} \left(P _ {0}\right) \\ f _ {y x} \left(P _ {0}\right) & f _ {y y} \left(P _ {0}\right) \end{array} \right] = \left[ \begin{array}{l l} f _ {x x} & f _ {x y} \\ f _ {y x} & f _ {y y} \end{array} \right] _ {P _ {0}}, \tag {17}
\]
称为 f 在点 \(P_{0}\) 的黑赛 (Hesse) 矩阵.
设 \(f(x,y)\) 在点 \(P_0\) 的某领域 \(U(P_0)\) 内具有二阶连续偏导数,且 \(P_0\) 为 \(f\) 的稳定点,则有如下结论:
\[
\left. \begin{array}{l} H _ {f} \left(P _ {0}\right) \text {为正定矩阵} \Rightarrow f \left(P _ {0}\right) \text {为极小值}, \\ H _ {f} \left(P _ {0}\right) \text {为负定矩阵} \Rightarrow f \left(P _ {0}\right) \text {为极大值}, \\ H _ {f} \left(P _ {0}\right) \text {为不定矩阵} \Rightarrow f \left(P _ {0}\right) \text {不是极值}. \end{array} \right\} \tag {18}
\]
证 由 f 在 \(P_{0}\) 的二阶泰勒公式,并注意到条件,于是有
\[
\begin{array}{l} f (x, y) - f \left(x _ {0}, y _ {0}\right) = f \left(x _ {0} + \Delta x, y _ {0} + \Delta y\right) - f \left(x _ {0}, y _ {0}\right) \\ = \frac {1}{2} (\Delta x, \Delta y) H _ {f} (P _ {0}) (\Delta x, \Delta y) ^ {\mathrm{T}} + o (\Delta x ^ {2} + \Delta y ^ {2}). \\ \end{array}
\]
首先证明:当 \(H_{f}(P_{0})\) 正定时, \(f\) 在点 \(P_{0}\) 取得极小值。这是因为,此时对任何 \((\Delta x, \Delta y) \neq (0, 0)\) ,恒使二次型
\[
Q (\Delta x, \Delta y) = (\Delta x, \Delta y) H _ {f} (P _ {0}) (\Delta x, \Delta y) ^ {\mathrm{T}} > 0.
\]
现考察关于 \((u,v) = \left(\frac{\Delta x}{\sqrt{\Delta x^2 + \Delta y^2}},\frac{\Delta y}{\sqrt{\Delta x^2 + \Delta y^2}}\right)\) 的连续函数(仍为一正定二次型)
\[
Q (u, v) = \frac {Q (\Delta x , \Delta y)}{\Delta x ^ {2} + \Delta y ^ {2}} = (u, v) H _ {f} \left(P _ {0}\right) (u, v) ^ {\mathrm{T}}
\]
由于 \((u,v)\) 恒满足 \(u^{2}+v^{2}=1\) ,因此 \(Q(u,v)\) 在此有界闭域上存在最小值 2q>0,于是有
\[
Q (\Delta x, \Delta y) \geq 2 q \left(\Delta x ^ {2} + \Delta y ^ {2}\right)
\]
从而只要 \(U(P_{0})\) 充分小,且 \((x,y)\in U(P_{0})\) , 就有
\[
\begin{array}{l} f (x, y) - f \left(x _ {0}, y _ {0}\right) \geq q \left(\Delta x ^ {2} + \Delta y ^ {2}\right) + o \left(\Delta x ^ {2} + \Delta y ^ {2}\right) \\ = \left(\Delta x ^ {2} + \Delta y ^ {2}\right) \cdot [ q + o (1) ] \geq 0, \\ \end{array}
\]
即 f 在点 \((x_{0}, y_{0})\) 取得极小值.
同理可证:当 \(H_{f}(P_{0})\) 负定时,f 在点 \((x_{0}, y_{0})\) 取得极大值.
最后证明:当 \(H_{f}(P_{0})\) 为不定矩阵时,f 在点 \(P_{0}\) 不取极值.
这是因为,倘若 f 取极值(设为极大值),则沿着过 \(p_{0}\) 的任何直线
\[
x = x _ {0} + t \Delta x, y = y _ {0} + t \Delta y,
\]
\(f(x,y) = f(x_0 + t\Delta x,y_0 + t\Delta y) = \varphi (t)\) 在 \(t = 0\) 亦取极大值.
由一元函数取极值的充分条件,\(\varphi''(0) > 0\) 是不可能的 (否则 \(\varphi\) 在 \(t = 0\) 将取极小值), 故只能 \(\varphi''(0) \leq 0\) . 而
\[
\begin{array}{l} \varphi^ {\prime} (t) = f _ {x} \Delta x + f _ {y} \Delta y, \\ \varphi^ {\prime \prime} (t) = f _ {x x} \Delta x ^ {2} + 2 f _ {x y} \Delta x \Delta y + f _ {y y} \Delta y ^ {2}, \\ \varphi^ {\prime \prime} (0) = (\Delta x, \Delta y) H _ {f} \left(P _ {0}\right) (\Delta x, \Delta y) ^ {\mathrm{T}}, \\ \end{array}
\]
这表明 \(H_{f}(P_{0})\) 必须是负半定的.同理,倘若 \(f\) 取极小值,则将导致 \(H_{f}(P_{0})\) 必须是正半定的.也就是说,当 \(f\) 在 \(P_{0}\) 取得极值时, \(H_{f}(P_{0})\) 必须是正半定的或负半定的,这与假设相矛盾.
根据对称矩阵的定号性与其主子行列式之间的关系,定理 17.11 又可写成如下比较实用的形式:若 \(f\) 如定理 17.11 所设,则有如下结论:
例 (7)
求 \(f(x,y) = x^{2} + 5y^{2} - 6x + 10y + 6\) 的极值.
解 由方程组
\[
\left\{ \begin{array}{l} f _ {x} = 2 x - 6 = 0, \\ f _ {y} = 1 0 y + 1 0 = 0 \end{array} \right.
\]
解出 f 的稳定点 \(P_{0}(3,-1)\) . 因为
\[
f _ {x x} \left(P _ {0}\right) = 2 > 0, f _ {x y} \left(P _ {0}\right) = 0,
\]
\[
f _ {y y} \left(P _ {0}\right) = 1 0, \quad \left(f _ {x x} f _ {y y} - f _ {x y} ^ {2}\right) \left(P _ {0}\right) = 2 0 > 0,
\]
因此 \(f\) 在 \(P_0\) 取得极小值 \(f(3, -1) = -8\) . 又因 \(f\) 处处存在偏导数,故 \(P_0(3, -1)\) 为 \(f\) 的惟一极值点.
例 (8)
讨论 \(f(x,y)=x^{2}+xy\) 是否存在极值.
解 由 \(f_x = 2x + y = 0, f_y = x = 0\) 得原点为稳定点。因 \(\left(f_{xx}f_{yy} - f_{xy}^2\right)|_{(0,0)} = -1 < 0\) , 故原点不是 \(f\) 的极值点。又因 \(f\) 处处可微,所以 \(f\) 没有极值点.
例 (9)
讨论 \(f(x,y)=(y-x^{2})(y-2x^{2})\) 在原点是否取得极值?
解 容易验证原点是 f 的稳定点,且
\[
\left. \left(f _ {x x} f _ {y y} - f _ {x y} ^ {2}\right) \right| _ {(0, 0)} = 0,
\]
故用定理 17.11 无法判断 f 在原点是否取得极值.
但因为在原点的任意小邻域内,当 \(x^{2} < y < 2x^{2}\) 时 \(f(x,y) < 0\) , 而当 \(y > 2x^{2}\) 或 \(y < x^{2}\) 时 \(f(x,y) > 0\) , 所以 \(f(0,0) = 0\) 不是极值 (参见图 17-7).

想求出函数在有界闭域上的最大值和最小值,方法与一元函数问题一样:需先求出在该区域上所有稳定点、无偏导数点处的函数值,还有在区域边界上的这类特殊值;然后比较这些值,其中最大 (小) 者即为问题所求的最大 (小) 值.
例 (10)
证明:圆的所有外切三角形中,以正三角形的面积为最小.
证 设圆的半径为 \(a\) ,任一外切三角形为 \(ABC\) ,三切点处半径相夹的中心角分别为 \(\alpha, \beta, \gamma\) ,其中 \(\gamma = 2\pi - \alpha - \beta\) 。易知 \(\triangle ABC\) 的面积表达式为
\[
\begin{array}{l} S = a ^ {2} \left(\tan {\frac {\alpha}{2}} + \tan {\frac {\beta}{2}} + \tan {\frac {\gamma}{2}}\right) \\ = a ^ {2} \left(\tan {\frac {\alpha}{2}} + \tan {\frac {\beta}{2}} - \tan {\frac {a + \beta}{2}}\right) \\ \end{array}
\]

其中 \(\alpha, \beta \in (\mathbf{0}, \pi)\) . 为求得稳定点,令
\[
\begin{array}{l} \boldsymbol {S} _ {\alpha} = \frac {1}{2} a ^ {2} \left(\sec^ {2} \frac {\alpha}{2} - \sec^ {2} \frac {\alpha + \beta}{2}\right) = 0, \\ \pmb {S} _ {\beta} = \frac {1}{2} \pmb {a} ^ {2} \left(\sec^ {2} \frac {\beta}{2} - \sec^ {2} \frac {\alpha + \beta}{2}\right) = 0. \\ \end{array}
\]
在定义域内,上述方程组仅有惟一解:
\[
\alpha = \beta = \frac {2 \pi}{3}, \gamma = 2 \pi - (\alpha + \beta) = \frac {2 \pi}{3}.
\]
为了应用定理 17.11, 求出在点 \((\alpha, \beta) = \left(\frac{2\pi}{3}, \frac{2\pi}{3}\right)\) 处的二阶偏导数:
\[
S _ {\alpha \alpha} = 4 \sqrt {3} a ^ {2}, S _ {\alpha \beta} = 2 \sqrt {3} a ^ {2}, S _ {\beta \beta} = 4 \sqrt {3} a ^ {2}.
\]
由于 \(S_{\alpha \alpha} > 0, S_{\alpha \alpha} \cdot S_{\beta \beta} - S_{\alpha \beta}^2 = 36a^4 > 0\) ,因此 \(S\) 在此稳定点处取得极小值。因为 \(\alpha = \beta = \gamma\) ,面积函数 \(S\) 在定义域中处处存在偏导数,而具体问题存在最小值,故外切三角形中以正三角形的面积为最小。
例 (*11)
求函数 \(f(x,y)=x^{3}+2x^{2}-2xy+y^{2}\) 的极值和在 \(D=[-2,2]\times[-2,2]\) 上的最大、最小值.
解 (i) 求稳定点:解方程组
\[
\left\{ \begin{array}{l l} f _ {x} (x, y) = 3 x ^ {2} + 4 x - 2 y = 0, \\ f _ {y} (x, y) = - 2 x + 2 y = 0, \end{array} \right.
\]
得稳定
点 \((0,0)\) 和 \((-2/3,-2/3)\) .
(ii) 求极值:由于 \(f(x,y)\) 的黑赛矩阵为
\[
H _ {f} (x, y) = \left[ \begin{array}{c c} 6 x + 4 & - 2 \\ - 2 & 2 \end{array} \right]
\]
并有 \(H_{f}(\mathbf{0},0)=\left[\begin{array}{cc}4 & -2 \\ -2 & 2\end{array}\right]\) (正定),

\[
H _ {f} (- 2 / 3, - 2 / 3) = \left[ \begin{array}{c c} {{0}} & {{- 2}} \\ {{- 2}} & {{2}} \end{array} \right] \quad (\text {不定}),
\]
因此 \(f(0,0)=0\) 为极小值, \(f(-2/3,-2/3)\) 不是极值.
(iii) 求在 \(\partial D\) 上的特殊值:
当 x = -2 时,
\[
f (- 2, y) = 4 y + y ^ {2}, y \in [ - 2, 2 ],
\]
单调增,算出两端值 \(f(-2,-2)=-4\) , \(f(-2,2)=12\) ;
当 x = 2 时,
\[
f (2, y) = 1 6 - 4 y + y ^ {2}, y \in [ - 2, 2 ],
\]
单调减,算出两端值 \(f(2,2)=12\) , \(f(2,-2)=28\) ;
当 \(y = -2\) 时,
\[
f (x, - 2) = x ^ {3} + 2 x ^ {2} + 4 x + 4, x \in [ - 2, 2 ],
\]
由
\[
\frac {d}{d x} f (x, - 2) = 3 x ^ {2} + 4 x + 4 = 3 \left(x + \frac {2}{3}\right) ^ {2} + \frac {8}{3} > 0,
\]
单调增,算出两端值 \(f(-2,-2)=-4\) , \(f(2,-2)=28\) ;
当 y = 2 时,
\[
f (x, 2) = x ^ {3} + 2 x ^ {2} - 4 x + 4, x \in [ - 2, 2 ]
\]
由 \(\frac{d}{dx} f(x,2) = 3x^2 +4x - 4 = 0\) ,得 \(x_{1} = \frac{2}{3},x_{2} = -2\) ,算出 \(f\left(\frac{2}{3},2\right) = \frac{68}{27}\) 与两端值 \(f(\pm 2,2) = 12.\)
(iv) 求在 \(D\) 上的最大、小值:将 (iii) 中五个特殊值与 \(f(0,0) = 0\) , \(f(-2/3, -2/3) = 4/27\) 相比较,便得
\[
\begin{array}{l} \max _ {(x, y) \in D} f (x, y) = f (2, - 2) = 2 8, \\ \min _ {(x, y) \in D} f (x, y) = f (- 2, - 2) = - 4. \\ \end{array}
\]
注 本例中的 \(f\) 在 \(D\) 上虽然只有惟一极值,且为极小值,但它并不因此成为 \(f\) 在 \(D\) 上的最小值。这一点与一元函数是不相同的,务请读者注意!
这在曲面 \(z = x^{3} + 2x^{2} - 2xy + y^{2}\) 的图形中清晰地反映出来.
例 12 (最小二乘法问题) 设通过观察或实验,得到一列点 \((x_{i}, y_{i}), i = 1, 2, \cdots, n.\) 它们大体上在一条直线上,即大体上可用直线方程来反映变量 x 与 y 之间的对应关系 (参见图 17-10).
现要确定一直线,使得与这 n 个点的偏差平方之和为最小 (最小二乘方).
解如图设所求直线方程为
\[
y = a x + b,
\]
所测得的 n 个点为 \((x_{i}, y_{i}), i = 1, 2, \cdots, n.\) 现要确定 a, b, 使得 \(f(a, b) = \sum_{i=1}^{n} (ax_{i} + b - y_{i})^{2}\) 为最小。为此令

把这组关于 a, b 的线性方程加以整理并求解,得
\[
\left\{ \begin{array}{l l} a \sum_ {i = 1} ^ {n} x _ {i} ^ {2} + b \sum_ {i = 1} ^ {n} x _ {i} = \sum_ {i = 1} ^ {n} x _ {i} y _ {i} \\ a \sum_ {i = 1} ^ {n} x _ {i} + b n = \sum_ {i = 1} ^ {n} y _ {i}; \end{array} \right.
\]
\[
\bar {a} = \frac {n \sum_ {i = 1} ^ {n} x _ {i} y _ {i} - (\sum_ {i = 1} ^ {n} x _ {i}) \cdot (\sum_ {i = 1} ^ {n} y _ {i})}{n \sum_ {i = 1} ^ {n} x _ {i} ^ {2} - (\sum_ {i = 1} ^ {n} x _ {i}) ^ {2}}
\]
\[
\bar {b} = \frac {\left(\sum_ {i = 1} ^ {n} x _ {i} ^ {2}\right) \cdot \left(\sum_ {i = 1} ^ {n} y _ {i}\right) - \left(\sum_ {i = 1} ^ {n} x _ {i} y _ {i}\right) \cdot \left(\sum_ {i = 1} ^ {n} x _ {i}\right)}{n \sum_ {i = 1} ^ {n} x _ {i} ^ {2} - \left(\sum_ {i = 1} ^ {n} x _ {i}\right) ^ {2}}
\]
为进一步确定稳定点 \((\bar{a}, \bar{b})\) 是极小值点,需要计算
\[
A = f _ {a a} = 2 \sum_ {i = 1} ^ {n} x _ {i} ^ {2} > 0, \quad B = f _ {a b} = 2 \sum_ {i = 1} ^ {n} x _ {i}, C = f _ {b b} = 2 n
\]
\[
D = A C - B ^ {2} = 4 n \sum_ {i = 1} ^ {n} x _ {i} ^ {2} - 4 \left(\sum_ {i = 1} ^ {n} x _ {i}\right) ^ {2} > 0.
\]
从而根据定理 17.11, \(f(a,b)\) 在点 \((\bar{a},\bar{b})\) 取得极小值,并由实际意义,这极小值即为最小值.
作业
P. 145
数学分析
2025-2026 (2)
18a
沈超敏
计算机科学与技术学院