Tuesday, March 20, 2018

calculus - The notation for partial derivatives



Today, in my lesson, I was introduced to partial derivatives. One of the things that confuses me is the notation. I hope that I am wrong and hope the community can contribute to my learning. In single-variable calculus, we know that, given a function $y = f(x)$, the derivative of $y$ is denoted as $\frac{dy}{dx}$. I understand this as the relative change in $y$, $\delta y$, given a small change in $x$, $\delta x$.



However, in today's lesson on partial derivatives, my professor constantly used the following notation.




Given a function $z = f(x,y)$, the first derivative with respect to $x$ is written as



$$ \frac{\partial z}{\partial x} $$



So, for example



$$
z = 5x+3y\\
\frac{\partial z}{\partial x} = 5
$$




Why can't I just write it as
$$
z = 5x+3y\\
\frac{d z}{d x} = 5
$$



Is it some convention or am I not understanding something in the notation?


Answer



First, rest assured that you're not the only one who's confused by the standard notation for partial derivatives. See this answer for a collection of answers I've written in response to such confusions.




The problem is that the standard notation doesn't indicate which variables are being held constant. It assumes that you've defined a function of a certain set of variables, and that everyone remembers what these are. That's fine if you only introduce a single function and write its partial derivatives as



$$
\frac{\partial f(x,y,z)}{\partial x}
$$



and the like, since the arguments for the function evaluation make up for what the notation for the partial derivative is missing, but it becomes a problem when you start writing things like



$$
\frac{\partial f}{\partial x}
$$



and especially when you have lots of things like $x,y,z$ floating around that all look like variables and the notation doesn't contain the slightest clue which of these are being treated as functions and which as independent variables being held constant.
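
To see the ambiguity concretely, here is a small sketch using the question's function; the relation $y = x^2$ is an assumption introduced only for illustration and is not part of the original question. If $y$ is held constant, differentiating $z = 5x + 3y$ with respect to $x$ gives

$$
\frac{\partial z}{\partial x} = 5,
$$

whereas if $y$ is treated as a function of $x$ with $y = x^2$, the chain rule gives

$$
\frac{\mathrm dz}{\mathrm dx} = 5 + 3\,\frac{\mathrm dy}{\mathrm dx} = 5 + 6x.
$$

The same expression for $z$ yields two different derivatives, and nothing in a bare $\dfrac{\partial z}{\partial x}$ or $\dfrac{\mathrm dz}{\mathrm dx}$ records which reading is intended unless the roles of $x$ and $y$ have been stated.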



In a certain sense, you're right that you could always regard $\dfrac{\partial f}{\partial x}$ as $\dfrac{\mathrm df}{\mathrm dx}$ of a univariate function, namely by regarding all other variables as parameters. That is, given a function $f(x,y)$ of two variables, you can regard $y$ as a fixed parameter and write $g(x)=f(x,y)$, and then $\dfrac{\mathrm dg}{\mathrm dx}=\dfrac{\partial f}{\partial x}$. Then if you feel things aren't quite confusing enough already, you can instead call this new univariate function by the same name as the multivariate function $f$ and write $\vphantom{\dfrac{\partial f}{\partial x}}f(x)=f(x,y)$, and then indeed $\dfrac{\mathrm df}{\mathrm dx}=\dfrac{\partial f}{\partial x}$, but you need to remember what you mean by that: $\dfrac{\mathrm df(x)}{\mathrm dx}=\dfrac{\partial f(x,y)}{\partial x}$, with two different uses of the symbol $f$.
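
Applied to the question's example, and assuming $y$ is treated as a fixed parameter, this construction can be sketched as follows. With $g(x) = f(x,y) = 5x + 3y$, the ordinary difference quotient in $x$ is

$$
\frac{\mathrm dg}{\mathrm dx}
= \lim_{h \to 0} \frac{f(x+h,\,y) - f(x,\,y)}{h}
= \lim_{h \to 0} \frac{5(x+h) + 3y - (5x + 3y)}{h}
= \lim_{h \to 0} \frac{5h}{h}
= 5
= \frac{\partial f}{\partial x}.
$$

The term $3y$ cancels only because $y$ is the same in both evaluations of $f$; that is exactly what "holding $y$ constant" means here.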



However, this view is rarely very helpful, since the variables of a multivariate function are usually variables on an equal footing for good reason, and one would usually have introduced them as fixed parameters in the first place if that were the natural way to think of them. So usually it's better to view the univariate function that you get by keeping all but one variable fixed as a more temporary construct that's used only for defining and thinking about the partial derivative, but not as something that should appear in the notation as a univariate function in its own right.

