-1

My question is very simple, I'm a beginner student in real analysis in several variables and I don't understand the meaning of the $\cdot$ in the expression: $(g\circ f)'(a)=g'(f(a))\cdot f'(a):\mathbb R^m\to\mathbb R^p$. Does it mean a composition? Why doesn't the author denote this using the classical $\circ$ symbol?

Chain Rule: Let $U\in \mathbb R^m, V\in \mathbb R^n$ be open sets and $f:U\to\mathbb R^n$ differentiable at the point $a$, with $f(U)\subset V$ and $g:V\to \mathbb R^p$ differentiable at the point $f(a)$. Then $g\circ f:U\to \mathbb R^p$ is differentiable at the point $a$ with $(g\circ f)'(a)=g'(f(a))\cdot f'(a):\mathbb R^m\to\mathbb R^p$

MJD
  • 67,568
  • 43
  • 308
  • 617
user42912
  • 24,130

1 Answers1

5

The $\cdot$ actually stands for matrix multiplication. Recall that $f'(a)$ is an $m \times n$ matrix for any $a$.


Recall that $f'(a)$ is actually a linear map from $\Bbb R^n$ to $\Bbb R^m$. That is, if $\mathcal L(\Bbb R^n,\Bbb R^m)$ denotes the space of such maps, then $$ f': \Bbb R^n \to \mathcal L(\Bbb R^n,\Bbb R^m) $$ Consequently, $f'(a)$ actually has a suppressed argument: if $h \in \Bbb R^n$, then $[f'(a)](h) \in \Bbb R^m$.

With that, we may write the chain rule properly as follows: $$ [(g \circ f)'(a)](h) = [g'(f(a))]([f'(a)](h)) $$ In other words, there are two levels of composition here. One is denoted by $\circ$, the other is denoted by $\cdot$. For linear maps, the tendency is to think of composition as "matrix multiplication", hence the answers given.

Ben Grossmann
  • 234,171
  • 12
  • 184
  • 355