What is the affine connection, and what is the intuition behind/for affine connection?

Question

Here is the definition of affine connection, as appears in Milnor's book Morse Theory.

DEFINITION. An affine connection at a point $p \in \text{M}$ is a function which assigns to each tangent vector $\text{X}_p \in \text{TM}_p$ and to each vector field $\text{Y}$ a new tangent vector$$\text{X}_p \vdash \text{Y} \in \text{TM}_p$$called the covariant derivative of $\text{Y}$ in the direction $\text{X}_p$.

(Note that our $\text{X} \vdash \text{Y}$ coincides with Nomizu's $\nabla_\text{X} \text{Y}$. The notation is intended to suggest that the differential operator $\text{X}$ acts on the vector field $\text{Y}$.)

This is required to be bilinear as a function of $\text{X}_p$ and $\text{Y}$. Furthermore, if$$f: \text{M} \to \mathbb{R}$$is a real valued function, and if $f\text{Y}$ denotes the vector field$$(f\text{Y})_q = f(q)\text{Y}_q$$then $\vdash$ is required to satisfy the identity$$\text{X}_p \vdash (f\text{Y}) = (\text{X}_p f)\text{Y}_p + f(p) \text{X}_p \vdash \text{Y}.$$

(As usual, $\text{X}_p$ denotes the directional derivative of $f$ in the direction of $\text{X}_p$.)

I have two questions.

This definition of affine connection is quite terse here—I'm just seeing text on a page and not really understanding what is going on here. Is it possible somebody could help me parse through/explain what is really being said here with regards to affine connection?
Could somebody supply their intuitions behind/for affine connections?

Thanks.

the affine connection gives you a relationsship between tangent spaces at different (close) points on a manifold $M$. this is obviously important to define derivaties of vector(fields) on $M$ which is for sure something useful. It is defined such that usual rules from differential calculus like the Leibnitzrule can be fulfiled. — tired, Feb 15 '17 at 07:34
You'd better consult with another textbook in riemannian geometry. There are several points of view on (affine) connections, like Ehresmann connections or differential operators, but I believe the most elementary one is just an (set of) operator defined on the vector space of tangent bundles which follows the rule of "derivatives"(in usual way), as given in the last identity, and you can think of it as a designation of a partial derivative on a manifold. BTW, Milnor's notation for connections is largely obsolete one; the one with nable is more common. — cjackal, Feb 15 '17 at 07:34
furthermore check out the link between paralell transport and connections, this should strengthen your intuition a lot :) — tired, Feb 15 '17 at 07:36

score 54 · Answer 1 · edited Nov 19 '18 at 11:57

There is a lot to be said on the subject, but the least technical point of view (in my opinion) is the following:

Consider first the situation in $\mathbb{R}^n$. Let $X,Y \colon \mathbb{R}^n \rightarrow \mathbb{R}^n$ be vector fields. To define the directional derivative of the vector field $X$ in the direction of the vector field $Y$ at a point $p \in \mathbb{R}^n$, we can mimic usual definition of directional derivative:

$$ (\nabla_Y X)(p) := \lim_{t \to 0} \frac{X(p + tY(p)) - X(p)}{t}. $$

The result $(\nabla_Y X)$ is a vector field on $\mathbb{R}^n$. You can check that the operation $\nabla$ defined as above satisfies the following two properties:

$\nabla_{fY}(X) = f\nabla_Y X$.
$\nabla_Y(fX) = (Yf)X + f\nabla_YX$.

Here, $X,Y \colon \mathbb{R}^n \rightarrow \mathbb{R}^n$ are vector fields and $f \colon \mathbb{R}^n \rightarrow \mathbb{R}$ is a scalar function. The function $Yf$ (at a point $p$) is the directional derivative of $f$ at $p$ in the direction $Y(p)$.

Now let us try and mimic the above construction on a general manifold. Given vector fields $X,Y \in \mathfrak{X}(M)$, we try to use the same formula and define

$$ (\nabla_Y X)(p) := \lim_{t \to 0} \frac{X(p + tY(p)) - X(p)}{t}. $$

However, we see that there are two problems. First, the expression $X(p + tY(p))$ is not defined because we don't have a way of adding a point $p \in M$ to a tangent vector $tY(p) \in T_pM$. This is not so bad because we can actually replace the expression $p + tY(p)$ with any curve "which goes in the direction $Y(p)$" such as the flow $\varphi_t^Y(p)$. The more serious problem is that we need to subtract the tangent vector $X(p) \in T_pM$ from the tangent vector $X(\varphi_t^Y(p)) \in T_{\varphi_t^Y(p)}$ and those are two tangent vectors that belong to different vector spaces. In general, without any extra data, we have no way of identifying tangent spaces at different points of $M$.

To summarize, we see that we can differentiate vector fields along vector fields without any problem on $\mathbb{R}^n$ but we encounter problems when we try and do it on a general manifold. But $\mathbb{R}^n$ is also a manifold so what makes it special? The fact that it is not only a manifold but a vector space and an affine space and so we can add points to vectors and identify tangent spaces at different points using translations. This is something we don't have on a general manifold.

The definition of an affine connection is meant to supply the manifold $M$ "externally" with an operation $\nabla \colon \mathfrak{X}(M) \times \mathfrak{X}(M) \rightarrow \mathfrak{X}(M)$ which satisfies properties $(1)-(2)$ and so allows us to differentiate vector fields along vector fields. That is, instead of defining the directional derivative of a vector field along a vector field, we require that somebody handles us a mechanism $\nabla$ which satisfies the properties that the familiar derivative satisfied on $\mathbb{R}^n$ and then we will think of it as a directional derivative.

Obviously this raises quite a lot of questions. Does such mechanism always exists? (Yes). Is it unique? (No). Is there a natural choice of such differentiation mechanism? (Yes, under certain circumstances). Can we use this mechanism to recover the ability to identify tangent vectors at different points that was necessary to define the regular directional derivative in $\mathbb{R}^n$? (Yes, at least along curves. This leads to the notion of parallel transport). I refer you to the extensive article on the covariant derivative (which is pretty much another name for an affine connection) on wikipedia for further details.

If there is a book or lecture notes on differential geometry with a similar ("bottom-up", from very simple and known concepts like a calculus or differential geometry in $\mathbb{R}^n$ to more advanced concepts in less trivial and known settings) approach then you recommendation for such book would be highly appreciated! :) — Evgeny, Feb 15 '17 at 09:53
I'm sorry, but I can't recommend anything off the top of my head which is entirely bottom-up. IMO, the best books for a beginner in differential geometry or Riemannian geometry are the books by Lee (Introduction to Smooth Manifolds, Introduction to Curvature). He is not sparse on details, tries to motivate everything, gives precise definitions and obviously spent a lot of thought on how to organize the material, which notation to use and how to treat corner cases. If you find them too advanced, you can start with "Curves and Surfaces" by Montiel and Ros — levap, Feb 15 '17 at 10:22
which covers differential geometry in $\mathbb{R}^n$ but prepares you well to do abstract differential geometry. — levap, Feb 15 '17 at 10:22
is there by any chance a $\nabla$ missing in your second equation? If not, what kind of object is $(Y f)X$? — glS, Dec 03 '19 at 13:38
@glS: The expression $Yf$ is a function which is the result of differentiating $f$ in the direction of the vector field $Y$ at each point. This is indeed sometimes denoted by $\nabla_Y f$ so that (2) looks like a "product rule". Another common notation for $Yf$ is $df(Y)$. I used Milnor's notation. The expression $(Yf)X$ is a function times a vector field so it is a vector field. — levap, Dec 03 '19 at 15:06

score 5 · Answer 2 · answered Feb 17 '17 at 08:05

Intuition come from mechanics as usual in differential geometry. Assume you are in a car moving with a law $P(t)$. in your car, there is a compass which gives you the magnetic vector field say $ \vec M$, note that this vector field is globally defined on the earth, but what you see is $ \vec M _{P(t)}$. Now, in your car you see the direction of the compass changing at every time, and you can compute ${d\over dt} \vec M _{P(t)}$. It appears that this vector only depends on the speed ${\vec V}= {d\over dt} P(t)$ you have at the instant $t$. It is written either ${D\over dt} \vec M _{P(t)}$ or $ \nabla _{\vec V}{\vec M}$. In order to prove this you can compute in coordinates, and check that this derivative is nothing else but the orthogonal projection of the usual derivative on the tangent plane. Doing this carefully you will "rediscover" Christoffel symbols, and find all properties of the affine connexion, which is nothing else but the operator which enable you to compute the derivate, called the "covariant" derivative.

What is the affine connection, and what is the intuition behind/for affine connection?

2 Answers2

Linked

Related