9

I'm learning the Hamilton formalism of classical mechanics, where a second order differential equation is formalized as two first order differential equations on the cotangent bundle of the configuration manifold. I find the concept of tangent spaces and the notion of the derivative $f_*: TM \to TN$ as a function between tangent spaces very elegant, natural and intuitive. I still struggle, though, with an intuitive understanding of tangent spaces of tangent spaces.

Let the $n$ dimensional configuration space $M$ be a smooth manifold, $\pmb{q} \in M$, then $TM$ is the tangent bundle and $\pmb{v} \in TM$ a tangent vector. Even without local coordinates, every tangent vector can canonically be split into a point $q$ and a vector $\dot q \in T_qM$. Therefore $\pmb v = (q, \dot q)$.

The intuitive notion of a tangent vector is the notion of a change of position or a velocity (thus the notation) starting at a point. Now lets look at the tangent space of the tangent space $TTM$. Let $\pmb a \in TTM$ be tangent vector to $TM$. The intuitive notion of $\pmb a$ is a change of velocity or acceleration. Just as we could do for $TM$, we can split $\pmb a$ into a "point" $(q, \dot q)$ in $TM$ and a vector in $T_{(q, \dot q)}TM$ given by $(\dot{q}, \ddot q)$, with $\dot{q}$ denoting a change of the fiber and $\ddot{q}$ denoting a change of the vector within the same fiber. Combining with the previous, $\pmb a \in TTM$ consists of $(q, \dot{q}, \dot{q}, \ddot{q})$. What you might disregard as a double occupancy in notation, is a real problem for my understanding. It seems like the information about the position change is duplicated, not even necessarily consistently.

  • Which roles do the vector component $\dot{q} \in T_{\pmb q}M$ and the fiber change component $\dot{q} \in TTM$ play generally in manifolds?
  • Which role do they play in the Hamilton formalism (if any different)?
  • How to construct a second derivative $f_{* *}: TTM \to TTN$? How do these components appear there?
  • How, if at all, does this relate to curvature and torsion of curves?
  • How, if at all, does the exterior derivative $dd=0$ or any other relevant derivative relate to this?
  • The Hamilton equations of motion are $\dot{\pmb q} = \frac{\partial H}{\partial \pmb p}, \dot{\pmb p} = -\frac{\partial H}{\partial \pmb q}$ (with $H: T^*M\to\mathbb R$ and $(\pmb q, \pmb p) \in T^*M$). How does the notational double occupancy of $\dot{\pmb q}$ resolve here?
  • In this question I concentrated on $TTM$ while the Hamilton formalism is defined on $TT^*M$. Is there a fundamental difference between $TT^*M$ and $TTM$ that is relevant to the problem in question?
  • 2
    One thing to note is that a vector $\ddot{q}\in TTM$ cannot be split into horizontal and vertical components $\dot{q},\ddot{q}\in TTM$ as you write, at least not in a canonical way. Essentially, $T_{\bf{v}}TM$ has a canonical vertical subspace (parallel to the fibers of $TM$), but the choice of complementary horizontal subspace is not canonical. – Kajelad Jul 07 '20 at 06:51

2 Answers2

11

I'm not sure I'll be able to answer all your questions, but I'll have a go, but first some background to make sure we're on the same page. I think the most intuitive way to view tangent vectors is as an equivalence class of smooth curves in the appropriate manifold. So, an element of the tangent bundle $TM$ is given as an equivalence class $[s\mapsto \gamma(s)]$ of smooth curves $\gamma:\Bbb{R} \to M$ (strictly speaking, it is only defined in a small open interval containing $0$). Understanding this in a chart is also pretty simple I think.

Say we model $M$ on an $m$-dimensional vector space $E$ (typically we take it to be $\Bbb{R}^m$), and that $(U,\alpha)$ is a chart on $M$; then we can construct a chart $(TU,T\alpha)$ for the tangent bundle as \begin{align} T\alpha: TU &\to \alpha[U]\times E\\\\ T\alpha([\gamma]) &:= \left((\alpha\circ \gamma)(0), (\alpha \circ \gamma)'(0)\right) \end{align} In words: we have an equivalence class of curves $[\gamma]$ in the manifold $M$. So, $\alpha \circ \gamma$ is a curve in the vector space $E$. What we do is map $[\gamma]$ to its chart-representative base point and the chart-representative velocity vector.


Which roles do the vector component $\dot{q} \in T_{\pmb q}M$ and the fiber change component $\dot{q} \in TTM$ play generally in manifolds?

Since $T^2M$ is defined as $T(TM)$, it means all we have to do is carefully reapply the definitions again. Now, an element of the second tangent bundle is by definition an equivalence class $[t\mapsto \Gamma(t)]$ of smooth curves into the tangent bundle $\Gamma:\Bbb{R}\to TM$. So, $\Gamma(t) \in TM$, which means it is itself an equivalence class of curves in $M$, say $\Gamma(t) = [s\mapsto \gamma(s,t)]$. So, we have \begin{align} [t\mapsto \Gamma(t)] &= \left[t\mapsto [s\mapsto \gamma(s,t)]\right] \in TTM \end{align} This is what a general element of the second tangent bundle looks like. Once again, we can consider the associated chart $(T^2U, T^2\alpha)$ on the second tangent bundle, and considering how things look in a chart should help to clarify the different roles played by the two $\dot{q}$ you wrote in your post (which tbh I think is an abuse of notation). Ok, so let's calculate: \begin{align} T^2\alpha : T^2U \to (\alpha[U] \times E) \times (E\times E) \end{align} is given by \begin{align} T^2\alpha([\Gamma]) &:= \left((T\alpha \circ \Gamma)(0), (T\alpha \circ \Gamma)'(0)\right) \tag{$*$} \end{align} To proceed further, we should understand what $(T\alpha \circ \Gamma)(t)$ looks like so that we can evaluate at $t$ and evaluate its derivative at $t=0$. By definition, \begin{align} (T\alpha \circ \Gamma)(t) &= T\alpha([s\mapsto \gamma(s,t)])\\ &:= \left((\alpha\circ \gamma)(0,t), \dfrac{d}{ds}\bigg|_{s=0}(\alpha\circ \gamma)(s,t)\right) \tag{$**$} \end{align} So, by plugging $(**)$ into $(*)$, we find that (removing a few brackets) \begin{align} T^2\alpha([\Gamma]) &=\left((\alpha\circ \gamma)(0,0), \dfrac{d}{ds}\bigg|_{s=0}(\alpha\circ \gamma)(s,0),\dfrac{d}{dt}\bigg|_{t=0}(\alpha\circ \gamma)(0,t),\dfrac{d}{dt}\bigg|_{t=0}\dfrac{d}{ds}\bigg|_{s=0}(\alpha\circ \gamma)(s,t)\right) \\\\ &= \left((\alpha\circ \gamma)(0,0), \dfrac{\partial (\alpha\circ \gamma)}{\partial s}\bigg|_{(0,0)}, \dfrac{\partial (\alpha\circ \gamma)}{\partial t}\bigg|_{(0,0)}, \dfrac{\partial^2 (\alpha\circ \gamma)}{\partial t\partial s}\bigg|_{(0,0)}\right) \tag{$\ddot{\smile}$} \end{align}

For the sake of visualization, I like to imagine a doubly parametrized curve $(s,t)\mapsto \gamma(s,t)$ from $\Bbb{R}^2 \to M$, so that its image forms a sort of "2-D grid" on the manifold $M$ (for example, think of $M = \Bbb{R}^2$ and $\gamma(s,t) = (s,t)$; then the image of $\gamma$ is exactly the "coordinate grids" on the plane). So you see although there are two first derivatives appearing above, they are taken with respect to the different variables $s$ and $t$; i.e we're differentiating the doubly-parametrized curve along different directions. This should hopefully make it clear that the two $\dot{q}$ you wrote is an abuse of notation, because they're capturing first-order changes but in "different directions".


Which role do they play in the Hamilton formalism (if any different)?

The answer to your question as stated is that they do not play a role, because Hamiltonian mechanics happens on $T^*M$ (if $M$ is the configuration space). But I think alot of your confusion comes from naively placing dots over the various things. To me it is much clearer to speak of an equivalence class of curves $[s\mapsto \gamma(s)]$ when speaking of a tangent vector in M, because in this notation, the base point is clear; it is $\gamma(0)$. So $[s\mapsto \gamma(s)] \in T_{\gamma(0)}M$. If we go to the second tangent bundle then the general element looks like $[t\mapsto [s\mapsto \gamma(s,t)]]$, and this lies in the specific fiber $[t\mapsto [s\mapsto \gamma(s,t)]]\in T_{[s\mapsto \gamma(s,0)]}(TM)$.

Just for fun, lets see how things look in the third tangent bundle: $[t_3, \mapsto [t_2 \mapsto [t_1 \mapsto \gamma(t_1,t_2,t_3)]]]$ is a general element of $TTTM$, and it happens to lie in the specific fiber $T_{[t_2\mapsto [t_1\mapsto \gamma(t_1,t_2,0)]]}(TTM)$. i.e we just set the "outermost parameter" to $0$, and thats the base point. So, in general on $T^nM$, the general element looks like $[t_n \mapsto \dots [t_1\mapsto \gamma(t_1, \dots, t_n)]]$, and the base point for this is $[t_{n-1}\mapsto \dots [t_1 \mapsto \gamma(t_1, \dots, t_{n-1}, 0)]] \in T^{n-1}M$


How to construct a second derivative $f_{* *}: TTM \to TTN$? How do these components appear there?

Well you just do it recursively. $f_{**}$ is by definition $(f_*)_*$. So, if $[t\mapsto [s\mapsto \gamma(s,t)]] \in TTM$, then \begin{align} f_{**}([t\mapsto [s\mapsto \gamma(s,t)]]) &:= [t\mapsto f_*([s\mapsto \gamma(s,t)])]\\ &:=[t\mapsto [s\mapsto (f\circ\gamma)(s,t)]] \end{align} So, if we take a chart $(V,\beta)$ on the target manifold $N$, and we lift it to a chart $(T^2V, T^2\beta)$, then this looks like (by a simple application of $(\ddot{\smile})$) \begin{align} \left((\beta\circ f\circ \gamma)(0,0), \dfrac{\partial(\beta\circ f\circ \gamma)}{\partial s}\bigg|_{(0,0)}, \dfrac{\partial(\beta\circ f\circ \gamma)}{\partial t}\bigg|_{(0,0)}, \dfrac{\partial^2(\beta\circ f\circ \gamma)}{\partial t\partial s}\bigg|_{(0,0)}\right) \end{align}


  • How, if at all, does this relate to curvature and torsion of curves?
  • How, if at all, does the exterior derivative $dd=0$ or any other relevant derivative relate to this?

I'm not sure, I don't see any obvious links... nothing comes to mind right now.


  • The Hamilton equations of motion are $\dot{\pmb q} = \frac{\partial H}{\partial \pmb p}, \dot{\pmb p} = -\frac{\partial H}{\partial \pmb q}$ (with $H: T^*M\to\mathbb R$ and $(\pmb q, \pmb p) \in T^*M$). How does the notational double occupancy of $\dot{\pmb q}$ resolve here?

I don't see a double usage of $\dot{\pmb q}$ here. One thing to note however is that the symbols $q,\dot{q}, p$ are often used to mean different things in different contexts, so we abuse notation quite often. For example, if I have a manifold $M$ with a chart $(U,\alpha)$, where $\alpha:U \to\alpha[U]\subset \Bbb{R}^n$, then we often define $q^i := \text{pr}^i_{\Bbb{R}^n}\circ \alpha$; this is now a function $U \to \Bbb{R}$. Next, if we consider the chart $(TU, T\alpha)$ on $TM$ then note that $T\alpha$ is a map $TU \to \alpha[U] \times \Bbb{R}^n \subset \Bbb{R}^n \times \Bbb{R}^n$. So, we can consider the 2n coordinate functions $\text{pr}^{\mu}_{\Bbb{R}^{2n}}\circ T\alpha$, where $\mu\in \{1,\dots, 2n\}$. Then, you can prove that for $i\in \{1,\dots, n\}$, $\text{pr}^{i}_{\Bbb{R}^{2n}}\circ T\alpha = q^{i}\circ \pi_{TM}$, where $\pi_{TM}:TM \to M$ is the standard bundle projection. However, people usually omit the composition by $\pi_{TM}$ in the notation, so that $q^{i}$ can stand for either a function on $U$ or $TU$.

If you now introduce the cotangent bundle into the mix, then $q^i$ has $3$ different meanings, either $q^i$ or $q^i \circ \pi_{TM}$ or $q^i \circ \pi_{T^*M}$; which meaning is intended should be decided based on context. Anyway here's an answer I wrote regarding how to interpret Hamilton's equations (though you should note that what you're calling $M$ here is what the OP there calls $Q$, and your $T^*M$ is his $T^*Q = M$).


  • In this question I concentrated on $TTM$ while the Hamilton formalism is defined on $TT^*M$. Is there a fundamental difference between $TT^*M$ and $TTM$ that is relevant to the problem in question?

Perhaps this is just a difference in how we want to say things, but I'd say that Hamiltonian mechanics is formulated on $T^*M$, not $T(T^*M)$. Because the hamiltonian is a function $H:T^*M \to \Bbb{R}$, the symplectic form $\omega$ is a $2$-form defined on $T^*M$. All functions, vector fields and forms are defined on $T^*M$.

So, I'm not sure there's any relevance between $TT^*M$ and $TTM$ which is relevant... though you might be interested to know that given a Lagrangian, i.e a function $L:TM \to \Bbb{R}$, we can consider the fiber derivative $FL:TM \to T^*M$, and if this map is a diffeomorphism, I think it allows us to go back and forth between the Lagrangian and Hamiltonian pictures.

In any case, it is a good idea to work out in detail, starting from a chart $(U,\alpha)$ on $M$, what the charts $(TTU, TT\alpha)$ on $TTM$ looks like (of course I already did this above), and also what the chart $(TT^*U, TT^*\alpha)$ on $TT^*M$ looks like. Part of this exercise is to give precise definitions for these charts as well. Of course, my sugestion to you is to do things in the simple case first: what do the charts on the tangent and cotangent bundle $TM$ and $T^*M$ look like?

Now you just have to proceed recursively, and carefully apply definitions. If at any point you get confused, just introduce new letters, say $N:= TM$ and $S:= T^*M$, and then ask yourself what do charts on $TN$ and $TS$ look like. This is of course slightly confusing, and takes some getting used to, but I think there's no way around it.


I realize I may not have answered your questions to complete satisfaction, but hopefully this is enough to get you started.

peek-a-boo
  • 65,833
  • Thank you very much for this answer. For several weeks I have been trying to see what the chart $(TT^U,TT^\alpha)$ looks like by trying to reproduce what you did but I failed because I am not able to find a good starting point to do it recursively. Could you give me a hint or a trick ? Thanks beforehand. – Mister Da Oct 28 '21 at 15:27
  • @MisterDa start by defining the chart $(T^U,T^\alpha)$ here we should have $T^\alpha:T^U\to \alpha[U]\times E^*$ (i.e a chart-representative of the base point, and the chart-version of the covector). Then, iterate. – peek-a-boo Oct 28 '21 at 15:30
  • I tried this but i failed. Is it a good start : $(\alpha^1\circ\gamma)(0)\dots(\alpha^n\circ\gamma)(0), \left.\frac{\partial}{\partial\alpha^1}\right|{\gamma(0)},\dots \left.\frac{\partial}{\partial\alpha^n}\right|{\gamma(0)}$ ? – Mister Da Oct 28 '21 at 15:53
  • @MisterDa no the $\frac{\partial}{\partial \alpha^i}$ are elements of the tangent space so you don't have the right output. The abstract way of writing this is to note that the chart $(TU,T\alpha)$, $[\gamma]\mapsto \bigg((\alpha\circ\gamma)(0), (\alpha\circ \gamma)'(0)\bigg)$ is such that if we restrict the domain of $T\alpha$ to $T_pM$ only, we get an isomorphism $T_pM\to {\alpha(p)}\times E\cong E$. – peek-a-boo Oct 28 '21 at 16:47
  • Let us call this isomorphism as $\Phi_p$. Then, the dual chart is simply $T^\alpha:T^U\to \alpha[U]\times E^$, given by taking a covector $\omega_p\in T^U$ and mapping it to $\bigg(\alpha(p), \omega_p\circ \Phi_p^{-1}\bigg)\in \alpha[U]\times E^*$. – peek-a-boo Oct 28 '21 at 16:47
  • The perhaps simpler way of writing this is to suppose $E=\Bbb{R}^n$, and then write the coordinate functions as $\alpha(\cdot)=(x^1(\cdot),\dots, x^n(\cdot))$, and say that $\tilde{T}^\alpha:T^U\to \alpha[U]\times \Bbb{R}^n$, $\omega_p\mapsto \bigg(x^1(p),\dots, x^n(p), \omega_1(p),\dots, \omega_n(p)\bigg)$, where we take the covector $\omega_p\in T_p^*M$, and expand it as $\omega(p)=\omega_i(p),dx^i(p)$. – peek-a-boo Oct 28 '21 at 16:52
  • it is good to know both these methods of writing things. FOr the first method, you may like to recap some linear algebra first. Forget all about manifolds. Suppose first that $f:V\to W$ is a linear isomorphism of vector spaces. Then, one can define a linear isomorphism $\tilde{f}:W^\to V^$ simply by taking duals, i.e for each $\omega\in W^$, $\tilde{f}(\omega):= \omega\circ f\in V^$. If we invert this map, we get an isomorphism $\tilde{f}^{-1}:V^\to W^$. In my above example, $V=T_pM$ and $W=E$ and $f$ is the isomorphism given by restricting $T\alpha$ to $T_pM$. – peek-a-boo Oct 28 '21 at 16:56
  • Thank you very much ! I think i did your second method. With your notation, $\left(\left.\frac{\partial}{\partial x^i}\right|p\right){1\leq i\leq n}$ is a basis of $T_pM$ and $\left(dx^j(p)\right)_{1\leq j\leq n}$ the dual basis : $dx^j(p)\left(\left.\frac{\partial}{\partial x^i}\right|_p\right) = \delta^j_i$. Now i see $\frac{\partial}{\partial x^i}$ as an element of the bidual and write $\left.\frac{\partial}{\partial x^i}\right|_p\left(dx^j(p)\right) = \delta^j_i $ so $\left.\frac{\partial}{\partial x^i}\right|_p \colon \omega_j(p)dx^j(p) \mapsto \omega^i(p)$. Is that wrong ? – Mister Da Oct 28 '21 at 18:48
  • @MisterDa yes, that's correct (up to your index placement, it should be $\omega_i(p)$ in the end). Actually, we simply have $\omega_i(p)=\omega_p\left(\frac{\partial}{\partial x^i}(p)\right)$ (so you actually don't even have to go to the double dual; but if you want to, that's also fine). – peek-a-boo Oct 28 '21 at 19:34
  • Thanks again and sorry for the typo. Ok, well, now we identify $T^U$ with $T^M$ and we have to find the differential of this sweet mapping. I think it is better to use your first method, isn't it ? – Mister Da Oct 28 '21 at 20:24
  • Yes for repeated tangent/cotangent bundles I think it's better to use the first method, because even though it's more abstract, it's more concise, and clearer conceptually I think. – peek-a-boo Oct 28 '21 at 21:00
  • Thanks a lot for your precious help. With the equivalent curve class approach, we consider a smooth curve $\Gamma\colon \mathbb{R}\to TM$, $t\mapsto \Gamma(t)$ with $\Gamma(t)\in T_{\gamma(t)}M\subset TM$.

    Then $TT^\alpha (\Gamma) = (, (T^\alpha \circ \Gamma)(0), (T^*\alpha \circ \Gamma)'(0),) = (, \alpha(\gamma(0)), \Gamma(0)\circ \Phi_{\gamma(0)}^{-1},????,) $. Am I on the right road ?

    – Mister Da Oct 29 '21 at 08:51
  • I am totally stuck. In my previous message, i forgot stars. We must read $\Gamma\colon I \to T^M$, $t\mapsto\Gamma(t)$ where $I$ is an open set of $\mathbb{R}$. For each $t$ in $I$, $\gamma(t)\in M$ is the result of the canonical projection of $\Gamma(t)$ on $M$. Now I don't understand how to write $(T^\alpha\circ\Gamma)(t)$ with the isomorphism $\Phi$. I tried to consider the class of equivalent curve $[\gamma(t)]$ and i introduced a representative curve $s\mapsto \gamma(s,t)$ but i am totally lost. Could you help me one more time please ? Thanks beforehand. – Mister Da Feb 04 '22 at 23:05
1

peek-a-boo answered this question terrifically well and detailed. I want to document the conclusions I drew from their answer and how I think my confusion is to be resolved:

The Hamilton equations of motion are $\dot{\pmb q} = \frac{\partial H}{\partial \pmb p}, \dot{\pmb p} = -\frac{\partial H}{\partial \pmb q}$. How does the notational double occupancy of $\dot{\pmb q}$ resolve here?

Let me answer this first, because it lays a ground for the remaining struggles. This question and most of the other confusion came from identifying both curve parameters $s$ and $t$ (as defined in peek-a-boos answer) with time. In mechanics $\dot q$ is understood to be a velocity, ie. shift of location in time and $\dot v$ is meant to be an acceleration, ie. a change of velocity in time.

Most important to resolve my confusion is to realize what "$\pmb p$ is an independent variable in Hamiltonian mechanics" means. In my words: Inertia (the physical fact that change of position is linked to momentum) is not encoded into the mathematical formalism. It is, instead, encoded into the Hamilton function (by the fact that for a free mechanical system the Hamiltonian is not zero, but $H=\frac{p^2}{2m}$).

Which roles do the vector component $\dot{q} \in T_{\pmb q}M$ and the fiber change component $\dot{q} \in T_{\dot{\pmb q}}TM$ play generally in manifolds? Which role do they play in the Hamilton formalism?

In Hamilton mechanics the vector component $\dot{q} \in T_{\pmb q}M$ does not really play a role, because Hamiltonian mechanics is defined on $T^*M$. Thus, let's instead talk about $\pmb p \in T^*_{\pmb q}M$. This is part of the momentary state of the system and as I wrote above the mathematical formalism does not link it to the timely shift of position. Instead, the fiber change component $\dot{q} \in T_{\dot{\pmb q}}T^*M$ as well as the velocity change component $\dot{p} \in TT_{\pmb p}^*M$ are to be considered differences between almost-identical experimental setups.

The latter point can be seen when we consider the differential $\mathrm{d}H \in T^*T^*M$, which is a linear function that maps a change of position and/or momentum of a system to the change in the Hamiltonian (aka it's total energy). Ie. it answeres the question "How would the total energy and therefore the outcome of the experiment change if I changed the initial conditions by $(\dot{\pmb q}, \dot{\pmb p}) \in TT^*M$?"

Only then, this is linked to a timely shift in position and momentum by $\dot{\pmb x} = I\mathrm{d}H$ (where $\pmb x = (\pmb q, \pmb p) \in T^*M$) resp. $\dot{\pmb q} = \frac{\partial H}{\partial \pmb p}, \dot{\pmb p} = -\frac{\partial H}{\partial \pmb q}$.

Finally, the fact that $\dot{\pmb x}$ is indeed an element of $TT^*M$ and represents a timely shift does not mean, that in general elements of $TT^*M$ represent timely shifts. $I\mathrm{d}H: T^*M \to TT^*M$ is a vector field, which links those two things. Constructed from infinitesimal changes in experimental setup it outputs the timely shift in position and velocity. And that's the very core of Hamilton formalism.