18

In the definition of measure-preserving dynamical system, the crucial equation is

$$ \mu \left(T^{-1} \left(A\right)\right) = \mu\left(A\right) . $$

Why is it not the seemingly more natural

$$ \mu \left(T \left(A\right)\right) = \mu \left(A\right) ? $$

Davide Giraudo
  • 181,608
S. Kohn
  • 1,128
  • 6
    They are the same for invertible maps, and the second one almost never happens for non-invertible maps. – GEdgar May 02 '16 at 15:24

3 Answers3

14

A simple answer is that if you consider a measurable map $T$, then in general $T(A)$ need not be measurable when $A$ is measurable and thus $\mu(T(A))$ need not be defined. On the other hand, (if $T$ is measurable, then) $T^{-1}(A)$ will always be defined. This could however be done for a measurable version of the usually called proper maps (for which the image of an open set is open).

A less simple answer (and somewhat more important) is that with the alternative that you mention, one would leave out of the game many noninvertible transformation: all expanding maps of the circle or of any torus, all toral endomorphisms that are not autormophisms, etc, etc, and this even only for the Lebesgue measure. Now imagine one-sided shifts and the associated Markov or Bernoulli measures. Symbolic dynamics and their measurable counterpart would again be out of the game in the noninvertible case.

John B
  • 17,726
  • 6
    Frankly, this is not a justification. If $\mu(T(A))$ were really what we wanted, we should have defined a measurable map to be a map for which $T(A)$ is measurable whenever $A$ is measurable. – Blackbird Jun 16 '17 at 01:41
11

I'll try to convince you that using the inverse image is more natural. For $n \geq 0$ and measurable $A$,

$$T^{-n} (A) = \{x: T^n (x) \in A\}.$$

Hence, saying that $\mu$ is $T$-invariant is the same as saying that $\mu (\{T^n (x) \in A\})$ does not depend on $n$. Or, in other words, if you launch many trajectories according to the distribution $\mu$, then for each $n$, the proportion of points in $A$ at time $n$ is $\mu (\{T^n (x) \in A\}) = \mu (A)$, which is what you expect of an invariant measure.

D. Thomine
  • 11,228
10

The reason is simply in the interpretation, which is sadly too often lost in mindless formalism.

Let me tell you the interpretation when $\mu$ is a probability measure. If $x$ is chosen at random according to $\mu$, what will be the probability distribution of $Tx$? The event that $Tx$ is in a measurable set $A$ is the same as the event that $x$ is in $T^{-1}A$. Hence $Tx$ is distributed according to a measure $T^*\mu$ where $(T^*\mu)(A):=\mu(T^{-1}A)$ for each $A$. Saying that $\mu$ is invariant thus simply means that $T^*\mu=\mu$, that is, if $x$ is chosen according to $\mu$, then the distribution of $Tx$ is also $\mu$.

For the cases where $\mu$ is not a probability measure, you should ask for the interpretation from people who study such measures. One famous example is in Hamiltonian dynamics, where Liouville's theorem states that the volume measure (i.e., the Lebesgue measure) on the phase space is invariant. (There, the time is continuous, but you can safely ignore that here.) The interpretation is the following: mark a large number of points in the phase space and track the trajectory of the system starting from each of these marked points. Liouville's theorem now states that if the marks are initially chosen roughly uniformly in the phase space, then the distribution of the marks at any moment in time remains roughly uniform (hence, the marks are neither compressed together nor stretched away in any region of the phase space). I leave it to you to verify that this translates to saying that $\mu(T^{-t}A)=\mu(A)$ for each measurable set $A$ and any time $t$, where $\mu$ is the volume measure.

Blackbird
  • 2,015
  • 13
  • 14
  • Hamiltonian systems give rise to flows, not semiflows, and it is a basic fault to say that "the marks are neither compressed together nor stretched away in any region of the phase space" since one can have expansion and contraction (take say the basic Hamiltonian on constant negative curvature). – John B Jun 16 '17 at 07:33
  • 1
    Sorry, I'm not following. Sure, the marks can be expanded in one direction, but must be contracted in another (and vice versa) in such a way that in overall, the number of marks in any hypercubic region remains roughly the same. – Blackbird Jun 17 '17 at 00:46
  • I don't see what else "the marks are neither compressed together nor stretched away in any region of the phase space" could mean. – John B Jun 17 '17 at 10:20
  • It means what it says: as in an incompressible fulid, the marks will not be compressed into having a higher density in a region, nor will they be stretched into having lower density in a region. – Blackbird Jun 17 '17 at 16:46
  • OK, then you should correct it according to my detailed comment. Also, you should avoid wording that comes from physics. – John B Jun 17 '17 at 18:11
  • 1
    I find my answer clear as it is, but thank you for your comments and suggestion. Why do you say I should avoid wording that comes from physics? – Blackbird Jun 17 '17 at 18:39
  • @Blackbird John B objections do seem to have some validity: 1. If we have a group action instead of a monoid action, then the two properties that are being asked about are equivalent, so giving the example of a group action implicitly dismisses the question as pedantry. This seems indeed to be the main premise of the answer, but then one would need to add a justification to why monoid actions should never be considered. – Alp Uzman Oct 08 '22 at 18:35
  • Having roughly the same number of marks as they move on regions of equal area is a weaker property than them getting closer or farther from each other; indeed both $ \begin{pmatrix}
  • 0 & 1 \ -1 & 0 \end{pmatrix} $ and $ \begin{pmatrix} 2 & 1 \ 1 & 1 \end{pmatrix} $ preserve Haar measure on the plane, but it doesn't seem accurate that neither of these stretches or compresses marks.

    – Alp Uzman Oct 08 '22 at 18:40
  • What distinguishes them (in terms of distributions of points) is that for the second map there is a special direction along which marks do get compressed (and another special direction along which marks get compressed in negative time); this is the driving force in the (pseudo-)Anosov paradigm. – Alp Uzman Oct 08 '22 at 18:44