9

I am reading this paper (A Geometric Proof of the Perron-Frobenius Theorem, A. Borobia, U. R. Trias, Revista Mathematica de la Universidad Conplutense de Madrid, Vol. 5, 1992) where a short geometric proof of the Perron-Frobenius theorem is given. I am having trouble at one place, which I articulate below. I sacrifice some generality in service of simplicity.

Let $A$ be an $n\times n$ matrix with positive real entries and $T:\mathbf R^n\to \mathbf R^n$ be the linear map whose matrix representation with respect to the standard basis is same as $A$. Define $$C=\{(x_1, \ldots, x_n)\in \mathbf R^n:\ x_i>0\}, \quad \bar C=\{(x_1, \ldots, x_n)\in \mathbf R^n:\ x_i\geq 0\}$$

The following are true:

Theorem 1. There is a positive eigenvalue of $T$ with corresponding eigenvector in $C$.

Theorem 2. If $\lambda$ is an eigenvalue as in Theorem 1, then the geometric multiplicity of $\lambda$ is $1$.

Theorem 3. If $\lambda$ is an eigenvalue as in Theorem 1, then the algebraic multiplicity of $\lambda$ is $1$.

(Clearly, Theorem $3$ subsumes Theorem 2.)

Theorem 1 can be proved using Brouwer's fixed point theorem (BFPT). We notice that if $R$ is the collection of all rays in $\mathbf R^n$ of the form $\{av:\ a\geq 0\}$ for some $v\in \mathbf R^n$ having all entries non-negative, then $R$ is fixed, as a set, by $T$. But $R$ is homeomorphic to the $n-1$ disc, and thus by BFPT we see that there is some ray in $R$ that is fixed by $T$. This immediately gives $1$. (The fixed ray cannot lie in $\partial C$ because all entries of $A$ are positive.)

For Theorem 2 we argue as follows. Let $\lambda$ be a positive eigenvalue of $T$ with $v$ as a corresponding eigenvector, all of whose entries are positive. If the geometric multiplicity of $\lambda$ is not $1$, then there is a vector $u\notin \text{span}(v)$ with $Tu=\lambda u$. Let $V$ be the plane spanned by $u$ and $v$. Each ray in $V$ is fixed by $T$. But there is a ray in $V$ spanned by a vector in $\partial C$, which cannot remain fixed under $T$, giving a contradiction.

I am stuck with the proof of Theorem 3. The proof in the above-cited paper proceeds as follows. Let $\lambda$ be a positive eigenvalue with the corresponding eigenvector $v$ having all entries positive. Assume that the algebraic multiplicity of $\lambda$ is more than $1$. Then we can find a $T$-invariant plane $U$ containing $\text{span}(v)$. Let $S^1$ be identified with the set of rays in $U$. Let $r$ and $-r$ denote the rays spanned by $v$ and $-v$ respectively. By Theorme 2, $S^1$ is not point-wise fixed under the action of $T$.

And here is what I don't follow:

The set of points in $S^1$ which are fixed by the action of $T^2$ does not consist only of $r$ and $-r$. Other wise the dynamics of the action of $T^2$ over $S^1$ looks like this

enter image description here

Here $L$ is the arc of $S^1$ formed by the intersection of $S^1$ with the set of rays in $\bar C$.

EDIT: Why $T^2$ needs to have a fixed point apart from $r$ and $-r$ can be argued as follows: Assume on the contrary. Note that $T^2$ is orientation preserving. So the arc "above" the points $-r$ and $r$ (including $-r$ and $r$) is mapped to itself under $T^2$. Since this is homeomorphic to the closed interval, either all points in the open arc converge to $r$ under iterates of $T^2$, or all points of the open arc converge to $-r$ under iterates of $T^2$. But the sets $C$ and $-C:=\{-x:\ x\in C\}$ are invariant under $T^2$ so we get a contradiction.


Can somebody explain this last piece of reasoning. And how is it helping us deduce that the algebraic multiplicity of $\lambda$ is $1$. This argument is at the bottom of the second page of the paper I mentioned.

2 Answers2

2

Note that Jordan form implies that there is invariant 2-plane $P$ containing $v$. If there is $w\in P$ s.t. $Tw=0$, then it is a contradiction by considering a linear map $T$ on $C$.

Hence we have a diffeomorphism $f: S^1\rightarrow S^1,\ f(w )=\frac{T^2w}{|T^2w|}$ s.t. $f(\pm v)=\pm v$ (cf. By orientation, we consider $T^2$).

Define a vector field $X =d(x,f(x)) e$ where $d$ is a metric on $S^1$ and $e$ is unit vector pointing $f(x)$ at $x\in S^1$. By continuity, there is a fixed point of $f$, which is not in $\{ \pm v\}$.

[old] In further, considering $C$, $ f$ sends a compact set $ S^1-C-(-C)$ onto $S^1-C-(-C)$.

For $x_1\in S^1-C-(-C)$, let $x_n=f(x_{n-1})$. By compactness, $x_n\rightarrow x$ so that $f(x_n)\rightarrow f(x)$. That is, $f$ has a fixed point. It is a contradiction.

HK Lee
  • 20,532
  • 1
    Thanks for answering. We generate a sequence $(x_n)$ by inductively defining $f(x_{n-1})=x_n$, where $x_1$ is an arbitrarily chosen point in $S^1$. Is that right? Then by compactness we get a subsequence of $(x_n)$ which converges. Why does the sequence $(x_n)$ itself converge? Thanks. – caffeinemachine May 02 '18 at 04:44
  • 1
    So you have shown that $T^2$ has an eigenvector in $P$ different from $v, -v$. Can you also comment as to why this shows that the algebraic multiplicity is $1$. Thanks. – caffeinemachine May 02 '18 at 05:17
  • 1
    That is, $T^2$ is $\lambda I$ on $P$ where $T$ is upper triangular matrix. Hence $T$ is a diagonal matrix. – HK Lee May 02 '18 at 05:23
  • Thanks. I'll take some more time and then accept. Can you tell me where can I read about dynamical systems. Just some basics like attracting/repelling fixed points, bifurcations etc. It is better if a global treatment can be found. By which I mean "dynamics on manifolds". – caffeinemachine May 02 '18 at 05:27
  • I recommend the note about Ergodic theory, which I did not read. It has used in some metric geometry paper. In further, I suggest that you would ask more on dynamical system to other, since I do not know well. – HK Lee May 02 '18 at 05:37
  • I have the same confusion as I expressed in my first comment. The sequence $(x_n)$ that we generate is sure to have a convergent subsequence. How are we able to say that the sequence $(x_n)$ itself converges. Are we using the special property of $f$ that it is the "projectivized" version of a linear map? – caffeinemachine May 02 '18 at 05:37
  • @user1551 : You are right. I modified. – HK Lee May 02 '18 at 09:38
  • 2
    Would you please elaborate on the line "By continuity, there is a fixed point of $f$, which is not in ${ \pm v}$"? I still don't get it. Thanks. – user1551 May 02 '18 at 12:35
  • If not, then note that $X(x)=-X(-x)$. If $w$ is near at $-v$, then $f^n(w)$ is close to $v$. This is a contradiction since all entries of $T$ are positive and all entries of $w$ are negative. – HK Lee May 02 '18 at 14:42
2

I don't understand the figure in the paper, but we may argue as follows.

Suppose the eigenvalue $\lambda>0$ with eigenvector $v>0$ is not simple. Then there exists some vector $u$, which is independent of $v$, such that $Tu=\lambda u+cv$ for some $c\in\{0,1\}$. Pick a real number $t$ such that $w=tv-u\in\partial\overline C$. This is always possible because $w>0$ when $t\to+\infty$ and $w<0$ when $t\to-\infty$. Note that $w\ne0$ because $u,v$ are linearly independent.

Then $Tw=\lambda w$ when $c=0$ and $Tw=\lambda w-v$ when $c=1$. The former case is impossible because every nonnegative eigenvector of the nonnegative and irreducible $T$ must be positive but $w$ is not. The latter case is impossible because $Tw$ is nonnegative but $\lambda w-v$ is not.

user1551
  • 149,263