6

Suppose that $\mathbf A_1$ and $\mathbf A_2$ are $n \times n$ matrices. Are there necessary and sufficient conditions for the existence of $n \times n$ matrices $\mathbf U$ and $\mathbf V$ and $n \times n$ diagonal matrices $\mathbf D_1$ and $\mathbf D_2$ that satisfy

$$ \mathbf A_1 = \mathbf U \mathbf D_1 \mathbf V , \quad \text{and} \quad \mathbf A_2 = \mathbf U \mathbf D_2 \mathbf V?$$


A sufficient condition is for $\mathbf A_1$ and $\mathbf A_2$ to be invertible and have the same eigenvectors; in that case the result follows from the eigendecomposition. I was hoping to find a necessary condition, perhaps using the LDU decomposition.
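Here is a minimal numerical sketch of that sufficient condition, assuming numpy; the matrices, eigenvalues, and the choice $U = P$, $V = P^{-1}$ are made up purely for illustration.

```python
# Quick check of the sufficient condition: A1, A2 invertible with a common
# eigenvector basis P, so U = P, V = P^{-1} work for both matrices.
import numpy as np

P = np.array([[1., 1.], [0., 1.]])              # shared eigenvector basis
D1 = np.diag([2., 3.])
D2 = np.diag([5., 7.])
A1 = P @ D1 @ np.linalg.inv(P)                  # invertible, same eigenvectors
A2 = P @ D2 @ np.linalg.inv(P)

U, V = P, np.linalg.inv(P)
print(np.allclose(A1, U @ D1 @ V), np.allclose(A2, U @ D2 @ V))  # True True
```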

mzp
  • 2,000
  • 1
    If I'm remembering correctly, matrices are simultaneously diagonalisable iff they commute. (Obviously this is necessary but sufficiency is harder). This is the special case where $VU=UV=I$. – Dan Robertson Oct 31 '16 at 00:08
  • @DanRobertson The proof I've seen of this uses the eigendecomposition and requires the matrices to be invertible. I think this could still be true for non-invertible matrices, no? – mzp Oct 31 '16 at 00:11
Perhaps. I don't recall whether or not invertibility is required. I can't think of a counterexample off the top of my head. – Dan Robertson Oct 31 '16 at 00:14
  • Why don't you play with the n=2,3 case first? – Sergio Parreiras Oct 31 '16 at 00:19
Which ring are you considering matrices over (for instance, $\Bbb C$, $\Bbb R$, $\Bbb Q$, $\Bbb Z$, $\Bbb R[x]$, ...)? – Alex Ravsky Nov 02 '16 at 14:57
@AlexRavsky $\mathbb R [x]$ would be better, but I would already be pretty happy to have the answer for matrices over $\mathbb R$. – mzp Nov 02 '16 at 17:53
  • Like @DanRobertson says, this is the theorem. Let $A_1, A_2$ be diagonalizable matrices. Then $A_1, A_2$ are simultaneously diagonalizable iff $A_1, A_2$ commute. Of course, this is less general than what you proposed (since $U, V$ don't have to be inverses in your version). Would you like more details in an answer, or is this not in the direction you're looking for? – Jon Warneke Nov 03 '16 at 00:48
@JonWarneke that is the theorem I was thinking of. I couldn't think of the proof when the matrices are not invertible, but I do indeed recognise that that covers only a specific case of the question. – Dan Robertson Nov 03 '16 at 00:50
@JonWarneke I am interested in the case in which the $\mathbf A$ matrices are not invertible. I've seen a proof of the result in the case in which they are both invertible. If yours generalizes this in any way, I would be really interested if you could write an answer. – mzp Nov 03 '16 at 01:03
There is a nearly trivial solution if $U,V$ are non-invertible. If $U$ or $V$ is equal to $0$, this would always create the trivial $D$ matrix $0$. You might want to specify $U,V$ a bit more. – Patrick Abraham Nov 03 '16 at 13:47
@PatrickAbraham But $A_1,A_2$ are given. If $U=0$ or $V=0$ then the RHS of the equations would be zero. Maybe I am not getting what you are saying; in that case, can you elaborate? – mzp Nov 03 '16 at 14:26
  • 1
@mzp Too many hours of sleep deprivation. Missed that $A_1$ and $A_2$ are given and focused way too much on the form of $U$ and $V$. – Patrick Abraham Nov 04 '16 at 10:15

3 Answers

2

I’ll be dealing with $n\times n$ matrices over a field $F$. I’ll try to make you at least partially happy: the results below are partial, but they concern the most common and simplest cases. I shall call matrices $A_1$ and $A_2$ simultaneously diagonalizable provided there exist diagonal matrices $D_1$, $D_2$ and invertible matrices $U$, $V$ such that $A_1=UD_1V$ and $A_2=UD_2V$. Consider the polynomial matrix $A(x)=xA_1+A_2$ with elements from $F[x]$. Let $r$ be the rank of the matrix $A(x)$ and $d_r(A(x))\ne 0$ its $r$-th determinant divisor. Let

$$d_r(A(x))=(x-x_1)^{r_1}\cdots (x-x_k)^{r_k},$$

where $x_i$ are distinct elements of the field $F$.

I hope to prove the following two propositions; below I present my proof ideas.

Proposition 1. If matrices $A_1$ and $A_2$ are simultaneously diagonalizable then no elementary divisor of the matrix $A(x)$ has multiple roots.

Proof idea. Let $D_1$, $D_2$ be diagonal and $U$, $V$ be invertible matrices such that $A_1=UD_1V$ and $A_2=UD_2V$. Then $A(x)=UD(x)V$, where $D(x)\equiv D_1x+D_2$. Therefore the matrices $A(x)$ and $D(x)$ have the same Smith normal form, and for each $i\le r$ the $i$-th determinant divisor $d_i(D(x))$ of the matrix $D(x)$ satisfies the equality

$$d_i(D(x))= (x-x_1)^{\max\{r_1+i-r,0\}}\cdots (x-x_k)^{\max\{r_k+i-r,0\}}.$$

Thus each elementary divisor $\alpha_i(A(x))= \alpha_i(D(x))=\frac{d_i(D(x))}{d_{i-1}(D(x))}$ divides the product $$(x-x_1) \cdots (x-x_k). \qquad \square$$

Proposition 2. If $|A_1|\ne 0$ and no elementary divisor of the matrix $A(x)$ has multiple roots then matrices $A_1$ and $A_2$ are simultaneously diagonalizable.

Proof idea. Let $D(x)\equiv D_1x+D_2$ be a diagonal matrix which has the entry $x-x_i$ exactly $r_i$ times for each $i$. Since $\alpha_i\mid \alpha_{i+1}$ for each $1\le i<r$ and no elementary divisor $\alpha_i(A(x))$ of the matrix $A(x)$ has multiple roots, we can easily see that the matrix $D(x)$ has the same elementary divisors as the matrix $A(x)$, that is, $\alpha_i(A(x))=\alpha_i(D(x))$ for each $i$. Thus the matrices $A(x)$ and $D(x)$ have the same Smith normal form. Therefore there exist invertible matrices $U(x), V(x)$ with elements from $F[x]$ such that $U(x)A(x)V(x)=D(x)$. Similarly to the proof of Theorem 6 from [Gan, Ch. VI, $\S 4$] we can show (and only here we use that $|A_1|$ is non-zero) that there exist invertible matrices $U, V$ with elements from $F$ such that $UA(x)V=D(x)$. Then $A_1=U^{-1}D_1V^{-1}$ and $A_2=U^{-1}D_2V^{-1}$, so the matrices $A_1$ and $A_2$ are simultaneously diagonalizable. $\square$
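As a rough illustration of the criterion appearing in these propositions (my own sketch, not part of the proofs), here is a sympy snippet that computes the determinant divisors $d_i$ and the elementary divisors $\alpha_i=d_i/d_{i-1}$ of the pencil $A(x)=xA_1+A_2$ and checks that no $\alpha_i$ has multiple roots; the example matrices are arbitrary.

```python
# Determinant divisors d_i = gcd of all i-by-i minors of A(x); elementary
# divisors alpha_i = d_i / d_{i-1}; square-freeness checked via gcd(a, a').
from functools import reduce
from itertools import combinations
import sympy as sp

x = sp.symbols('x')

def determinant_divisors(A):
    n = A.rows
    divisors = []
    for i in range(1, n + 1):
        minors = [A.extract(rows, cols).det()
                  for rows in combinations(range(n), i)
                  for cols in combinations(range(n), i)]
        divisors.append(reduce(sp.gcd, minors))
    return [sp.factor(d) for d in divisors if d != 0]   # keep d_1, ..., d_r

A1 = sp.Matrix([[1, 0], [0, 2]])   # arbitrary example with |A1| != 0
A2 = sp.Matrix([[3, 0], [0, 4]])
A = x * A1 + A2                    # the pencil A(x)

d = determinant_divisors(A)
alpha = [d[0]] + [sp.cancel(d[i] / d[i - 1]) for i in range(1, len(d))]
square_free = all(sp.gcd(a, sp.diff(a, x)) == 1 for a in alpha)
print(d)            # determinant divisors
print(alpha)        # elementary divisors
print(square_free)  # True here, consistent with simultaneous diagonalizability
```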

I stop now, because I have already called my colleague, who is a matrix theorist and got interested in the problem. On Monday he is going to return to Lviv, and then I hope to visit him for a long talk over tea about this and other matrix-related MSE questions. (But it may be hard to reach complete happiness in this imperfect world, so his answer to your question for $\Bbb R[x]$ may be: “Since $\Bbb R[x][y]$ is not even a principal ideal domain, this is a very hard problem (and much harder when we are dealing with singular matrices), and there are results only in very particular cases”.)

I hope to improve both propositions a bit by using the matrix $A(x,y)=A_1x+A_2y$ instead of the matrix $A(x)$, similarly to the beginning of [Gan, Ch. XII]. Unfortunately, those results are not directly applicable to our problem, because the author is dealing with number fields.

References

[Gan] Feliks Ruvimovich Gantmakher, The Theory of Matrices (Russian and English editions).

Alex Ravsky
  • 106,166
  • 1
    I really appreciate the time you put into this. I'm already pretty happy :), but if you reach any new conclusions during your discussion please let me know. Cheers! – mzp Nov 03 '16 at 22:18
  • 1
@mzp I have just noticed that we can extend the area of applicability of the current results a bit, from invertibility of $A_1$ to invertibility of $A_1x+A_2y$ for some $x$ and $y$ from the ring $R$, because the matrices $A_1$ and $A_2$ are simultaneously diagonalizable iff all matrices from the family $\{A_1x+A_2y:x,y\in R\}$ are simultaneously diagonalizable. A particular case is when $R$ is a field and $|A_1x+A_2y|\not\equiv 0$. – Alex Ravsky Nov 04 '16 at 00:06
  • 1
Also in this case both matrices $U$ and $V$ are invertible, so my modified definition of diagonalizability coincides with your initial one. – Alex Ravsky Nov 04 '16 at 00:24
1

This answer uses an approach different to that of my first answer.

I shall call matrices $A_1$ and $A_2$ simultaneously diagonalizable provided there exist diagonal matrices $D_1$, $D_2$ and invertible matrices $U$, $V$ such that $A_1=UD_1V$ and $A_2=UD_2V$.

Proposition. Let $A_1$ be an invertible matrix. Then the matrices $A_1$ and $A_2$ are simultaneously diagonalizable iff the matrix $A_2A_1^{-1}$ is similar to a diagonal matrix. Moreover, if we are considering the matrices over an algebraically closed field, then both these conditions are equivalent to the diagonality of the Jordan normal form of the matrix $A_2A_1^{-1}$.

Proof. If matrices $A_1$ and $A_2$ are simultaneously diagonalizable then

$$A_2A_1^{-1}=UD_2VV^{-1}D_1^{-1}U^{-1}= UD_2D_1^{-1}U^{-1},$$

that is, the matrix $A_2A_1^{-1}$ is similar to a diagonal matrix. Conversely, assume that there exists a diagonal matrix $D$ and an invertible matrix $U$ such that $A_2A_1^{-1}=UDU^{-1}$. Put $V=U^{-1}A_1$. Then

$$U^{-1}A_2V^{-1}= U^{-1} UDU^{-1}A_1A_1^{-1}U=D$$ and

$$U^{-1}A_1V^{-1}=U^{-1}A_1A_1^{-1}U=I,$$

so $A_1=UIV$ and $A_2=UDV$, and the matrices $A_1$ and $A_2$ are simultaneously diagonalizable. $\square$
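For concreteness, here is a small numpy sketch of the construction in this proof, assuming $A_1$ is invertible and $A_2A_1^{-1}$ happens to be (numerically) diagonalizable; the example matrices are arbitrary.

```python
# Construction from the proof: diagonalize M = A2 A1^{-1} = U D U^{-1},
# set V = U^{-1} A1, and check A1 = U I V and A2 = U D V.
import numpy as np

A1 = np.array([[2., 1.], [0., 1.]])   # invertible
A2 = np.array([[3., 4.], [0., 5.]])

M = A2 @ np.linalg.inv(A1)
eigvals, U = np.linalg.eig(M)         # assumes M is diagonalizable
D = np.diag(eigvals)
V = np.linalg.inv(U) @ A1             # the choice V = U^{-1} A1 from the proof

print(np.allclose(A1, U @ np.eye(2) @ V))   # A1 = U * I * V (true by construction)
print(np.allclose(A2, U @ D @ V))           # A2 = U * D * V
```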

Alex Ravsky
  • 106,166
1

Here I quote the classical argument for the special case I mentioned in the comments.

Let $S$ and $T$ be diagonalizable operators on $V$. (Note that $T$ is diagonalizable iff there exists a basis of $V$ consisting of eigenvectors of $T$; I'll use this freely.) It's not so hard to prove that $S, T$ simultaneously diagonalizable $\implies S, T$ commute, so we'll leave that to you. We prove that $S, T$ commute $\implies S, T$ simultaneously diagonalizable.

Let $\lambda_1, \dots, \lambda_m$ be the distinct eigenvalues of $S$ and $V_1, \dots, V_m$ be the corresponding eigenspaces $V_i = \{v \in V : Sv = \lambda_i v\}$. Any bases $e_{i, 1}, \dots, e_{i, d_i}$ of the $V_i$ combine to give a basis $$ e_{1, 1}, \dots, e_{1, d_1}; \dots; e_{m, 1}, \dots, e_{m, d_m} \tag{1} $$ of $V$ with respect to which $S$ has a diagonal matrix.

Each $V_i$ is an $S$-invariant subspace, and since $S, T$ commute, it's also a $T$-invariant subspace. To prove this, let $v \in V_i$. We show $Tv \in V_i$. By definition, $Sv = \lambda_i v$. Applying $T$ gives $TSv = \lambda_i Tv$. Since $S, T$ commute, $S(Tv) = \lambda_i (Tv)$. Hence $Tv \in V_i$, and $V_i$ is $T$-invariant.

Now the matrix of $T$ with respect to the basis $(1)$ is block diagonal, since the $V_i$ are $T$-invariant. But it's not necessarily diagonal. For the matrix of $T$ to be diagonal we'd need a basis of each $V_i$ consisting of eigenvectors of $T$. However, because $T$ is diagonalizable and $V_i$ is a $T$-invariant subspace, such a basis exists. For various proofs, see this post*. With respect to this $T$-eigenbasis of $V_i$, the matrix of $S$ is still diagonal (since the bases in $(1)$ were arbitrary), and now the matrix of $T$ is diagonal, so we've simultaneously diagonalized $S, T$.

[*If all the eigenvalues of $S$ are different, then the $V_i$ are $1$-dimensional $T$-invariant subspaces; hence the vectors $(1)$ are eigenvectors of $T$ also, and the matrix of $T$ with respect to the basis $(1)$ is also diagonal, completing the proof in this special case. The proof when $V_i$ is not $1$-dimensional is slightly more difficult.]
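Here is a rough numerical sketch of this argument, assuming numpy: diagonalize $S$, then diagonalize the restriction of $T$ to each eigenspace of $S$. The example matrices and the naive grouping of nearly equal eigenvalues are my own simplifications, not part of the proof.

```python
import numpy as np

def simultaneously_diagonalize(S, T, tol=1e-8):
    eigvals, P = np.linalg.eig(S)         # columns of P: eigenvectors of S
    T_hat = np.linalg.inv(P) @ T @ P      # block diagonal, since each V_i is T-invariant
    Q = np.eye(S.shape[0], dtype=complex)
    for lam in np.unique(np.round(eigvals, 8)):        # naive eigenvalue grouping
        idx = np.where(np.abs(eigvals - lam) < tol)[0]  # coordinates spanning V_i
        _, W = np.linalg.eig(T_hat[np.ix_(idx, idx)])   # eigenbasis of T restricted to V_i
        Q[np.ix_(idx, idx)] = W
    return P @ Q                          # common eigenbasis for S and T

# Example: S and T diagonal in the same (non-orthogonal) basis R, hence commuting.
R = np.array([[1., 1., 0.], [0., 1., 1.], [0., 0., 1.]])
S = R @ np.diag([2., 2., 3.]) @ np.linalg.inv(R)
T = R @ np.diag([5., 7., 7.]) @ np.linalg.inv(R)

P = simultaneously_diagonalize(S, T)
print(np.round(np.linalg.inv(P) @ S @ P, 6))   # diagonal
print(np.round(np.linalg.inv(P) @ T @ P, 6))   # diagonal
```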

Jon Warneke
  • 5,017