Proof of Leibniz formula from Laplace expansion

Question

I'm trying to prove Leibniz formula for the determinant using Laplace expansion. Here's my attempt:

For a $1 \times 1$ matrix $A = \begin{pmatrix}a_{11}\end{pmatrix}$, define $\det A = a_{11}$. For any $n \times n$ matrix $A$, define $\det A$ recursively according to the formula $$ \det A = \sum_{j=1}^n [A]_{1j} (-1)^{j+1}\det A_{1j} $$ where $A_{ij}$ is the matrix obtained from deleting the $i$-th row and $j$-th column of $A$, and $[A]_{ij}$ is the $i,j$-th entry of $A$.

Claim: For any finite set $A$, let $P(A)$ denote the set of all permutations of the elements of $A$.
Then $$ \det A = \sum_{\sigma \in P(\{1,\cdots, n\})} \left(\text{sgn} \, \sigma \prod_{i=1}^n [A]_{i \sigma(i)} \right) $$ where $\text{sgn} \, \sigma$ is the sign of the permutation $\sigma = (\sigma(1), \cdots, \sigma(n))$.

Proof: The claim is trivial when $n = 1$.

Suppose the claim is true for all $n \times n$ matrices. Let $A$ be an $(n+1) \times (n+1)$ matrix. Then \begin{align} \det A &= \sum_{j=1}^{n+1} [A]_{1j} (-1)^{j+1}\det A_{1j} \\ &= \sum_{j=1}^{n+1} [A]_{1j} (-1)^{j+1}\sum_{\sigma \in P(\{1,\cdots, n\})} \left(\text{sgn} \, \sigma \prod_{i=1}^n [A_{1j}]_{i \sigma(i)} \right) \\ &= \sum_{j=1}^{n+1} \sum_{\sigma \in P(\{1,\cdots, n\})} (-1)^{j+1} \text{sgn} \, \sigma \cdot \left(\,[A]_{1j} \prod_{i=1}^n [A_{1j}]_{i \sigma(i)} \right) \end{align} where the second equality follows from the induction hypothesis.

For each $\sigma \in P(\{1,\cdots, n\})$, define a permutation $\sigma^j \in P(\{1,\cdots, j-1,j+1,\cdots,n+1\})$ where the $i$-th element of $\sigma^j$ is the $\sigma(i)$-th element of the list $\{1,\cdots, j-1,j+1,\cdots,n+1\}$. Also define $\rho^j = (j,\sigma^j) \in P(\{1, \cdots, n+1\})$. (The dependence of $\sigma^j$ and $\rho^j$ on $\sigma$ is suppressed for notational clarity.) Observe that it takes $j - 1$ transpositions to turn the list $(1, \cdots, n+1)$ into $(j,1 ,\cdots, j-1, j+1, \cdots, n+1\}$ and then $\text{sgn } \sigma$ more transpositions to turn this list into $(j, \sigma^j)$. Thus $\text{sgn } \rho^j = (-1)^{j-1} \text{sgn } \sigma = (-1)^{j+1} \text{sgn } \sigma$. Also observe that $[A_{1j}]_{i \sigma(i)} = [A]_{i+1,\sigma^j(i)}$. Plugging into the previous expression, we have

\begin{align} \det A &= \sum_{j=1}^{n+1} \sum_{\sigma \in P(\{1,\cdots, n\})} \text{sgn} \, \rho^j \cdot \left(\, [A]_{1j} \prod_{i=1}^n [A]_{i+1, \sigma^j(i)} \right) \\ &= \sum_{j=1}^{n+1} \sum_{\sigma \in P(\{1,\cdots, n\})} \text{sgn} \, \rho^j \cdot \left(\, \prod_{i=1}^{n+1} [A]_{i \rho^j(i)} \right) \end{align} The result then follows because every $\rho \in P(\{1, \cdots, n+1\})$ can be uniquely written as $\rho = (j, \sigma^j)$ for some $j$ and $\sigma$.

Does this make sense/easy to follow? Is there a way to be more tidy/elegant? Thanks!

Excuse me. What does "the $i$-th element of $\sigma^j$" mean? and what is the definition of $(j, \sigma^j)$? — bfhaha, Apr 09 '18 at 05:30
That's a good question. I have searched some books. But they all define a determinant as the Leibniz form then prove the Laplace form. See Theorem 5.12 in Carrell's _Groups, Matrices, and Vector Spaces or Theorem 7.2.13 in Kuttler's Elementary Linear Algebra. — bfhaha, Apr 09 '18 at 05:58
@bfhaha Hi! I agree the notation is a little confusing. Maybe an example will clarify: Suppose $\sigma = {2,1}$. Then for $j = 1$, $\sigma^j = {3,2}$ and $\rho^j = (1,3,2)$. Does that help? — David, Apr 10 '18 at 12:44
Sorry. It doesn't help. When you said $\sigma={2, 1}$, it means $\sigma(1)=2$ and $\sigma(2)=1$? — bfhaha, Apr 15 '18 at 05:10
Is ${2, 1}$ a permutation? or a list? or an ordered set? Why you use two kinds of notations to denote permutations? $\sigma^j={3, 2}$ and $\rho^j=(1,3,2)$ both are permutations. — bfhaha, Apr 15 '18 at 05:16
Oh sorry, that was a typo. Brackets should only denote sets, while parenthesis denote an ordered list. $\sigma = (2,1)$ means $\sigma(1) = 2$ and $\sigma(2) = 1$. — David, Apr 16 '18 at 17:01
I think it's attractive to prove the Laplace expansion from the Leibniz formula, since you're going from what might be considered more powerful to something less powerful, but I also find the Laplace expansion more intuitive than the Leibniz formula, and I think there is merit in proving it the other way around. — David Cian, Mar 04 '21 at 00:02

bfhaha · Answer 1 · 2018-04-19T02:53:46.243

This is my proof without defining new notations.

Continuing from the induction hypothesis $$\det{A} =\sum_{j=1}^{n+1}(-1)^{1+j}[A]_{1,j}\det{A_{1,j}} =\sum_{j=1}^{n+1}(-1)^{1+j}[A]_{1,j}\sum_{\sigma\in S_n}\text{sgn }\sigma\prod_{i=1}^{n}[A_{1,j}]_{i,\sigma(i)}$$
Denote $[n]=\{1, 2, ..., n\}$. For any $\sigma\in S_n$, since $\sigma$ is bijective, let $$i_1=\sigma^{-1}(1), i_2=\sigma^{-1}(2), ..., i_n=\sigma^{-1}(n).$$ Then $\{i_1, i_2, ..., i_n\}=[n]$ and $$\sigma(i_1)=1, \sigma(i_2)=2, ..., \sigma(i_n)=n.$$ As the following figure indicates. $$\begin{matrix} [n] & \sigma\in S_n & [n]\\ \hline i_1 & \longrightarrow & 1 \\ i_2 & \longrightarrow & 2 \\ \vdots & \vdots & \vdots \\ i_n & \longrightarrow & n \\ \end{matrix}$$
Then \begin{eqnarray*} \det{A} &=& \sum_{j=1}^{n+1}(-1)^{1+j}[A]_{1,j}\sum_{\sigma\in S_n}\text{sgn }\sigma\prod_{i=1}^{n}[A_{1,j}]_{i,\sigma(i)}\\ &=& \sum_{j=1}^{n+1}(-1)^{1+j}[A]_{1,j}\sum_{\sigma\in S_n}\text{sgn }\sigma\prod_{k=1}^{n}[A_{1,j}]_{i_k, \sigma(i_k)}\\ &=& \sum_{j=1}^{n+1}(-1)^{1+j}[A]_{1,j}\sum_{\sigma\in S_n}\text{sgn }\sigma\prod_{k=1}^{n}[A_{1,j}]_{i_k, k}\\ &=& \sum_{j=1}^{n+1}(-1)^{1+j}[A]_{1,j}\sum_{\sigma\in S_n}\text{sgn }\sigma\left(\prod_{k=1}^{j-1}[A_{1,j}]_{i_k, k}\prod_{k=j}^{n}[A_{1,j}]_{i_k, k}\right)\\ &=& \sum_{j=1}^{n+1}(-1)^{1+j}[A]_{1,j}\sum_{\sigma\in S_n}\text{sgn }\sigma\left(\prod_{k=1}^{j-1}[A]_{i_k+1, k}\prod_{k=j}^{n}[A]_{i_k+1, k+1}\right)\\ &=& \sum_{j=1}^{n+1}(-1)^{1+j}\sum_{\sigma\in S_n}\text{sgn }\sigma\left([A]_{1,j}\cdot \prod_{k=1}^{j-1}[A]_{i_k+1, k}\prod_{k=j}^{n}[A]_{i_k+1, k+1}\right)\\ &=& \sum_{j=1}^{n+1}(-1)^{1+j}\sum_{\sigma\in S_n}\text{sgn }\sigma \cdot [A]_{1,j}\cdot \underline{[A]_{i_1+1, 1}\cdot [A]_{i_2+1, 2}\cdots [A]_{i_{j-1}+1, j-1}}\cdot \\ && \underline{[A]_{i_j+1, j+1}\cdot [A]_{i_{j+1}+1, j+2}\cdots [A]_{i_n+1, n+1}}\\ &=& \sum_{j=1}^{n+1}(-1)^{1+j}\sum_{\sigma\in S_n}\text{sgn }\sigma \cdot \underline{[A]_{i_1+1, 1}\cdot [A]_{i_2+1, 2}\cdots [A]_{i_{j-1}+1, j-1}}\cdot \\ && [A]_{1,j}\cdot \underline{[A]_{i_j+1, j+1}\cdot [A]_{i_{j+1}+1, j+2}\cdots [A]_{i_n+1, n+1}}\\ \end{eqnarray*}
Consider a permutation $\tau_{\sigma}\in S_{n+1}$ as following $$\begin{matrix} [n+1] & \tau_{\sigma}\in S_{n+1} & [n+1]\\ \hline i_1+1 & \longrightarrow & 1 \\ i_2+1 & \longrightarrow & 2 \\ \vdots & \vdots & \vdots \\ i_{j-1}+1 & \longrightarrow & j-1 \\ 1 & \longrightarrow & j \\ i_j+1 & \longrightarrow & j+1 \\ \vdots & \vdots & \vdots \\ i_n+1 & \longrightarrow & n+1 \\ \end{matrix}$$ Then the equation above equals to $$\det{A}=\sum_{j=1}^{n+1}(-1)^{1+j}\sum_{\sigma\in S_n}\text{sgn }\sigma\prod_{\ell=1}^{n+1}[A]_{\ell, \tau_{\sigma}(\ell)}.$$
Note that there is an one-to-one correspondence between $\sigma\in S_n$ and $\tau_{\sigma}\in S_{n+1}$ with $\tau_{\sigma}(1)=j$. By the Lemma 2, $\text{sgn }\tau_{\sigma}=(-1)^{1+j}\text{sgn }\sigma$. Then $$\det{A}=\sum_{j=1}^{n+1}\sum_{\substack{\tau\in S_{n+1}\\ \tau(1)=j}}\text{sgn }\tau\prod_{\ell=1}^{n+1}[A]_{\ell, \tau(\ell)} =\sum_{\tau\in S_{n+1}}\text{sgn }\tau\prod_{\ell=1}^{n+1}[A]_{\ell, \tau(\ell)}.$$

Lemma 1. If $\gamma\in S_{n+1}$ is $$\begin{matrix} [n+1] & \gamma\in S_{n+1} & [n+1]\\ \hline 1 & \longrightarrow & x_1 \\ 2 & \longrightarrow & x_2 \\ \vdots & \vdots & \vdots \\ i & \longrightarrow & x_{i}\\ i+1 & \longrightarrow & x_{i+1}\\ \vdots & \vdots & \vdots \\ n+1 & \longrightarrow & x_{n+1}\\ \end{matrix}$$ Then $(x_i, x_{i+1})\gamma$ is $$\begin{matrix} [n+1] & \gamma\in S_{n+1} & [n+1]\\ \hline 1 & \longrightarrow & x_1 \\ 2 & \longrightarrow & x_2 \\ \vdots & \vdots & \vdots \\ i & \longrightarrow & x_{i+1}\\ i+1 & \longrightarrow & x_{i}\\ \vdots & \vdots & \vdots \\ n & \longrightarrow & x_n \\ \end{matrix}$$

Lemma 2. Back to our $\sigma$. Consider $\sigma^{-1}\in S_n$. We can define $\sigma^{-1}(n+1)=n+1$ to make it as an element in $S_{n+1}$. That is, $$\begin{matrix} [n+1] & \sigma^{-1}\in S_{n+1} & [n+1]\\ \hline 1 & \longrightarrow & i_1 \\ 2 & \longrightarrow & i_2 \\ \vdots & \vdots & \vdots \\ n & \longrightarrow & i_n \\ n+1 & \longrightarrow & n+1 \end{matrix}$$

By the Lemma 1, we can left-multiply a product of $m$ transpositions to make $i_1, i_2, ..., i_n, n+1$ in the right column in an increasing order. In fact, the product of these transpositions is $\sigma$.

Again, applying the Lemma 1 on $\tau_{\sigma}^{-1}\in S_{n+1}$ in the same way. We can left-multiply $j-1$ transpositions to $\tau_{\sigma}^{-1}$ to move $1$ to the first element in the right column. Then left-multiply $m$ transpositions to make $i_1+1, i_2+1, ..., i_n+1$ in the right column into an increasing order.

$$\begin{matrix} [n+1] & \tau_{\sigma}^{-1}\in S_{n+1} & [n+1]\\ \hline 1 & \longrightarrow & i_1+1 \\ 2 & \longrightarrow & i_2+1 \\ \vdots & \vdots & \vdots \\ j-1 & \longrightarrow & i_{j-1}+1 \\ j & \longrightarrow & 1 \\ j+1 & \longrightarrow & i_j+1 \\ \vdots & \vdots & \vdots \\ n+1 & \longrightarrow & i_n+1 \\ \end{matrix}$$ Suppose that $s_m \cdots s_2 s_1 t_{j-1} \cdots t_2 t_1\tau_{\sigma}^{-1}=r_m\cdots r_2 r_1\sigma^{-1}=\varepsilon$, where $s_m, ..., s_2, s_1, t_{j-1}, ..., t_2, t_1, r_m, ..., r_2, r_1$ all are transpositions and $\varepsilon$ is the identity in $S_{n+1}$. Therefore, \begin{eqnarray*} &&\text{sgn }(s_m \cdots s_2 s_1 t_{j-1} \cdots t_2 t_1\tau_{\sigma}^{-1})=\text{sgn }(r_m\cdots r_2 r_1\sigma^{-1})\\ &\Rightarrow& \text{sgn }(s_m \cdots s_2 s_1)\cdot \text{sgn }(t_{j-1} \cdots t_2 t_1)\cdot \text{sgn }(\tau_{\sigma}^{-1})=\text{sgn }(r_m\cdots r_2 r_1)\cdot \text{sgn }(\sigma^{-1})\\ &\Rightarrow& (-1)^{m}(-1)^{j-1}\text{sgn }(\tau_{\sigma}^{-1})=(-1)^{m}\text{sgn }(\sigma^{-1})\\ &\Rightarrow& (-1)^{j-1}\text{sgn }(\tau_{\sigma}^{-1})=\text{sgn }(\sigma^{-1})\\ &\Rightarrow& (-1)^{j-1}\text{sgn }(\tau_{\sigma})=\text{sgn }(\sigma)\\ &\Rightarrow& (-1)^{1+j}\text{sgn }(\tau_{\sigma})=\text{sgn }(\sigma) \end{eqnarray*}

score 1 · Answer 2 · answered Aug 24 '22 at 08:25

I review my previous answer which was posted 4 years ago and improve it. Here, we use the expansion along the last column of $A$. Which makes the notations simpler.

\begin{eqnarray*} \det{A} &=& \sum_{i=1}^{n+1}(-1)^{i+n+1}[A]_{i,n+1}\det{A_{i,n+1}} \\ &=& \sum_{i=1}^{n+1}(-1)^{i+n+1}[A]_{i,n+1}\sum_{\sigma\in S_n}\text{sgn }\sigma\prod_{j=1}^{n}[A_{i,n+1}]_{j,\sigma(j)} \\ &=& \sum_{i=1}^{n+1}(-1)^{i+n+1}[A]_{i,n+1}\sum_{\sigma\in S_n}\text{sgn }\sigma\left(\prod_{j=1}^{i-1}[A_{i,n+1}]_{j, \sigma(j)}\prod_{j=i}^{n}[A_{i,n+1}]_{j, \sigma(j)}\right) \\ &=& \sum_{i=1}^{n+1}(-1)^{i+n+1}[A]_{i,n+1}\sum_{\sigma\in S_n}\text{sgn }\sigma\left(\prod_{j=1}^{i-1}[A]_{j, \sigma(j)}\prod_{j=i}^{n}[A]_{j+1, \sigma(j)}\right) \\ &=& \sum_{i=1}^{n+1}\sum_{\sigma\in S_n}(-1)^{i+n+1}\text{sgn }\sigma\left(\prod_{j=1}^{i-1}[A]_{j, \sigma(j)}\cdot [A]_{i,n+1}\cdot \prod_{j=i}^{n}[A]_{j+1, \sigma(j)}\right) \\ &=& \sum_{i=1}^{n+1}\sum_{\sigma\in S_n}(-1)^{i+n+1}\text{sgn }\sigma \cdot [A]_{1, \sigma(1)}[A]_{2, \sigma(2)}\cdots [A]_{i-1, \sigma(i-1)} \\ && \times [A]_{i, n+1}\times [A]_{i+1, \sigma(i)}[A]_{i+2, \sigma(i+1)}\cdots [A]_{n+1, \sigma(n)} \\ &=& \star \end{eqnarray*}

For each $\sigma\in S_n$, define $\tau_{\sigma}$ as $$ \begin{matrix} [n+1] & \tau_{\sigma}\in S_{n+1} & [n+1]\\ \hline 1 & \longrightarrow & \sigma(1) \\ 2 & \longrightarrow & \sigma(2) \\ \vdots & \vdots & \vdots \\ i-1 & \longrightarrow & \sigma(i-1) \\ i & \longrightarrow & n+1 \\ i+1 & \longrightarrow & \sigma(i) \\ \vdots & \vdots & \vdots \\ n+1 & \longrightarrow & \sigma(n) \\ \end{matrix} $$

Note that

given a fixed $i$, as $\sigma$ runs through all the permutations in $S_n$, $\tau_{\sigma}$ also run through all the permutations in $S_{n+1}$ which map $i$ to $n+1$.
$\tau_{\sigma}=\sigma(n+1, n, \cdots, i+1, i)=\sigma(n+1, i)(n+1, i+1)\cdots (n+1, n)$. Hence, $\text{sgn }\tau_{\sigma}=\text{sgn }\sigma \cdot (-1)^{n+1-i}$ and $(-1)^{i+n+1}\text{sgn }\sigma=\text{sgn }\tau_{\sigma}$.

Therefore, \begin{eqnarray*} \star &=& \sum_{i=1}^{n+1}\sum_{\substack{\tau\in S_{n+1} \\ \tau(i)=n+1}}\text{sgn }\tau \prod_{j=1}^{n+1}[A]_{j, \tau(j)} \\ &=& \sum_{\tau\in S_{n+1}}\text{sgn }\tau\prod_{j=1}^{n+1}[A]_{j, \tau(j)} \end{eqnarray*}

Proof of Leibniz formula from Laplace expansion

2 Answers2