I have a question concerning simultaneous triangularisability (i. e. if a family of matrices $M_{\lambda}$ with $\lambda \in K$ commute and if there is one matrix $S \in GL(n, K)$ for all $M_{\lambda}$ such that $SM_{\lambda}S^{-1}$ is an upper triangular matrix then this family of matrices is simultaneously triangularizable).
Why do the matrices need to commute? What goes wrong if they don't?
I had show that a family of matrices is simultaneously triangularizable. In order to do so, I calculated the characteristic polynomial and got $\chi_{M_{\lambda}}(X) = X(X-1)^2$. So since there are only linear factors in $\chi$, the matrices are triangularizable. Is it necessary that all $\lambda$ cancel each other out? Would those matrices still be triangularizable if e. g. $\chi (X) = X(X-1)(X+ \lambda)$?
Edit: I suppose I can find a counterexample for question 1. But is there any intuitive explanation?