According to this paper on factor analysis, a $p$-dimensional random vector $\textbf{x}$ can be modeled using a $k$-dimensional vector of factors $\textbf{z}$ where $k \ll p$ using this generative model:
$$ \textbf{x} = \Lambda \textbf{z} + \textbf{u} $$
Where $\Lambda$ is a matrix and $\textbf{z} \sim \mathcal{N}(0, I_k)$ and $\textbf{u} \sim \mathcal{N}(0, \Psi)$. The authors then claim the following:
According to this model, $\textbf{x}$ is therefore distributed with zero mean and covariance $\Lambda \Lambda^{\top} + \Psi$.
My question is: what is the relationship between the above equation between random variables and then the various probability distributions, $p(\textbf{x})$, $p(\textbf{x} \mid \textbf{z})$, and $p(\textbf{x}, \textbf{z})$.
For example, how would I compute $p(\textbf{x})$? This is my attempt, but it seems like an abuse of notation:
$$ \begin{align} \textbf{x} &= \Lambda \textbf{z} + \textbf{u} \\ &= \Lambda \mathcal{N}(\textbf{0}, I_k) + \mathcal{N}(\textbf{0}, \Psi) \\ &= \mathcal{N}(\textbf{0}, \Lambda \Lambda^{\top} + \Psi) \end{align} $$
Does that work? Can I just replace random variables in that equation with distributions and compute?
Also, I've seen other resources (for example) claim that the factor analysis model is:
$$ p(\textbf{x} \mid \textbf{z}, \theta) = \mathcal{N}(\textbf{x} \mid \Lambda \textbf{z} + \textbf{u}, \Psi) $$
But I don't know how to go from the equation of random variables to this conditional density. Why is one presentation an equation of random variables and another a conditional density? How do I move between these two formulations?