I was reading Koller's book on Probabilistic Graphical Models and was wondering what the decomposition, weak union and contraction properties of conditional probability mean.
But before I ask exactly what I am confused about let me introduce some of Koller's notation so that we are all in the same page (anything else is unclear feel free to ask in the comments). Let capital non-bold stand for random variables say $X$ is a r.v. Let little non bold stand for the assignment to a random variable say $(X = x)$. Also, let me define captial bold letters as sets of random variables. For example $\textbf{X}, \textbf{Y}, \textbf{Z}$ are three sets of random variables. Let small bold letters denote assigments to these sets $\textbf{x}, \textbf{y}, \textbf{z}$ i.e. it denotes assigments of values to the variables in these sets. Let $Val(\textbf{X})$ be the values that the set of random variables can take.
Now, these are the properties/theorems I am trying to prove and understand:
Decomposition:
$$( \textbf{X} \perp \textbf{Y}, \textbf{W} \ | \ \textbf{Z}) \implies (\textbf{X} \perp \textbf{Y} \ | \ \textbf{Z})$$
Weak union:
$$(\textbf{X} \perp \textbf{Y}, \textbf{W} \ | \ \textbf{Z}) \implies (\textbf{X} \perp \textbf{Y} \ | \ \textbf{Z}, \textbf{W})$$
Contraction:
$$(\textbf{X} \perp \textbf{W} \ | \ \textbf{Z}, \textbf{Y}) \ \& \ (\textbf{X} \perp \textbf{Y} \ | \ \textbf{Z}) \implies (\textbf{X} \perp \textbf{Y}, \textbf{W} \ | \ \textbf{Z})$$
Intersection:
$$(\textbf{X} \perp \textbf{Y} | \textbf{Z}, \textbf{W}) \ \& \ (\textbf{X} \perp \textbf{W} | \textbf{Z}, \textbf{Y}) \implies (\textbf{X} \perp \textbf{Y}, \textbf{W} | \textbf{Z})$$
Let's take the first statement. I think its a notational problem (though, not sure). Does the first statement mean, "X is conditionally independent of Y and W Given Z"? i.e. is the first statement the same as $(\textbf{X} \perp (\textbf{Y},\textbf{W}) \ \perp \ \textbf{Z})$. If that is true, then would the first statement imply two things:
$$(\textbf{X} \perp \textbf{Y}, \textbf{W} \ | \ \textbf{Z}) \implies (\textbf{X} \perp \textbf{Y} \ | \ \textbf{Z})$$
and
$$(\textbf{X} \perp \textbf{Y}, \textbf{W} \ | \ \textbf{Z}) \implies (\textbf{X} \perp \textbf{W} \ | \ \textbf{Z})$$
?
I actually intended to prove them as an exercise to myself, however, not being sure if I understood the notation or not made it pretty hard to even attempt a proof (and it also made it hard to try to understand the intuition behind each of the statements, an important thing I wanted to also do, understand it intuitively). Providing one proof as an example and explaining the notation might be good enough for me as an example so that I can attempt the other too.
Bounty Section
Now that I have tried to prove them, I have had more difficulties than I expected, I wanted to see a proof for each one as I was unable to find them on the internet.
I tried proving the first one and this is what I have so far:
we want an expression for $(\textbf{X} \perp \textbf{Y} \mid \textbf{Z}, \textbf{W} )$ using property $(\textbf{X} \perp \textbf{Y}, \textbf{W} \mid \textbf{Z} )$ so lets consider:
$P(\textbf{X},\textbf{Y} | \textbf{Z}, \textbf{W}) = \frac{P(\textbf{X} ,\textbf{Y} , \textbf{Z} , \textbf{W} )}{P(\textbf{Z} , \textbf{W})} = \frac{P(\textbf{Z})P(\textbf{X}, \textbf{Y} , \textbf{W} \mid \textbf{Z})}{P(\textbf{Z} , \textbf{W})}$
Now we can use the property we need by noting that $P(\textbf{X}, \textbf{Y} ,\textbf{W} \mid \textbf{Z}) = P( \textbf{X} | \textbf{Z} )P(\textbf{Y} , \textbf{W} \mid \textbf{Z})$ (due to assumption/property $(\textbf{X} \perp \textbf{Y}, \textbf{W} \mid \textbf{Z})$), thus:
$\frac{P(\textbf{Z})P(\textbf{X},\textbf{Y},\textbf{W} \mid \textbf{Z} )}{P(\textbf{Z}, \textbf{W})} = \frac{P(\textbf{Z})P(\textbf{X}|\textbf{Z})P(\textbf{Y} , \textbf{X} | \textbf{Z} )}{P(\textbf{Z} , \textbf{W})}$
and that was as far as I got for Weak Union. I couldn't get it to be the same as:
$P(\textbf{X},\textbf{Y}|\textbf{Z},\textbf{W}) = P(\textbf{X}|\textbf{Z},\textbf{W})P(\textbf{Y}|\textbf{Z},\textbf{W})$