Here (p. 15) the author defines conditional divergence as
$$D(P_{Y\mid X}\mid\mid Q_{Y\mid X}\mid P_X):=\mathbb{E}_{x\sim P_X}\left[D(P_{Y\mid X=x}\mid\mid Q_{Y\mid X=x})\right]$$
for two conditional distributions $P_{Y\mid X}$ and $Q_{Y\mid X}$ and with $D(P\mid\mid Q)$ being the usual Kullback-Leibler divergence of $P$ and $Q$.
The "chain rule"
$$D(P_{XY}\mid\mid Q_{XY})=D(P_{Y\mid X}\mid\mid Q_{Y\mid X}\mid P_X)+D(P_X\mid\mid Q_X)$$
with $P_{XY}$ and $Q_{XY}$ being the joint probability distributions is afterwards proved using only the following line:
$$\text{Disintegration: }\mathbb{E}_{(X,Y)}\left[\log\frac{P_{XY}}{Q_{XY}}\right]=\mathbb{E}_{(X,Y)}\left[\log\frac{P_{Y\mid X}}{Q_{Y\mid X}}+\log\frac{P_{X}}{Q_{X}}\right].$$
Question 1: What exactly is this Radon-Nikodym derivative of conditional distributions $\frac{P_{Y\mid X}}{Q_{Y\mid X}}$? The usual definition I know only uses probability distributions. Could someone give (or point me to) a rigorous definition?
Question 2: How is the "disintegration" property $\frac{P_{XY}}{Q_{XY}}=\frac{P_{Y\mid X}}{Q_{Y\mid X}}\cdot\frac{P_{X}}{Q_{X}}$ justified? Could someone give an explanation of this, please?
I do not assume discrete spaces or the existence of any densities, but am especially interested in the case of distributions on general measurable spaces.