In a comment on Math.SE, I came today across the remark that the expression $\delta(f(x))$ has no meaning. Being a physicist, I have been surprised but I then remembered Schwartz distribution theory that only defines $\delta$ using the bracket $\langle \delta,\,\varphi\rangle=\varphi(0)$ for a "test-function" $\varphi$ (infinitely differentiable). In physics, we often use $\delta$ functions in order to express constraints without applying to a test-function.
On Math.SE, I found only one question related to calculus using $\delta$'s, but the references in there are actually not addressing the composition of $\delta$ with a function...
However, things seem pretty natural. There are only two simple and acceptable rules to state : extend the translation rule valid for regular functions $f$ $$\langle f,\,T_a\varphi\rangle=\int f(x)\varphi(x+a)\mathrm d x=\int f(x'-a)\varphi(x')\mathrm dx'=\langle T_{-a}f,\,\varphi\rangle$$ to $\delta$, and do the same for the dilation rule $$\langle f,\,D_\lambda\varphi\rangle=\int f(x)\varphi(\lambda x)\mathrm d x=\frac{1}{|\lambda|}\int f\left(-\frac{x'}{|\lambda|}\right)\varphi(x')\mathrm dx'=\frac{1}{|\lambda|}\langle D_{1/|\lambda|}f,\,\varphi\rangle.$$
Considering these points, it is well founded I think to write an expression like $\delta(ax+b)$ for $a\neq0$ and define it by $$\int\delta(ax+b)\varphi(x)\mathrm dx=\frac{1}{|a|}\varphi\left(-\frac ba\right).\tag1$$ This rule (1) is applied in several physics papers, but seems to be absent from the mathematics textbooks. It is sometimes very useful, even if one needs to be careful. So here comes at last my question : are there any mathematical textbooks addressing these points ? If yes which ones and if not, why ?
Notee added in edit The implication of being able to write $\delta(ax+b)$ is that one can write expressions like, for instance, $$\int\delta(x_1-x_2\xi)\delta(y_1-y_2\xi)\mathrm d\xi=\delta\left(\left\lvert\begin{smallmatrix}x_1&x_2\\y_1&y_2\end{smallmatrix}\right\rvert\right), \quad (y_1,\,y_2\neq0)$$ that encodes the colinearity constraint for vectors in $\mathbb R^2$.