Distributivity is inherent in the definitions of rings, boolean algebras, etc.
But why distributivity specifically?
Let me qualify that question a bit and try to get to the bottom of what I mean by "why". Obviously given a set $A$ with two binary operations $\ast,\circ :A\times A\rightarrow A$, if we want to study both $\ast$ and $\circ$ together with the hopes that will complement each other, then we will need some kind of relation between them; distributivity clearly fills this role.
But why not another property? For example, the property that $$(x\ast y)\circ (z\ast w)= (x\circ y)\ast (z\circ w)$$
Now, I already have some motivating factors for why distributivity instead of something else:
- Many of the most important structures that we deal with have distributivity holding between the relevant operations, whether it be fields like $\mathbb{Q},\mathbb{R},\mathbb{C}$, or some rings like $\mathbb{Z}$ or $\operatorname{End}(G)$ (where $G$ is an abelian group).
- It is extremely simple while not being symmetric (the non-symmetry of the definition is important, as it helps to distinguish between the two operations, and makes the cases in which we do have a symmetry, like in lattices, all the more important and indicative of important ideas, like duality)
- Geometrically, looking at the case of addition and multiplication in, say, $\mathbb{R}$, we can interpret distributivity as the additivity of area, e.g. given a rectangle $R$ with side lengths $a$ and $b$, if $R_1$ and $R_2$ are subrectangles partitioning $R$ with side lengths $a,c$ and $a,d$, then we would expect the area of $R$ to be equal to the sum of the areas of $R_1$ and $R_2$. I.e. $a(c+d)=ac+ad$.
I suppose I'm hoping for a more algebraic motivation that does not depend (largely) on any of the above arguments; I just don't find that they are "enough" of a motivation without already having worked through something like ring or field theory where distributivity is indispensable.
For example, I find a nice motivation for the associativity of a binary operation $\ast:A^2\rightarrow A$ as representing that $\ast$ is, when regarded as $A$ acting on itself by right multiplication through $\ast$, compatible with itself (e.g. given a group $G$, there is the group action $G\times G \rightarrow G$ given by $(g,h)=g\ast h$, and the fact that we have compatibility of this action is exactly the fact that $\ast$ is associative). Similarly, commutativity says that there is no distinction between $A$ acting on itself on the right or left by $\ast$, it simply acts.
(It is worth noting, I think, that the commutative diagram characterizing distributivity is quite large and ugly, unlike for associativity, which is a nice commutative square, or commutativity, which is a commutative triangle. However, the same can be said for the diagram for inverses.)