I am attempting to learn about Möbius inversion in the context of partial order theory. However, I'm hitting a bit of a mental block when it comes to understanding the Möbius function, and I'm looking for a clearer understanding of the motivation and intuition behind it.
Rota, for example, defines the function inductively as follows: $$ \mu(x,x)=1 $$ $$ \mu(x,y) = -\sum_{x\le z<y} \mu(x,z) $$ but then says "clearly $\mu$ is an inverse of $\zeta$".
Unfortunately it's far from clear to me! The definition doesn't seem to give me any intuition for what this function is, how I should expect it to behave, or even really how to do algebra with it. I've tried tabulating all the values of $\mu$ for a few small lattices, and I can verify that it is indeed an inverse of $\zeta$ in those cases, but it hasn't been very enlightening - I haven't been able to discern any meaning in the numbers it assigns.
In short, my question is, what is the Möbius function? How should I think about what it's doing, and how can I see its fundamental properties?
A note about my background: I'm working on applications in probability theory and information theory and I have zero knowledge of number theory --- so motivations and analogies from that direction won't help me, unless they can be explained starting from a novice level.
Edit: it's now clear to me that $\mu$ is indeed an inverse of $\zeta$. It helped to realise that for finite posets we can write these functions as matrices, in which case convolution is matrix multiplication and $\mu$ is the matrix inverse of $\zeta$. However, I'm still looking for a good explanation of what $\mu$ "really is", other than a convenient algebraic tool.
I suspect this question has an answer, because if my poset is a family of sets with the partial order relation being set inclusion, then $\mu$ seems to be saying something about which things you have to subtract off to avoid double-counting. (i.e. the inclusion-exclusion principle.) It's this intuition that I'd like to get a firmer handle on, particularly when it comes to general posets where the order relation is not set inclusion.