Lars Hormander, The Analysis of Linear Partial Differential Operators I, chapter II:
Let $X$ be an open set in $\mathbb {R}^n$. A distribution $u$ in $X$ is a linear form of $C_0^\infty(X)$ such that for every compact $K \subset X$ there exist constants $C$ and $k$ such that $$ |u(\phi)| \leq C \sum_{|\alpha| \leq k } {\rm sup} |\partial ^{\alpha} \phi|, \hspace{1cm} \forall \phi \in C_0^\infty (K) $$
My question is. How does this definition even come to mind? Is there a reason to define a distribution like that?