I am studying distribution theory using Walter Rudin's Functional Analysis and although I seem to have overcome most of the technical details, there is an aspect I cannot fully understand. It is related to the topology of the test funcion space $\mathcal{D}(\Omega)$.
When equipping the space with a topology, Rudin insists on the importance of having the following condition satisfied:
If a sequence $(\phi_k)$ converges in $\mathcal{D}(\Omega)$, then there is a compact set $K \subset \Omega$ such that $\text{supp } \phi_k \subset K$ for all $k\in \mathbb{N}$
As far as I understand, it is the very condition Schwartz imposed when developing the theory; in fact, I have read that he did not even specify at topology at first, but only focused on assuring that bounded sets satisfied this condition.
In fact, Rudin starts with the topology generated by the following norms
$$ \lVert\phi \rVert_N = \{ \text{max}|D^\alpha \phi|: x\in \Omega, |\alpha|\leq N \} $$
but finds and easy counterexample that proves that Cauchy Sequences in this topology are not convergent. In case it is relevant, the example is the following: let $\Omega = \mathbb{R}$, let $\varphi$ be a test function whose support lies in $(0,1)$ and
$$ \phi_k = \sum_{i=0}^k \frac{\varphi(x-i)}{i+1} $$
He then argues that it is necessary to add more open sets (i.e., find a strictly finer topology) such that the above mentioned condition holds.
My question is: Is there a fundamental reason why this boundedness condition is needed? Why are we so interested in having a complete test function space (I understand that it is desirable to have a complete distribution space, but I cannot see why it is a requirement in the test function space)? In the following question,
Motivation for test function topologies
someone argues that this condition is necessary to verify that distribution derivative is a local operator, but it does not seem to make a lot of sense to me. In fact, this finer topology we work with renders the distribution space larger rather than smaller, so everything that works for the distribution space should work for the dual space that would result if we used this coarser topology.
Whether this is the answer or not, it is an answer of this type what I am looking for. Thanks in advance!