Here is my motivation path.
Step 1: Classify all the groups up to isormorphism, or at least the interesting ones.
Step 2: With the Jordan-Holder theorem, we want to classify all the groups with a composition series, and it can be reduced to the classification of simple groups and the classification of extensions of a simple group by a group.
Step 3: Classify all the extensions of groups up to isomorphism, or at least the interesting ones or simple ones.
Step 3.1: Classify all the left split exact sequence, done, they are the direct products, easy.
Step 3.2: Analyse (then construct and classify) the right split exact sequence
$$1\to N\to G\to K\to 1$$
Using the section of $G\to K$ we may view $K\leq G$. It can be shown that $N\times K\to G,(n,k)\mapsto nk$ is a bijection of sets. This suggest that every right split extension of $K$ by $N$ is a group with the underlying set $N\times K$ and we just need to find a suitable group structure.
We want to define a group structure on $N\times K$ via this bijection while the group operation is as independent with $G$ as possible, or the group operation should be mostly using $*_N$ and $*_K$. With this goal, we expect that the group operation looks like $n_1k_1 n_2k_2\to n_1 n_2k_1k_2$, or equivalently $k_1n_2\to n_2k_1$ (by removing the front letter $n_1$ and the last letter $k_2$).
In groups, we can always make any word to look like any other word (e.g. $a_1a_2a_3\to b_1b_2b_3$) by simply puting all the twisted part into one of the letter (e.g. $a_1a_2a_3=b_1(b_1^{-1}a_1a_2a_3b_3^{-1})b_3=:b_1b_2^\prime b_3$) or into the gap of letters (e.g. $a_1a_2a_3=b_1(b_1^{-1}a_1a_2a_3b_3^{-1}b_2^{-1})b_2b_3=:b_1\beta b_2b_3$).
Naturally we want to keep the number of letters the same (we view the twisted part as one letter) so that checking associativity is a little bit easier. To process $k_1n_2\to n_2k_1$, we can force the twisted part into $n_2$ and obtain $k_1n_2=(k_1n_2k_1^{-1})k_1$; or into $k_1$ and obtain $k_1n_2=n_2(n_2^{-1}k_1n_2)$.
But $k_1n_2k_1^{-1}\in N$ always holds while $n_2^{-1}k_1n_2$ may not. So we go with the first one, i.e. $n_1k_1n_2k_2=n_1(k_1n_2k_1^{-1})k_1k_2$. We need to try to explain the result $k_1n_2k_1^{-1}$ without using the group operation on $G$, a natrual idea of constructing a homomorphism $\phi:K\to \mathrm{Aut}(N),k\mapsto \phi_k$ arises. Then we naturally experiment the binary operation on $N\times K$ s.t.
$$(n_1,k_1)*(n_2,k_2)=(n_1\phi_{k_2}(n_2),k_1k_2)$$
and we found out it is indeed a group operation, and it makes $N\times K$ (note denoted by $N\rtimes_{\phi}K$) a right split extension of $K$ by $N$. Actually every right split extension is of this form.