The design of the Serpent cipher (state of 4 32bit words) is odd in some ways and the AES submission document doesn't shed light on the oddities.
Firstly, it almost tries an ARX approach except that there is no addition present. Just linear mixing. Was ARX unknown circa 1997? Some other AES submission did have addition and even multiplication.
Then there is the oddity that it performs mixing between words in the linear layer. Why do this when the s-box is performed across the 4 words, this mixes the 4 words together. You then only need to mix bits together in each of the 32bit words.
AES Rijndael mixes bits in each 32 bit word, then transposes 8bits from each word for the next round. Another modern cipher which has something similar is Ascon, it mixes bits within each word and then applies an s-box across words.
