Interesting essay there.
One thing people often overlook in the endian debate is the extremely common checksum algorithm known as CRC (cyclic redundancy check). It is present in almost every data communication protocol made during the last 30 years, and it is generally considered a good algorithm for error detection.
It has one problem, however, and that is execution time. In software you have to step through the whole message bit by bit, shifting and XORing as you go, and that takes quite some time. To solve this, you can put the whole CRC calculation in hardware. This is rather easily achieved on any serial bus with a clock: you just need a shift register and a few XOR gates. In order for this to work, the checksum must be stored big endian (most significant byte first), matching the order in which the bits pass through the shift register.
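To make the byte-order point concrete, here is a rough Python sketch of the bit-by-bit loop that the hardware shift register replaces. The choice of CRC-16/CCITT-FALSE (polynomial 0x1021, initial value 0xFFFF, most significant bit first), the function name crc16_ccitt and the test message are my own assumptions for illustration, not something any particular protocol mandates. For this kind of MSB-first CRC, a receiver that runs the same loop over the message plus the appended checksum ends up with zero only if the checksum was appended big endian.

```python
def crc16_ccitt(data: bytes, crc: int = 0xFFFF) -> int:
    """Bitwise CRC-16/CCITT-FALSE (assumed variant): poly 0x1021, MSB first, no final XOR."""
    for byte in data:
        crc ^= byte << 8                  # feed the next byte into the top of the register
        for _ in range(8):                # one shift per bit, as the hardware would do per clock
            if crc & 0x8000:              # top bit set: shift and XOR in the polynomial
                crc = ((crc << 1) ^ 0x1021) & 0xFFFF
            else:                         # top bit clear: plain shift
                crc = (crc << 1) & 0xFFFF
    return crc


message = b"123456789"                    # arbitrary example payload
crc = crc16_ccitt(message)

# Sender appends the checksum in each byte order.
frame_big = message + crc.to_bytes(2, "big")
frame_little = message + crc.to_bytes(2, "little")

# Receiver runs the same CRC over message + checksum.
residue_big = crc16_ccitt(frame_big)        # zero: frame accepted
residue_little = crc16_ccitt(frame_little)  # non-zero: looks like a corrupted frame

print(hex(crc), hex(residue_big), hex(residue_little))
```

With the big endian frame the receiver only has to compare the final register contents against zero, which is a trivial check in logic. Reflected (LSB-first) CRC variants behave differently, so treat this as an illustration of the MSB-first case only.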
This is one reason why big endian should be used, and it is based on practical usage rather than "religious belief".