In case the design appears a little strange, it is. The original 68000 instruction set was more symmetric. The 68000 CPU could perform the same operations on bytes, words or longs. Almost all of the arithmetic operations had byte, word and long forms, and operated on both registers and memory. So it was possible to write code where all variables (in memory or in registers) are treated as being of a particular length, and just use the right forms. You could even perform a signed addition of a byte from memory to memory in one instruction (with four extension words if the addressing modes required) and get the correct condition codes for that operation.
The Coldfire simplified the instruction set. All arithmetic operations are now 32-bit, except for CLR, CMP and the bit ops. The shifts went from B/W/L to Long only. The useful Rotates (you could do a lot with B/W/L rotates with and without the X bit) were removed. A lot of the arithmetic operations now only have register forms.
You can't add a byte from memory. You have to load it, then add. You could add 32-bit words from memory that included the byte-of-interest as the low byte to a register, and then store that byte back, but the condition codes would be wrong. It is unlikely a compiler would do anything like that either. What the compiler does do when you need to test condition codes (just checked) is "moveb %a2@+,%d1, addl %d1,%d0, tstb %d0". The advantage of RISC is that those extra instructions only take one clock cycle.
Tom