Unsigned byte, Packed multiply high unsigned, Packed sum of absolute differences – Intel ARCHITECTURE IA-32 User Manual

IA-32 Intel® Architecture Optimization
Unsigned Byte
The pmaxub instruction returns the maximum of each of the eight pairs
of corresponding unsigned bytes in either two SIMD registers, or one
SIMD register and a memory location.
The pminub instruction returns the minimum of each of the eight pairs
of corresponding unsigned bytes in either two SIMD registers, or one
SIMD register and a memory location.
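The per-byte behavior of pmaxub and pminub can be illustrated with a scalar C model of the 64-bit (eight-byte) form. This is only a sketch of the semantics, not how the instructions are issued in practice; the helper names model_pmaxub and model_pminub are hypothetical, and the first argument plays the role of the destination register.

```c
#include <stdint.h>

/* Scalar model of 64-bit pmaxub (hypothetical helper, not an intrinsic):
   each destination byte becomes the larger of the two unsigned bytes. */
static void model_pmaxub(uint8_t dst[8], const uint8_t src[8]) {
    for (int i = 0; i < 8; i++) {
        if (src[i] > dst[i]) {
            dst[i] = src[i];
        }
    }
}

/* Scalar model of 64-bit pminub (hypothetical helper, not an intrinsic):
   each destination byte becomes the smaller of the two unsigned bytes. */
static void model_pminub(uint8_t dst[8], const uint8_t src[8]) {
    for (int i = 0; i < 8; i++) {
        if (src[i] < dst[i]) {
            dst[i] = src[i];
        }
    }
}
```

Because the comparison is unsigned, a byte such as 0x80 is treated as 128 and compares greater than 0x7F; the eight lanes never interact with each other.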
Packed Multiply High Unsigned
The pmulhuw and pmulhw instructions multiply the unsigned and signed
words, respectively, in the destination operand with the corresponding
words in the source operand. The high-order 16 bits of each 32-bit
intermediate result are written to the destination operand.
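As a rough illustration of "high-order 16 bits of the 32-bit intermediate result", the following scalar C sketch models the four-word (64-bit) form of both instructions. The helper names model_pmulhuw and model_pmulhw are hypothetical, and the sketch assumes the usual two's-complement wraparound when converting a 16-bit pattern back to int16_t.

```c
#include <stdint.h>

/* Scalar model of 64-bit pmulhuw (hypothetical helper): unsigned
   16x16 -> 32-bit multiply, keeping the upper 16 bits of each product. */
static void model_pmulhuw(uint16_t dst[4], const uint16_t src[4]) {
    for (int i = 0; i < 4; i++) {
        uint32_t product = (uint32_t)dst[i] * (uint32_t)src[i];
        dst[i] = (uint16_t)(product >> 16);
    }
}

/* Scalar model of 64-bit pmulhw (hypothetical helper): signed
   16x16 -> 32-bit multiply, keeping the upper 16 bits of each product. */
static void model_pmulhw(int16_t dst[4], const int16_t src[4]) {
    for (int i = 0; i < 4; i++) {
        int32_t product = (int32_t)dst[i] * (int32_t)src[i];
        /* Extract the upper half through an unsigned type so the shift
           itself is well defined; the final conversion to int16_t
           assumes two's-complement wraparound. */
        uint16_t high = (uint16_t)((uint32_t)product >> 16);
        dst[i] = (int16_t)high;
    }
}
```

The two forms differ only in how the inputs are interpreted: 0xFFFF times 0xFFFF yields a high word of 0xFFFE when treated as unsigned, but 0 when both inputs are read as the signed value -1.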
Packed Sum of Absolute Differences
The psadbw instruction (see Figure 4-9) computes the absolute value of
the difference of each pair of corresponding unsigned bytes for either
two SIMD registers, or one SIMD register and a memory location. These
differences are then summed to produce a word result in the lower
16-bit field, and the upper three words are set to zero.
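The sum-of-absolute-differences computation can be sketched in scalar C for the 64-bit form as follows. The helper name model_psadbw is hypothetical; the 64-bit return value stands in for the destination register, with the sum in the low word and the upper three words zero.

```c
#include <stdint.h>

/* Scalar model of 64-bit psadbw (hypothetical helper, not an intrinsic):
   sums |a[i] - b[i]| over eight unsigned bytes. The largest possible
   sum is 8 * 255 = 2040, so it always fits in the low 16-bit word. */
static uint64_t model_psadbw(const uint8_t a[8], const uint8_t b[8]) {
    unsigned int sum = 0;
    for (int i = 0; i < 8; i++) {
        /* Subtract the smaller byte from the larger to avoid a negative
           intermediate value. */
        unsigned int diff = (a[i] > b[i]) ? (unsigned int)(a[i] - b[i])
                                          : (unsigned int)(b[i] - a[i]);
        sum += diff;
    }
    /* Low word holds the sum; the upper three words are zero. */
    return (uint64_t)sum;
}
```

Since the sum can never exceed 2040, no saturation or overflow handling is needed for the word result.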