I'm not sure you looked at what I posted. __builtin_ia32_psadbw is right there on the list of builtins. I've used __builtin_ia32_psadbw128 in GCC myself. It compiles directly to PSADW instructions. Perhaps you confused what I was talking about with GCC's auto-vectorization?
Those aren't gcc vectors, those are intrinsics. Vectors use something like this:
Those aren't gcc vectors, those are intrinsics. Vectors use something like this:
__builtin functions that act on __m128 values are separate from "GCC vector instructions".