[PATCH 3/3] ntdll: Implement RtlU(short|long)ByteSwap() using fastcall bits.