[PATCH v3 4/5] ntdll: Add sys_membarrier-based fast path to NtFlushProcessWriteBuffers.