Actually, for that matter, shouldn't "j < width" be an assertion?
No, because in some places we deliberately pass a map_writemask with more than "width" bits set, usually `VKD3DSP_WRITEMASK_ALL`.
Fair enough. [To be pedantic, we don't do that for sm4_src_from_constant_value(), but the precedent is set by sm4_src_from_node().]
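
For anyone reading along later, here is a minimal standalone sketch of why the check has to stay a runtime condition. It is not the actual vkd3d code: `sketch_map_values()` and `SKETCH_WRITEMASK_ALL` are hypothetical names, and I'm assuming the "all" mask is the four component bits (0xf).

```c
#include <assert.h>
#include <stdint.h>

/* Stand-in for VKD3DSP_WRITEMASK_ALL; the value 0xf (four component bits)
 * is an assumption made for this sketch. */
#define SKETCH_WRITEMASK_ALL 0xfu

/* Hypothetical helper: copy up to `width` source values into the destination
 * slots selected by map_writemask, zero the rest, and return how many source
 * values were consumed. */
static unsigned int sketch_map_values(uint32_t dst[4], const uint32_t *src,
        unsigned int width, unsigned int map_writemask)
{
    unsigned int i, j = 0;

    /* This one can be an assertion: a width above 4 is a caller bug. */
    assert(width <= 4);

    for (i = 0; i < 4; ++i)
    {
        /* "j < width" is a runtime check, not an assertion, because the mask
         * may legitimately have more than `width` bits set. */
        if ((map_writemask & (1u << i)) && j < width)
            dst[i] = src[j++];
        else
            dst[i] = 0;
    }

    return j;
}
```

With a two-component source and `SKETCH_WRITEMASK_ALL`, the loop sees four set bits but stops consuming after two values. Turning "j < width" into an assertion would make that call abort.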