This looks generally good but I felt that it makes the transform test framework too complicated when it already is quite a bit. I've made various changes and opened https://gitlab.winehq.org/wine/wine/-/merge_requests/6220 instead, using MFCreateMediaBufferFromMediaType instead to create 2D buffers and simplify the tests.